3D pose estimation of vehicles from monocular videos using deep learning

Cheng, Qing

3D pose estimation of vehicles from monocular videos using deep learning

Files

18-cheng-MSc.pdf (10.4 MB)

Date

2018

Authors

Cheng, Qing

Abstract

In this thesis, we present a novel approach, Deep3DP, to perform 3D pose estimation of vehicles from monocular images intended for autonomous driving scenarios. A robust deep neural network is applied to simultaneously perform 3D dimension proximity estimation, 2D part localization, and 2D part visibility prediction. In the inference phase, these learned features are fed to a pose estimation algorithm to recover the 3D location, 3D orientation, and 3D dimensions of the vehicles with the help of a set of 3D vehicle models. Our approach can perform these six tasks simultaneously in real time and handle highly occluded or truncated vehicles. The experiment results show that our approach achieves state-of-the-art performance on six tasks and outperforms most of the monocular methods on the challenging KITTI benchmark.

URI

http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-119721
http://elib.uni-stuttgart.de/handle/11682/11972
http://dx.doi.org/10.18419/opus-11955

Collections

05 Fakultät Informatik, Elektrotechnik und Informationstechnik

Full item page

3D pose estimation of vehicles from monocular videos using deep learning

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By