Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: http://dx.doi.org/10.18419/opus-11955
Autor(en): Cheng, Qing
Titel: 3D pose estimation of vehicles from monocular videos using deep learning
Erscheinungsdatum: 2018
Dokumentart: Abschlussarbeit (Master)
Seiten: 78
URI: http://nbn-resolving.de/urn:nbn:de:bsz:93-opus-ds-119721
http://elib.uni-stuttgart.de/handle/11682/11972
http://dx.doi.org/10.18419/opus-11955
Zusammenfassung: In this thesis, we present a novel approach, Deep3DP, to perform 3D pose estimation of vehicles from monocular images intended for autonomous driving scenarios. A robust deep neural network is applied to simultaneously perform 3D dimension proximity estimation, 2D part localization, and 2D part visibility prediction. In the inference phase, these learned features are fed to a pose estimation algorithm to recover the 3D location, 3D orientation, and 3D dimensions of the vehicles with the help of a set of 3D vehicle models. Our approach can perform these six tasks simultaneously in real time and handle highly occluded or truncated vehicles. The experiment results show that our approach achieves state-of-the-art performance on six tasks and outperforms most of the monocular methods on the challenging KITTI benchmark.
Enthalten in den Sammlungen:05 Fakultät Informatik, Elektrotechnik und Informationstechnik

Dateien zu dieser Ressource:
Datei Beschreibung GrößeFormat 
18-cheng-MSc.pdf10,65 MBAdobe PDFÖffnen/Anzeigen


Alle Ressourcen in diesem Repositorium sind urheberrechtlich geschützt.