Vehicle 3D Localization in Road Scenes via a Monocular Moving Camera

Yanting Zhang, Aotian Zheng, Ke Han, Yizhou Wang, Jenq-Neng Hwang
DOI: https://doi.org/10.1109/icassp39728.2021.9413487
2021-01-01
Abstract: Knowing the 3D locations of surrounding vehicles is vital in autonomous driving scenarios, yet estimating them accurately from a monocular moving camera is challenging. In this paper, we present an effective vehicle 3D localization method that uses 2D key-points predicted by a trained CNN to model the vehicles’ structure, from which the ground points are then inferred. An adaptive ground plane estimation method is used with the monocular camera for 3D geometric back-projection. Benefiting from tracking, we also take into account temporal information about the same object to ensure trajectory consistency. Viewpoint and size knowledge are also considered for refinement. Evaluation on the KITTI benchmark for on-road vehicles shows the effectiveness of the proposed approach, with promising 3D localization results.
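The geometric back-projection step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes an ideal pinhole camera (no skew or lens distortion), a ground plane already expressed in camera coordinates as normal · X + offset = 0, and a hypothetical helper name, `backproject_to_ground`. The intrinsics and plane values in the usage lines are made-up numbers for illustration only.

```python
from typing import Optional, Tuple

Vec3 = Tuple[float, float, float]


def backproject_to_ground(
    u: float,
    v: float,
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    normal: Vec3,
    offset: float,
) -> Optional[Vec3]:
    """Intersect the viewing ray of pixel (u, v) with a ground plane.

    Hypothetical helper. The plane is given in camera coordinates as
    normal . X + offset = 0, and the camera is an ideal pinhole.
    Returns the 3D point in camera coordinates, or None when the ray
    never reaches the plane in front of the camera.
    """
    # Direction of the viewing ray through the pixel, scaled so that z = 1.
    ray = ((u - cx) / fx, (v - cy) / fy, 1.0)

    # Points on the ray are t * ray; substitute into the plane equation.
    denom = sum(n * r for n, r in zip(normal, ray))
    if abs(denom) < 1e-9:
        return None  # ray is parallel to the ground plane

    t = -offset / denom
    if t <= 0:
        return None  # intersection lies behind the camera (e.g. pixel above the horizon)

    return (t * ray[0], t * ray[1], t * ray[2])


# Illustrative values only: camera 1.5 m above a flat road, y axis pointing down,
# so the ground plane is y = 1.5, i.e. normal = (0, 1, 0) and offset = -1.5.
ground_point = backproject_to_ground(
    u=500.0, v=400.0, fx=1000.0, fy=1000.0, cx=500.0, cy=300.0,
    normal=(0.0, 1.0, 0.0), offset=-1.5,
)
```

With these assumed values the ground point lies straight ahead of the camera, roughly 15 m away. In practice the plane parameters would come from the ground plane estimation step, and the pixel from an inferred vehicle ground point.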