Depth-aware Imbalance Learning for Monocular 6dof Vehicle Pose Estimation

He Liu, Huaping Liu, Fuchun Sun
DOI: https://doi.org/10.1109/cac53003.2021.9727689
2021-01-01
Abstract: Vehicle pose estimation from a monocular image is a crucial and challenging problem in computer vision and robotics. Current benchmark methods mostly combine expensive LiDAR sensors with stereo RGB images for 6-degrees-of-freedom (6DoF) pose estimation, whereas approaches based on a single RGB image suffer from dramatically lower performance. In this paper, we propose a method to reduce this gap by reformulating monocular 6DoF vehicle pose estimation as a depth-aware imbalance learning problem. We integrate classification networks with regression networks to address the imbalance. To address inaccurate depth prediction from a monocular image, we further design a depth-aware structure, which extracts more reliable location features and improves perception of the 3D surroundings. As a result, our method surpasses state-of-the-art methods, even with straightforward end-to-end training and limited, imbalanced data. Experimental results on the challenging ApolloCar3D [1] dataset show that the method significantly outperforms the state of the art, improving the standard metric by up to 37.4%.