KPV3D: Enhancing Multi-View 3D Vehicle Detection with 2D Keypoint Priors

Ziying Yao,Zhongxia Xiong,Xuan Liu,Xinkai Wu
DOI: https://doi.org/10.1109/tim.2024.3443353
IF: 5.6
2024-01-01
IEEE Transactions on Instrumentation and Measurement
Abstract:3D detection from multi-view images has advanced significantly with its cost-effective deployment in autonomous driving. Present approaches construct 3D representations from multi-view images or employ object queries distributed in 3D space to achieve object localization, but relatively few methods incorporate more explicit image priors as guidance. Hence, there is still untapped potential for discovering significant information from more explicit and richer 2D clues. Focusing on important perception categories in autonomous driving, we aim to further improve vehicle detection performance and propose KPV3D, which leverages 2D bounding box and vehicle keypoint information to construct spatial feature correlation. Based on high-quality 2D object and vehicle keypoint priors, KPV3D generates directional and pivotal semantic information for adaptive 3D queries, and using a point-global feature aggregation module for effective feature capture. Additionally, a reprojection loss is proposed to further incorporate keypoint priors for refining 3D prediction results. Experiments on public dataset nuScenes are conducted to verify the effectiveness of the proposed method.
What problem does this paper attempt to address?