TSC-BEV: Temporal-Spatial Feature Consistency 3D Object Detection

JingYun Fang, Mingxi Zhuang, Guangming Wang, Hesheng Wang
DOI: https://doi.org/10.1109/cac59555.2023.10451172
2023-01-01
Abstract: Multi-view 3D object detection plays an important role in autonomous driving. Many methods include temporal fusion modules that integrate BEV (Bird's Eye View) features from different time steps. However, they seldom consider how to guide the parameter updates of the temporal fusion module to achieve better performance. In this paper, we propose two losses: the Temporal Feature Consistency Loss and the Spatial Feature Consistency Loss. The Temporal Feature Consistency Loss emphasizes the relationship between features of the same obstacle at different time steps, and aims to keep the representation of an obstacle consistent across consecutive frames. The Spatial Feature Consistency Loss focuses on the relationship among all obstacles of the same category, and enables the model to learn a shared feature representation for obstacles belonging to that category. Our method is evaluated on the nuScenes 3D object detection benchmark, where it achieves 52.1% NDS and improves on the baseline's average translation error by 2 points.
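The abstract does not give the exact form of the two losses, so the following is only a minimal sketch of the idea under stated assumptions. It assumes each obstacle is summarized by a single feature vector pooled from the BEV map, that obstacles are already matched across two consecutive frames, and that both losses are squared L2 distances. The function names `temporal_consistency_loss` and `spatial_consistency_loss` are hypothetical and are not taken from the paper.

```python
import numpy as np


def temporal_consistency_loss(feats_prev, feats_curr):
    """Assumed form: mean squared L2 distance between matched obstacle features.

    feats_prev, feats_curr: arrays of shape (N, C); row i in both arrays is
    assumed to describe the same obstacle in two consecutive frames.
    """
    feats_prev = np.asarray(feats_prev, dtype=float)
    feats_curr = np.asarray(feats_curr, dtype=float)
    if feats_prev.shape != feats_curr.shape:
        raise ValueError("matched feature arrays must have the same shape")
    if feats_prev.shape[0] == 0:
        return 0.0
    # Squared distance per obstacle, averaged over obstacles.
    return float(np.mean(np.sum((feats_curr - feats_prev) ** 2, axis=1)))


def spatial_consistency_loss(feats, labels):
    """Assumed form: mean squared L2 distance of each obstacle feature to the
    mean feature of its category.

    feats: array of shape (N, C); labels: array of shape (N,) with category ids.
    """
    feats = np.asarray(feats, dtype=float)
    labels = np.asarray(labels)
    if feats.shape[0] == 0:
        return 0.0
    total = 0.0
    for category in np.unique(labels):
        group = feats[labels == category]
        center = group.mean(axis=0)  # per-category mean feature
        total += np.sum((group - center) ** 2)
    return float(total / feats.shape[0])


if __name__ == "__main__":
    prev = [[1.0, 0.0], [0.0, 1.0]]
    curr = [[1.0, 0.0], [0.0, 3.0]]
    print(temporal_consistency_loss(prev, curr))

    feats = [[0.0, 0.0], [2.0, 0.0], [5.0, 5.0]]
    labels = [0, 0, 1]
    print(spatial_consistency_loss(feats, labels))
```

In a training setup these two terms would presumably be added to the detection loss with some weighting. The weights, the feature pooling, and the cross-frame matching are not specified in the abstract and are left out here.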