Anchor-Based Transformer for Temporal LiDAR 3D Object Detection

Rongqi Gu, Fei Wu, Peigen Liu, Chu Yang, Yaohan Lu, Guang Chen
DOI: https://doi.org/10.1109/icarm62033.2024.10715948
2024-01-01
Abstract: LiDAR plays an important role in autonomous driving because its point cloud data provides accurate 3D scene structure. A key challenge in 3D object detection is the sparse distribution of 3D points caused by occlusion, distant targets, or the limitations of LiDAR sensor characteristics, which leads to a high false-negative rate. In this paper, we introduce an anchor-based transformer for temporal LiDAR 3D object detection that reduces missed detections by exploiting temporal prior knowledge. Specifically, the proposed method first applies sparse convolution to the voxelized points to generate downsampled bird's-eye-view (BEV) feature representations, which are then fed into our anchor-based transformer to establish long-range attention. Next, the proposal anchors from the previous and current frames are aggregated and encoded as queries, which a deformable attention module uses to extract the features of all candidate targets. This process enhances target features weakened by sparse point distribution and thereby improves overall detection performance. On the proposed Port dataset, our method demonstrates a significant improvement over existing state-of-the-art methods, verifying the effectiveness of the proposed approach.
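The anchor aggregation and feature extraction steps described in the abstract can be illustrated with a small, hedged sketch. It is not the authors' implementation: the function names, the translation-only `ego_shift` alignment, and the fixed sampling offsets with uniform weights are all assumptions made for illustration. In a real deformable attention module the offsets and attention weights would be predicted from the query rather than fixed.

```python
from typing import List, Sequence, Tuple

Anchor = Tuple[float, float]      # (x, y) anchor centre in BEV grid coordinates
BEV = List[List[List[float]]]     # BEV feature map indexed as [row y][col x][channel]

# Hypothetical fixed sampling pattern around each anchor centre (assumption).
DEFAULT_OFFSETS = ((0.0, 0.0), (0.5, 0.0), (-0.5, 0.0), (0.0, 0.5), (0.0, -0.5))


def aggregate_anchors(prev: Sequence[Anchor], curr: Sequence[Anchor],
                      ego_shift: Tuple[float, float] = (0.0, 0.0)) -> List[Anchor]:
    """Shift previous-frame anchors into the current frame, then merge both sets.

    Assumes ego motion is a pure translation; a real system would also rotate.
    """
    dx, dy = ego_shift
    return [(x + dx, y + dy) for x, y in prev] + list(curr)


def bilinear_sample(bev: BEV, x: float, y: float) -> List[float]:
    """Bilinearly interpolate the feature vector at a continuous location (x, y)."""
    h, w = len(bev), len(bev[0])
    x = min(max(x, 0.0), w - 1.0)  # clamp the location to the map
    y = min(max(y, 0.0), h - 1.0)
    x0, y0 = int(x), int(y)
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    fx, fy = x - x0, y - y0
    return [
        (1 - fx) * (1 - fy) * a + fx * (1 - fy) * b
        + (1 - fx) * fy * c + fx * fy * d
        for a, b, c, d in zip(bev[y0][x0], bev[y0][x1], bev[y1][x0], bev[y1][x1])
    ]


def anchor_features(bev: BEV, anchors: Sequence[Anchor],
                    offsets: Sequence[Anchor] = DEFAULT_OFFSETS) -> List[List[float]]:
    """Average BEV features sampled at a few offsets around each anchor.

    Uniform weights stand in for learned attention weights (simplification).
    """
    feats = []
    for ax, ay in anchors:
        samples = [bilinear_sample(bev, ax + ox, ay + oy) for ox, oy in offsets]
        feats.append([sum(channel) / len(samples) for channel in zip(*samples)])
    return feats


# Toy 3x3 single-channel BEV map whose value at (x, y) is x + 10 * y.
bev_map = [[[float(x + 10 * y)] for x in range(3)] for y in range(3)]
queries = aggregate_anchors(prev=[(0.0, 1.0)], curr=[(1.5, 0.5)], ego_shift=(1.0, 0.0))
features = anchor_features(bev_map, queries)
```

Here a target detected only in the previous frame still contributes a query at its shifted location, which is how temporal priors could help recover objects that the current sparse scan would otherwise miss.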