CISF-BEV: A Complementary Interaction Sparse Fusion Network in BEV for 3D Object Detection

Qian Li, Jiayi Qin, Yujie Feng, Jie Chen
DOI: https://doi.org/10.1109/cisat62382.2024.10695404
2024-01-01
Abstract: Fusing LiDAR and camera information for 3D object detection is of increasing interest in autonomous driving. However, because point clouds are sparse, they lack much geometric position information, which hinders the fusion of LiDAR and camera data. In this paper, we propose a complementary interaction sparse fusion method in BEV (CISF-BEV) to address this problem. First, we design the Point Cloud Complementary Interaction (PCI) module, which uses dense pseudo point cloud BEV features to complement the empty areas of the point cloud BEV features. These empty areas arise where objects are occluded or too far away to be scanned by LiDAR, and filling them achieves feature complementation and interaction. Second, we propose the Sparse Sampling Fusion (SSF) module, which fuses the complementary features with pseudo point cloud features in a sparse sampling manner to efficiently assign semantic features to the point cloud, which helps generate accurate detection results. Experiments on the KITTI dataset show that CISF-BEV achieves superior 3D object detection performance, particularly on occluded and distant objects.
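
The two ideas in the abstract can be illustrated with a toy NumPy sketch. It is not the authors' implementation: the abstract does not specify how PCI or SSF are built, so the function names, the "all-zero cell means empty" rule, the top-k selection by feature magnitude, and the additive fusion are all assumptions chosen only to make the idea concrete.

```python
import numpy as np

def complement_empty_cells(lidar_bev, pseudo_bev):
    """Hypothetical stand-in for the complementation idea behind PCI.

    Both inputs are BEV feature maps of shape (C, H, W). A cell is treated
    as empty (assumption) when every LiDAR channel is zero; such cells are
    filled with the dense pseudo point cloud features.
    """
    empty = np.all(lidar_bev == 0, axis=0)                # (H, W) boolean mask
    complemented = np.where(empty[None, :, :], pseudo_bev, lidar_bev)
    return complemented, empty

def sparse_sample_fuse(complemented, pseudo_bev, k):
    """Hypothetical stand-in for the sparse sampling idea behind SSF.

    Instead of fusing at every cell, only the k cells with the largest
    pseudo-feature magnitude (assumption) receive an additive update.
    """
    _, h, w = complemented.shape
    fused = complemented.copy()
    k = min(k, h * w)
    if k <= 0:
        return fused
    energy = np.abs(pseudo_bev).sum(axis=0).ravel()       # one score per cell
    idx = np.argpartition(energy, -k)[-k:]                # indices of top-k cells
    rows, cols = np.unravel_index(idx, (h, w))
    fused[:, rows, cols] += pseudo_bev[:, rows, cols]     # fuse only sampled cells
    return fused

# Tiny example: a 2-channel, 2x2 BEV grid with one occupied LiDAR cell.
lidar = np.zeros((2, 2, 2))
lidar[:, 0, 0] = [1.0, 2.0]
pseudo = np.arange(8, dtype=float).reshape(2, 2, 2)
comp, empty = complement_empty_cells(lidar, pseudo)
fused = sparse_sample_fuse(comp, pseudo, k=1)
```

In this sketch the occupied LiDAR cell keeps its own features, the three empty cells inherit pseudo features, and only the single highest-scoring cell is touched in the fusion step, which is what keeps the fusion cost proportional to k rather than to the grid size.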