Enhanced 3D Object Detection Using 4D Radar and Vision Fusion with Segmentation Assistance

Xuemei Chen, Yaohan Jia, Pengfei Ren, Zeyuan Xu, Wenzhe Shan
DOI: https://doi.org/10.21203/rs.3.rs-5358941/v1
2024-01-01
Abstract: 4D radar is robust to complex lighting and adverse weather conditions, and its data characteristics differ from those of LiDAR for 3D object detection. However, the sparsity of 4D radar point clouds limits the performance of most 3D object detection algorithms. To address this, this paper proposes a 3D object detection model based on fine-grained point cloud segmentation. Our approach first enriches the point cloud data using a radar reference point module to compensate for its sparsity. The point cloud is then pillarized, and semantic information is extracted through a simple segmentation network. Finally, 3D object detection is achieved by fusing point cloud features and semantic information using an attention mechanism. Extensive experiments conducted on the VoD dataset demonstrate that our model achieves a mean average precision (mAP) that is 5% higher than the baseline on the validation set, with notable improvements of 4% for bicycles and 8% for pedestrians. These results narrow the performance gap with LiDAR-based models, highlighting the effectiveness of our segmentation-assisted detection approach. The source code is released at https://github.com/Huniki/RVASANET.git
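The pillarization step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `pillarize`, the grid ranges, and the pillar size are hypothetical choices made for illustration, and the sketch only shows the general idea of grouping points into vertical columns on an x-y grid.

```python
import math
from collections import defaultdict


def pillarize(points, x_range, y_range, pillar_size):
    """Group points into vertical pillars on an x-y grid.

    points: iterable of tuples whose first two entries are x and y
            (any further entries, e.g. z or Doppler velocity, are kept as-is).
    x_range, y_range: (min, max) extents of the grid; hypothetical values.
    pillar_size: edge length of a square pillar; hypothetical value.
    Returns a dict mapping (ix, iy) grid indices to the list of points inside.
    """
    pillars = defaultdict(list)
    for p in points:
        x, y = p[0], p[1]
        # Drop points that fall outside the grid extents.
        if not (x_range[0] <= x < x_range[1] and y_range[0] <= y < y_range[1]):
            continue
        ix = math.floor((x - x_range[0]) / pillar_size)
        iy = math.floor((y - y_range[0]) / pillar_size)
        pillars[(ix, iy)].append(p)
    return dict(pillars)


if __name__ == "__main__":
    # Four toy points as (x, y, z, extra feature); the last lies outside the grid.
    demo = [
        (0.2, -4.5, 0.0, 1.0),
        (0.7, -4.1, 0.3, 2.0),
        (3.5, 2.5, 0.1, 0.5),
        (12.0, 0.0, 0.0, 0.0),
    ]
    for index, members in pillarize(demo, (0.0, 10.0), (-5.0, 5.0), 1.0).items():
        print(index, len(members))
```

In a pillar-based detector, each non-empty pillar would typically be encoded into a fixed-length feature vector before being passed to a 2D backbone; with sparse radar data, many pillars hold only one or two points, which is the sparsity the abstract refers to.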