LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection

Jingyu Song,Lingjun Zhao,Katherine A. Skinner
2024-02-19
Abstract:We propose LiRaFusion to tackle LiDAR-radar fusion for 3D object detection to fill the performance gap of existing LiDAR-radar detectors. To improve the feature extraction capabilities from these two modalities, we design an early fusion module for joint voxel feature encoding, and a middle fusion module to adaptively fuse feature maps via a gated network. We perform extensive evaluation on nuScenes to demonstrate that LiRaFusion leverages the complementary information of LiDAR and radar effectively and achieves notable improvement over existing methods.
Computer Vision and Pattern Recognition,Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the insufficient performance of existing LiDAR (Light Detection and Ranging) and radar fusion methods in 3D object detection in autonomous vehicles. Specifically, the existing LiDAR and radar fusion methods have the following problems: 1. **Performance Gap**: Existing LiDAR and radar fusion methods perform worse than using LiDAR alone or the fusion method of LiDAR and camera in some cases. This is mainly due to the sparsity and noise problems of radar data, resulting in poor fusion effects. 2. **Lack of Adaptability**: The existing fusion methods lack the ability to adaptively fuse features of different modalities and cannot fully utilize the complementary information of LiDAR and radar. For example, radar performs better at long distances and in bad weather conditions, while LiDAR has more advantages in close - range and high - resolution aspects. 3. **Hard Constraints**: Some existing fusion methods need to impose hard constraints, such as limiting the detection category or detection range, to improve performance, which limits the flexibility and extensiveness of their practical applications. To solve these problems, the paper proposes a new method named LiRaFusion, aiming to improve the fusion of LiDAR and radar in the following ways: - **Early Fusion Module**: An early fusion module is designed to jointly encode the voxel features of LiDAR and radar, so as to extract features more effectively. - **Adaptive Intermediate Fusion Module**: An adaptive gating network is introduced, which can adaptively fuse feature maps, especially for improvement in the Bird - Eye - View (BEV) feature space. - **Extensive Evaluation**: Extensive evaluation has been carried out on the nuScenes dataset, proving that LiRaFusion can effectively utilize the complementary information of LiDAR and radar and significantly improve the performance of existing methods. In general, the goal of this paper is to fill the performance gap of existing LiDAR and radar fusion methods in 3D object detection by designing a new fusion architecture and improve the perception ability of autonomous vehicles in various scenarios and environmental conditions.