Abstract:We propose LiRaFusion to tackle LiDAR-radar fusion for 3D object detection to fill the performance gap of existing LiDAR-radar detectors. To improve the feature extraction capabilities from these two modalities, we design an early fusion module for joint voxel feature encoding, and a middle fusion module to adaptively fuse feature maps via a gated network. We perform extensive evaluation on nuScenes to demonstrate that LiRaFusion leverages the complementary information of LiDAR and radar effectively and achieves notable improvement over existing methods.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the insufficient performance of existing LiDAR (Light Detection and Ranging) and radar fusion methods in 3D object detection in autonomous vehicles. Specifically, the existing LiDAR and radar fusion methods have the following problems: 1. **Performance Gap**: Existing LiDAR and radar fusion methods perform worse than using LiDAR alone or the fusion method of LiDAR and camera in some cases. This is mainly due to the sparsity and noise problems of radar data, resulting in poor fusion effects. 2. **Lack of Adaptability**: The existing fusion methods lack the ability to adaptively fuse features of different modalities and cannot fully utilize the complementary information of LiDAR and radar. For example, radar performs better at long distances and in bad weather conditions, while LiDAR has more advantages in close - range and high - resolution aspects. 3. **Hard Constraints**: Some existing fusion methods need to impose hard constraints, such as limiting the detection category or detection range, to improve performance, which limits the flexibility and extensiveness of their practical applications. To solve these problems, the paper proposes a new method named LiRaFusion, aiming to improve the fusion of LiDAR and radar in the following ways: - **Early Fusion Module**: An early fusion module is designed to jointly encode the voxel features of LiDAR and radar, so as to extract features more effectively. - **Adaptive Intermediate Fusion Module**: An adaptive gating network is introduced, which can adaptively fuse feature maps, especially for improvement in the Bird - Eye - View (BEV) feature space. - **Extensive Evaluation**: Extensive evaluation has been carried out on the nuScenes dataset, proving that LiRaFusion can effectively utilize the complementary information of LiDAR and radar and significantly improve the performance of existing methods. In general, the goal of this paper is to fill the performance gap of existing LiDAR and radar fusion methods in 3D object detection by designing a new fusion architecture and improve the perception ability of autonomous vehicles in various scenarios and environmental conditions.

LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection

Fusing LiDAR and Radar with Pillars Attention for 3D Object Detection

Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection

InterFusion: Interaction-based 4D Radar and LiDAR Fusion for 3D Object Detection

SparseFusion3D: Sparse Sensor Fusion for 3D object detection by Radar and Camera in Environmental Perception

RI-Fusion: 3D Object Detection Using Enhanced Point Features With Range-Image Fusion for Autonomous Driving.

L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection

RCM-Fusion: Radar-Camera Multi-Level Fusion for 3D Object Detection

Deep LiDAR-Radar-Visual Fusion for Object Detection in Urban Environments

Radar Voxel Fusion for 3D Object Detection

Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection

DLFusion: Painting-Depth Augmenting-LiDAR for Multimodal Fusion 3D Object Detection

Radar and Camera Fusion for Multi-Task Sensing in Autonomous Driving

FS-Net: LiDAR-Camera Fusion With Matched Scale for 3D Object Detection in Autonomous Driving

Multi-Modal and Multi-Scale Fusion 3D Object Detection of 4D Radar and LiDAR for Autonomous Driving

${\mathsf{EZFusion}}$: A Close Look at the Integration of LiDAR, Millimeter-Wave Radar, and Camera for Accurate 3D Object Detection and Tracking

LEF: Late-to-Early Temporal Fusion for LiDAR 3D Object Detection

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

RGB-LiDAR fusion for accurate 2D and 3D object detection

BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection

GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D Object Detection