Abstract: As one of the automotive sensors that have emerged in recent years, 4D millimeter-wave radar offers higher resolution than conventional 3D radar and provides precise elevation measurements. However, its point clouds are still sparse and noisy, which makes it difficult to meet the requirements of autonomous driving. Cameras, another commonly used sensor, capture rich semantic information. The fusion of 4D radar and camera can therefore provide an affordable and robust perception solution for autonomous driving systems. Radar-camera fusion has not yet been thoroughly investigated, however, and a large performance gap remains relative to LiDAR-based methods. Specifically, previous methods ignore the feature-blurring problem and do not deeply interact with image semantic information. To this end, we present a simple but effective multi-stage sampling fusion (MSSF) network based on 4D radar and camera. On the one hand, we design a fusion block that lets point cloud features interact deeply with image features and that can be applied to commonly used single-modal backbones in a plug-and-play manner. The fusion block comes in two types: simple feature fusion (SFF) and multiscale deformable feature fusion (MSDFF). The SFF is easy to implement, while the MSDFF has stronger fusion abilities. On the other hand, we propose a semantic-guided head that performs foreground-background segmentation on voxels and re-weights the voxel features, further alleviating the feature-blurring problem. Extensive experiments on the View-of-Delft (VoD) and TJ4DRadSet datasets demonstrate the effectiveness of MSSF. Notably, compared to state-of-the-art methods, MSSF improves 3D mean average precision by 7.0% and 4.0% on the VoD and TJ4DRadSet datasets, respectively. It even surpasses classical LiDAR-based methods on the VoD dataset.
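
The voxel feature re-weighting idea mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes a single linear layer as the foreground-background classifier and a plain multiplication of each voxel feature by its predicted foreground probability. The names `reweight_voxels`, `w`, and `b` are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def reweight_voxels(features, w, b):
    """Scale each voxel feature by a predicted foreground probability.

    features: (N, C) array, one feature vector per non-empty voxel.
    w, b:     weights (C,) and bias of a hypothetical linear
              foreground-background classifier (an assumption, not
              taken from the paper).
    Returns the per-voxel scores (N,) and the re-weighted features (N, C).
    """
    logits = features @ w + b                 # (N,) one logit per voxel
    scores = sigmoid(logits)                  # (N,) values in (0, 1)
    reweighted = features * scores[:, None]   # broadcast over channels
    return scores, reweighted


# Toy input: 5 voxels with 4 feature channels each.
rng = np.random.default_rng(0)
voxel_features = rng.normal(size=(5, 4))
w = rng.normal(size=4)

scores, reweighted = reweight_voxels(voxel_features, w, 0.0)
```

In this sketch, voxels that the classifier scores as background are scaled toward zero, so they contribute less to later detection stages. In a trained network the scores would be supervised with foreground-background labels, which is omitted here.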

TL-4DRCF: A Two-Level 4D Radar-Camera Fusion Method for Object Detection in Adverse Weather

Fusing LiDAR and Radar with Pillars Attention for 3D Object Detection

L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection

MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving

Multi-Modal and Multi-Scale Fusion 3D Object Detection of 4D Radar and LiDAR for Autonomous Driving

Deep LiDAR-Radar-Visual Fusion for Object Detection in Urban Environments

Camera-Radar Fusion with Radar Channel Extension and Dual-CBAM-FPN for Object Detection

HVDetFusion: A Simple and Robust Camera-Radar Fusion Framework

V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion

3D object detection algorithm based on multi-sensor segmental fusion of frustum association for autonomous driving

Radar Voxel Fusion for 3D Object Detection

InterFusion: Interaction-based 4D Radar and LiDAR Fusion for 3D Object Detection

Radar and Camera Fusion for Multi-Task Sensing in Autonomous Driving

TransFusion: Multi-Modal Robust Fusion for 3D Object Detection in Foggy Weather Based on Spatial Vision Transformer

Radar-Camera Sensor Fusion for Joint Object Detection and Distance Estimation in Autonomous Vehicles

ROFusion: Efficient Object Detection using Hybrid Point-wise Radar-Optical Fusion

3D Object Detection Algorithm in Adverse Weather Conditions Based on LiDAR-Radar Fusion

Radar-Lidar Fusion for Object Detection by Designing Effective Convolution Networks

DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection

SparseFusion3D: Sparse Sensor Fusion for 3D object detection by Radar and Camera in Environmental Perception

Timely Fusion of Surround Radar/Lidar for Object Detection in Autonomous Driving Systems