Abstract:The need for a vehicle to perceive information about the external environmental as an independent intelligent individual has grown with the progress of intelligent driving from primary driver assistance to high-level autonomous driving. The ability of a common independent sensing unit to sense the external environment is limited by the sensor’s own characteristics and algorithm level. Hence, a common independent sensing unit fails to obtain comprehensive sensing information independently under conditions such as rain, fog, and night. Accordingly, an extended network-based fusion target detection algorithm for millimeter-wave radar and vision fusion is proposed in this work by combining the complementary perceptual performance of in-vehicle sensing elements, cost effectiveness, and maturity of independent detection technologies. Feature-level fusion is first used in this work according to the analysis of technical routes of the millimeter-wave radar and vision fusion. Training and test evaluation of the algorithm are carried out on the nuScenes dataset and test data from a homemade data acquisition platform. An extended investigation on the RetinaNet one-stage target detection algorithm based on the VGG-16+FPN backbone detection network is then conducted in this work to introduce millimeter-wave radar images as auxiliary information for visual image target detection. We use two-channel radar and three-channel visual images as inputs of the fusion network. We also propose an extended VGG-16 network applicable to millimeter-wave radar and visual fusion and an extended feature pyramid network. Test results showed that the mAP of the proposed network improves by 2.9% and the small target accuracy is enhanced by 18.73% compared with those of the reference network for pure visual image target detection. This finding verified the detection capability and algorithmic feasibility of the proposed extended fusion target detection network for visually insensitive targets.

TransFusion: Multi-Modal Robust Fusion for 3D Object Detection in Foggy Weather Based on Spatial Vision Transformer

A Fusion Method Aiming at Environmental Perception of Autonomous Vehicle Based on Visual Scheme

Fusing LiDAR and Radar with Pillars Attention for 3D Object Detection

Millimeter-Wave Radar and Vision Fusion Target Detection Algorithm Based on an Extended Network

TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers

CenterTransFuser: radar point cloud and visual information fusion for 3D object detection

3D object detection algorithm based on multi-sensor segmental fusion of frustum association for autonomous driving

L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection

3D Object Detection Algorithm in Adverse Weather Conditions Based on LiDAR-Radar Fusion

Robust-FusionNet: Deep Multimodal Sensor Fusion for 3-D Object Detection Under Severe Weather Conditions

Radar Enlighten the Dark: Enhancing Low-Visibility Perception for Automated Vehicles with Camera-Radar Fusion

DS-Trans: A 3D Object Detection Method Based on a Deformable Spatiotemporal Transformer for Autonomous Vehicles

AFTR: A Robustness Multi-Sensor Fusion Model for 3D Object Detection Based on Adaptive Fusion Transformer

TransCAR: Transformer-based Camera-And-Radar Fusion for 3D Object Detection

MVFusion: Multi-View 3D Object Detection with Semantic-aligned Radar and Camera Fusion

Multi-Task Foreground-Aware Network with Depth Completion for Enhanced RGB-D Fusion Object Detection Based on Transformer

V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion

InterFusion: Interaction-based 4D Radar and LiDAR Fusion for 3D Object Detection

FusionViT: Hierarchical 3D Object Detection via LiDAR-Camera Vision Transformer Fusion

ROFusion: Efficient Object Detection using Hybrid Point-wise Radar-Optical Fusion

Deep LiDAR-Radar-Visual Fusion for Object Detection in Urban Environments