Abstract:Dense depth recovery is crucial in autonomous driving, serving as a foundational element for obstacle avoidance, 3D object detection, and local path planning. Adverse weather conditions, including haze, dust, rain, snow, and darkness, introduce significant challenges to accurate dense depth estimation, thereby posing substantial safety risks in autonomous driving. These challenges are particularly pronounced for traditional depth estimation methods that rely on short electromagnetic wave sensors, such as visible spectrum cameras and near-infrared LiDAR, due to their susceptibility to diffraction noise and occlusion in such environments. To fundamentally overcome this issue, we present a novel approach for robust metric depth estimation by fusing a millimeter-wave Radar and a monocular infrared thermal camera, which are capable of penetrating atmospheric particles and unaffected by lighting conditions. Our proposed Radar-Infrared fusion method achieves highly accurate and finely detailed dense depth estimation through three stages, including monocular depth prediction with global scale alignment, quasi-dense Radar augmentation by learning Radar-pixels correspondences, and local scale refinement of dense depth using a scale map learner. Our method achieves exceptional visual quality and accurate metric estimation by addressing the challenges of ambiguity and misalignment that arise from directly fusing multi-modal long-wave features. We evaluate the performance of our approach on the NTU4DRadLM dataset and our self-collected challenging ZJU-Multispectrum dataset. Especially noteworthy is the unprecedented robustness demonstrated by our proposed method in smoky scenarios. Our code will be released at \url{

Semantic-guided Depth Completion from Monocular Images and 4D Radar Data

MFF-Net: Towards Efficient Monocular Depth Completion With Multi-Modal Feature Fusion

Least Square Estimation Network for Depth Completion

RaViDeep: Target Detection Based on Deep Fusion of Radar and Vision in Berthing Scenarios

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

Radar-Camera Pixel Depth Association for Depth Completion

DepthSSC: Depth-Spatial Alignment and Dynamic Voxel Resolution for Monocular 3D Semantic Scene Completion

RIDERS: Radar-Infrared Depth Estimation for Robust Sensing

Recent Advances in Conventional and Deep Learning-Based Depth Completion: A Survey

RSDCN: A Road Semantic Guided Sparse Depth Completion Network

3-D Grid-Based VDBSCAN Clustering and Radar—Monocular Depth Completion

Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints

Radar and Camera Fusion for Multi-Task Sensing in Autonomous Driving

RadarCam-Depth: Radar-Camera Fusion for Depth Estimation with Learned Metric Scale

Depth Estimation from Monocular Images and Sparse Radar Data

Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion

Self-supervised Sparse-to-Dense: Self-supervised Depth Completion from LiDAR and Monocular Camera

Deep Depth Completion from Extremely Sparse Data: A Survey

Depth Completion via Inductive Fusion of Planar LIDAR and Monocular Camera

Object Semantics Give Us the Depth We Need: Multi-task Approach to Aerial Depth Completion

Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics