Abstract: Inferring depth from images is a fundamental inverse problem in Computer Vision: depth must be recovered from 2D images, and any given image could have been produced by infinitely many real scenes. Benefiting from the ability of Convolutional Neural Networks (CNNs) to exploit structural features and spatial image information, Single Image Depth Estimation (SIDE) is often highlighted in scientific and technological innovation because of its low implementation cost and robustness to environmental conditions. In the context of autonomous vehicles, state-of-the-art CNNs address the SIDE task by producing high-quality depth maps, which are essential for autonomous navigation in different locations. However, such networks are usually supervised with sparse and noisy depth data from Light Detection and Ranging (LiDAR) laser scans, and they run at high computational cost, requiring high-performance Graphics Processing Units (GPUs). We therefore propose a new lightweight and fast supervised CNN architecture, combined with novel feature extraction models, designed for real-world autonomous navigation. We also introduce an efficient surface normals module, together with a simple geometric 2.5D loss function, to solve SIDE problems. In addition, we incorporate multiple Deep Learning techniques, such as densification algorithms and additional semantic, surface normals, and depth information, to train our framework. The method focuses on robotic applications in indoor and outdoor environments, and its results are evaluated on the competitive and publicly available NYU Depth V2 and KITTI Depth datasets.

A Comparison Study of Depth Map Estimation in Indoor Environments Using pix2pix and CycleGAN

Depth Generation Network: Estimating Real World Depth From Stereo And Depth Images

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

A Robust Monocular Depth Estimation Framework Based on Light-Weight ERF-PSPNet for Day-Night Driving Scenes

Depth Estimation from Monocular Image and Coarse Depth Points Based on Conditional GAN

Depth Estimation of Traffic Scenes from Image Sequence Using Deep Learning

Deeper into Self-Supervised Monocular Indoor Depth Estimation

Generative Adversarial Networks for Unsupervised Monocular Depth Prediction

Unpaired Single-Image Depth Synthesis with cycle-consistent Wasserstein GANs

Conditional Generative Adversarial Network for Monocular Image Depth Map Prediction

Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs

Domain-Transferred Synthetic Data Generation for Improving Monocular Depth Estimation

GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks

On Deep Learning Techniques to Boost Monocular Depth Estimation for Autonomous Navigation

Environment reconstruction on depth images using Generative Adversarial Networks

SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation With Stacked Generative Adversarial Networks

GAM-Depth: Self-Supervised Indoor Depth Estimation Leveraging a Gradient-Aware Mask and Semantic Constraints

A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion

A Systematic Comparison of Depth Map Representations for Face Recognition

Sparse-to-Continuous: Enhancing Monocular Depth Estimation using Occupancy Maps

SGTBN: Generating Dense Depth Maps from Single-Line LiDAR