Abstract: Obstacle avoidance is vital to safe autonomous driving. When a vehicle travels from an arbitrary start position to an arbitrary target position in its environment, an appropriate route must avoid both static and moving obstacles. Knowing the accurate depth of each obstacle in the scene contributes to obstacle avoidance. In recent years, notable advances in deep neural networks and hardware have made precise depth estimation systems possible. Several depth estimation methods for autonomous vehicles use lasers, structured light, and other reflections from object surfaces to capture depth point clouds, complete surface modeling, and estimate scene depth maps. However, estimating precise depth maps remains challenging because of the computational complexity and processing time involved. In contrast, image-based depth estimation approaches have recently attracted attention and can be applied to a broad range of applications. The vast majority of camera-based depth estimation methods determine the depth map of the whole input image using binocular cameras or a 3D camera, which is also time-consuming. In this paper, a novel approach is proposed that predicts the depth of the obstacle ahead using only a 2D monocular camera. In the first stage, the bounding boxes of obstacles are extracted by a deep neural network. Unlike methods that calculate a depth map for every pixel of the image, the proposed approach calculates the average depth of each bounding box and assigns it as a label. The labels and feature vectors (the four values of each bounding box) are then used as the input data of the proposed network, which maps the feature vectors from the previous stage to estimated depth values. The results suggest that the model can reasonably predict the depths of obstacles on the KITTI dataset.
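
The label-construction step described in the abstract (the average depth of each bounding box, paired with the box's four values) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `(x1, y1, x2, y2)` pixel box format, and the convention that a depth of zero marks a missing measurement are all assumptions made for illustration.

```python
import numpy as np


def box_mean_depth(depth_map, box):
    """Average the valid depth values inside one bounding box.

    Assumptions (not from the paper): box is (x1, y1, x2, y2) in pixel
    coordinates with exclusive upper bounds, and a depth of 0 means
    "no measurement" and is ignored.
    """
    x1, y1, x2, y2 = box
    patch = depth_map[y1:y2, x1:x2]
    valid = patch[patch > 0]
    return float(valid.mean()) if valid.size else float("nan")


def build_training_pairs(depth_map, boxes):
    """Pair each box's four values (features) with its mean depth (label).

    Boxes that contain no valid depth value are skipped.
    """
    features, labels = [], []
    for box in boxes:
        depth = box_mean_depth(depth_map, box)
        if not np.isnan(depth):
            features.append(box)
            labels.append(depth)
    return (np.asarray(features, dtype=np.float32),
            np.asarray(labels, dtype=np.float32))
```

The resulting feature and label arrays could then be passed to any regressor that maps four inputs to one depth value. The architecture of the network itself is not specified in the abstract and is therefore not sketched here.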

Predicting Depth from Semantic Segmentation using Game Engine Dataset

Depth Generation Network: Estimating Real World Depth From Stereo And Depth Images

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

Binocular Depth Estimation Using Convolutional Neural Network With Siamese Branches

RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation

Playing for Depth

Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations

A Deep Joint Network for Monocular Depth Estimation Based on Pseudo-Depth Supervision

Deep Monocular Depth Estimation Based on Content and Contextual Features

A Real-Time Semi-Dense Depth-Guided Depth Completion Network

Semi-Supervised Monocular Depth Estimation with Left-Right Consistency Using Deep Neural Network

Self-supervised Depth Estimation Leveraging Global Perception and Geometric Smoothness Using On-board Videos

Semisupervised learning-based depth estimation with semantic inference guidance

To Complete or to Estimate, That is the Question: A Multi-Task Approach to Depth Completion and Monocular Depth Estimation

Learning Depth Estimation from Memory Infusing Monocular Cues: A Generalization Prediction Approach

Depth Is All You Need for Monocular 3D Detection

SemHint-MD: Learning from Noisy Semantic Labels for Self-Supervised Monocular Depth Estimation

Monocular Depth Estimation Using Cues Inspired by Biological Vision Systems

KDepthNet: Mono-Camera Based Depth Estimation for Autonomous Driving

Unsupervised Learning of Depth from Monocular Videos Using 3D-2D Corresponding Constraints