A Self-Supervised Method of Single-Image Depth Estimation by Feeding Forward Information Using Max-Pooling Layers

Shi Jinlong,Sun Yunhan,Bai Suqin,Sun Zhengxing,Tian Zhaohui
DOI: https://doi.org/10.1007/s00371-020-01832-6
IF: 2.835
2020-01-01
The Visual Computer
Abstract:We propose an encoder–decoder CNN framework to predict depth from one single image in a self-supervised manner. To this aim, we design three kinds of encoder based on the recent advanced deep neural network and one kind of decoder which can generate multiscale predictions. Eight loss functions are designed based on the proposed encoder–decoder CNN framework to validate the performance. For training, we take rectified stereo image pairs as input of the CNN, which is trained by reconstructing image via learning multiscale disparity maps. For testing, the CNN can estimate the accurate depth information by inputting only one single image. We validate our framework on two public datasets in contrast to the state-of-the-art methods and our designed different variants, and the performance of different encoder–decoder architectures and loss functions is evaluated to obtain the best combination, which proves that our proposed method performs very well for single-image depth estimation without the supervision of ground truth.
What problem does this paper attempt to address?