Depth Estimation from Single Monocular Images Using Deep Hybrid Network

Aleksei Grigorev, Feng Jiang, Seungmin Rho, Worku J. Sori, Shaohui Liu, Sergey Sai
DOI: https://doi.org/10.1007/s11042-016-4200-x
IF: 2.577
2016-01-01
Multimedia Tools and Applications
Abstract: Depth estimation is an important task in robot vision. In this paper, we address depth estimation from a single monocular image, a challenging problem in automated vision systems because a single image carries no additional measurements. To this end, we design a deep hybrid neural network composed of convolutional and recurrent (ReNet) layers, where each ReNet layer is built from Long Short-Term Memory (LSTM) units, which are known for their ability to memorize long-range context. In the proposed network, the ReNet layers enrich the feature representation by directly capturing global context. The effective integration of ReNet and convolutional layers in a common CNN framework allows us to train the hybrid network end to end. Experimental evaluation on the benchmark dataset demonstrates that the hybrid network achieves state-of-the-art results without any post-processing steps. Moreover, the composition of recurrent and convolutional layers provides more satisfying results.
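To make the idea of a ReNet layer concrete, the following is a minimal forward-pass sketch in NumPy, not the authors' implementation. It assumes a ReNet layer sweeps bidirectional LSTMs across the rows of a feature map and then across its columns, so that every output position can depend on every input position. The class names (`LSTMCell`, `ReNetLayer`), the sweep order, the hidden size, and the random weight initialisation are all illustrative assumptions; the abstract does not specify them.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class LSTMCell:
    """Minimal LSTM (forward pass only) with random weights; hypothetical helper."""

    def __init__(self, input_size, hidden_size, rng):
        self.hidden_size = hidden_size
        # One matrix holds the input, forget, candidate and output gate weights.
        self.W = rng.standard_normal((input_size + hidden_size, 4 * hidden_size)) * 0.1
        self.b = np.zeros(4 * hidden_size)

    def run(self, seq):
        """seq: (batch, steps, input_size) -> (batch, steps, hidden_size)."""
        batch, steps, _ = seq.shape
        h = np.zeros((batch, self.hidden_size))
        c = np.zeros((batch, self.hidden_size))
        outputs = []
        for t in range(steps):
            z = np.concatenate([seq[:, t], h], axis=1) @ self.W + self.b
            i, f, g, o = np.split(z, 4, axis=1)
            c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
            h = sigmoid(o) * np.tanh(c)
            outputs.append(h)
        return np.stack(outputs, axis=1)


def bidirectional_sweep(x, fwd, bwd):
    """Run one LSTM left-to-right and another right-to-left, then concatenate."""
    out_f = fwd.run(x)
    out_b = bwd.run(x[:, ::-1])[:, ::-1]
    return np.concatenate([out_f, out_b], axis=-1)


class ReNetLayer:
    """Sketch of a ReNet-style layer: horizontal sweep followed by vertical sweep."""

    def __init__(self, in_channels, hidden_size, seed=0):
        rng = np.random.default_rng(seed)
        self.h_fwd = LSTMCell(in_channels, hidden_size, rng)
        self.h_bwd = LSTMCell(in_channels, hidden_size, rng)
        self.v_fwd = LSTMCell(2 * hidden_size, hidden_size, rng)
        self.v_bwd = LSTMCell(2 * hidden_size, hidden_size, rng)

    def __call__(self, fmap):
        """fmap: (batch, height, width, channels) -> (batch, height, width, 2 * hidden)."""
        b, h, w, c = fmap.shape
        # Horizontal sweep: treat each row as a sequence of length w.
        rows = bidirectional_sweep(fmap.reshape(b * h, w, c), self.h_fwd, self.h_bwd)
        horiz = rows.reshape(b, h, w, -1)
        # Vertical sweep: treat each column as a sequence of length h.
        cols = horiz.transpose(0, 2, 1, 3).reshape(b * w, h, -1)
        cols = bidirectional_sweep(cols, self.v_fwd, self.v_bwd)
        return cols.reshape(b, w, h, -1).transpose(0, 2, 1, 3)


# Example: an 8-channel feature map, as might come from a convolutional layer.
features = np.random.default_rng(1).standard_normal((2, 5, 6, 8))
layer = ReNetLayer(in_channels=8, hidden_size=4)
context = layer(features)
print(context.shape)
```

In a hybrid network of the kind the abstract describes, the input to such a layer would be the output of convolutional layers, and the result would feed further layers that regress depth. Because the layer preserves the spatial layout, it can be placed between convolutional layers without changing the surrounding shapes beyond the channel count.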