Improving Unsupervised Learning of Monocular Depth and Ego-Motion Via Stereo Network

Mu He,Jin Xie,Jian Yang
DOI: https://doi.org/10.1007/978-3-030-88007-1_35
2021-01-01
Abstract:Unsupervised learning of monocular depth and ego-motion is a challenging task, which uses the photometric loss as the supervision to train the networks. Although existing unsupervised methods can get rid of expensive annotations, they are still limited in estimation accuracy. In this paper, we explore the use of stereo depth network for improving the performance of monocular depth estimation and ego-motion estimation. To this end, we propose a novel two-stage unsupervised learning framework. Specifically, in the first stage, we jointly train the stereo depth network and ego-motion network in an unsupervised manner, in order to get a more accurate ego-motion estimator. Then we transfer and freeze the egomotion network to the second stage, and only train the monocular depth network in this stage. Moreover, we propose a dense feature fusion module to further enhance the expressive ability of monocular depth network without increasing the number of network parameters. Extensive experiments on the KITTI andMake3D datasets demonstrate that our proposed method achieves superior performance on both monocular depth estimation and ego-motion estimation to existing unsupervised methods.
What problem does this paper attempt to address?