Towards Loss Balance and Consistent Model in Self-supervised Monocular Depth Estimation

Chengyuan Li,Lanqing Zhang,Xing Cai,Keyao Li,Ge Li,Thomas H. Li
DOI: https://doi.org/10.1109/ictai50040.2020.00138
2020-01-01
Abstract:Recently, self-supervised methods based on Convolutional Neural Networks (CNN) have achieved remarkable success in monocular depth estimation. To obtain higher quality depth maps, some of these approaches leverage traditional schemes to compute rough depth maps as proxy labels and adopt classic regression loss functions to minimize the differences between network-predicted depth maps and proxy labels. However, at proxy labels with large depth values, these methods suffer from a loss imbalance problem. To address this limitation and further improve the network performance, this article offers three key contributions. Firstly, a novel regression loss function is proposed, which can alleviate the loss imbalance problem and better handle rough proxy labels. Secondly, a dynamic mask is designed to accelerate network convergence. Thirdly, an innovative consistency loss is introduced, which can produce a more accurate and consistent model by maintaining consistency between the produced depth maps of each input image and its mirror. The effectiveness of our contributions is demonstrated by a series of ablation studies. Extensive experiments on KITTI dataset reveal that our approach achieves state-of-the-art results.
What problem does this paper attempt to address?