UnLearnerMC: Unsupervised Learning of Dense Depth and Camera Pose Using Mask and Cooperative Loss

Junning Zhang,Qunxing Su,Pengyuan Liu,Chao Xu,Zhengjun Wang
DOI: https://doi.org/10.1016/j.knosys.2019.105357
IF: 8.139
2019-01-01
Knowledge-Based Systems
Abstract:In this paper, we propose an unsupervised learning framework, named UnLearnerMC, for jointly training monocular depth, camera pose, and segmentation from videos. Existing unsupervised methods typically exploit optical flow consistency to train the segmentation network and eliminate interference from moving objects. Our key point is to create SegMaskNet and corresponding training losses for distinguishing between moving objects and static scenes. Specifically, we eliminate misleading by exploiting boundary masks when moving objects are beyond the view boundary. The SegMaskNet constantly adjusts the allocation of static and moving object pixels via a cooperative loss to minimize the total loss during training. For the moving areas, we can use the re-estimated static depth and pose to eliminate interference from moving areas by using the proposed photometric loop consistency loss. Experiments on KITTI datasets show that UnLearnerMC achieves state-of-the-art results in single-view depth and camera ego-motion, which illustrate the benefits of our approach.
What problem does this paper attempt to address?