SCVO: Scale-Consistent Depth and Pose for Unsupervised Visual Odometry

Huiqing Zhang,Yongjian Yang,Ji Li,Weikang Li
DOI: https://doi.org/10.23919/ccc55666.2022.9902340
2022-01-01
Abstract:The technology for Visual Odometry (VO) that estimates the position of moving objects through image sequences, has drawn significant attention in visual localization. The performance of current VO models is limited by the fragility of photometric consistency as the supervisory signal, which causes the accumulation of scale errors over time. Moreover, unidentified moving objects cause noisy signals during training. To tackle these challenges, we propose a novel monocular visual odometry model called SCVO, which requires only unlabeled video sequences and achieves scale-consistent pose estimation. Specifically, considering the relevant information of adjacent pixels, a improved depth consistency loss is introduced into our framework. Based on the original depth information constraints, we integrate SSIM to constrain the depth consistency of adjacent frames from both local and global perspectives. Based on source images and warped images, we further propose a depth-consistency mask to exclude mismatched pixels caused by dynamic objects and occlusions. Experiments on the KITTI dataset show that the SCVO has achieved superior performance than recently developed VO models.
What problem does this paper attempt to address?