Global-Context-Aware Visual Odometry System With Epipolar-Geometry-Constrained Loss Function

Zhong Ji, Keqin Nan, Yan Xu, Haoyuan Wang, Xuening Li, Jiale Bai
DOI: https://doi.org/10.1109/tim.2024.3370804
IF: 5.6
2024-03-15
IEEE Transactions on Instrumentation and Measurement
Abstract: Visual odometry (VO) plays a vital role in simultaneous localization and mapping (SLAM). Most current learning-based VO methods use a convolutional neural network (CNN) as their framework. However, CNNs are weak at integrating global context information. In addition, when designing the loss function, the majority of these methods ignore the mutual constraint between translation and rotation prediction. In this work, we propose an end-to-end global-context-aware visual odometry system with an epipolar-geometry-constrained loss function (CEGVO) to estimate the relative 6-DoF poses of a monocular camera. The proposed scheme places an augmented-attention-enhanced global context block (GCB) on top of a contextual CNN to learn long-range dependencies and internal correlations. To overcome the mutual restraint between translation and rotation errors, an epipolar-geometry-constrained loss function is developed that improves the prediction accuracy of translation and rotation simultaneously. Evaluation results on public datasets and a self-collected dataset show that the proposed system outperforms state-of-the-art (SOTA) learning-based VO methods by a large margin.
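The abstract does not give the form of the epipolar-geometry-constrained loss, so the following is only a minimal sketch of the standard epipolar constraint that such a loss could build on, not the authors' formulation. It assumes the convention X2 = R X1 + t, points given as homogeneous normalized image coordinates, and the essential matrix E = [t]x R; the function names `skew` and `epipolar_residual` are hypothetical. Because E depends on both R and t, a residual of this kind penalizes rotation and translation errors jointly rather than separately.

```python
import numpy as np


def skew(t):
    """Return the 3x3 skew-symmetric matrix [t]x, so that skew(t) @ v == np.cross(t, v)."""
    tx, ty, tz = t
    return np.array([[0.0, -tz, ty],
                     [tz, 0.0, -tx],
                     [-ty, tx, 0.0]])


def epipolar_residual(R, t, x1, x2):
    """Mean absolute algebraic epipolar error |x2^T E x1|, with E = [t]x R.

    R      : (3, 3) relative rotation (assumed convention: X2 = R @ X1 + t)
    t      : (3,)   relative translation
    x1, x2 : (N, 3) matched points in homogeneous normalized image coordinates
    """
    E = skew(t) @ R
    # Evaluate x2_i^T E x1_i for every correspondence i.
    residuals = np.einsum("ni,ij,nj->n", x2, E, x1)
    return float(np.mean(np.abs(residuals)))


if __name__ == "__main__":
    # Synthetic check: points in front of the first camera, moved by a known pose.
    rng = np.random.default_rng(0)
    X1 = rng.uniform(-1.0, 1.0, size=(50, 3)) + np.array([0.0, 0.0, 5.0])
    theta = 0.1  # rotation about the optical axis, in radians
    R = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                  [np.sin(theta), np.cos(theta), 0.0],
                  [0.0, 0.0, 1.0]])
    t = np.array([0.5, 0.1, 0.2])
    X2 = X1 @ R.T + t
    x1 = X1 / X1[:, 2:3]
    x2 = X2 / X2[:, 2:3]
    print("residual at true pose:     ", epipolar_residual(R, t, x1, x2))
    print("residual at wrong rotation:", epipolar_residual(np.eye(3), t, x1, x2))
```

With the true pose the residual vanishes up to floating-point error, while an error in either R or t makes it grow, which is the coupling a geometry-based loss term can exploit.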
engineering, electrical & electronic; instruments & instrumentation