Abstract:This paper studies visual odometry (VO) from the perspective of deep learning. After tremendous efforts in the robotics and computer vision communities over the past few decades, state-of-the-art VO algorithms have demonstrated incredible performance. However, since the VO problem is typically formulated as a pure geometric problem, one of the key features still missing from current VO systems is the capability to automatically gain knowledge and improve performance through learning. In this paper, we investigate whether deep neural networks can be effective and beneficial to the VO problem. An end-to-end, sequence-to-sequence probabilistic visual odometry (ESP-VO) framework is proposed for the monocular VO based on deep recurrent convolutional neural networks. It is trained and deployed in an end-to-end manner, that is, directly inferring poses and uncertainties from a sequence of raw images (video) without adopting any modules from the conventional VO pipeline. It can not only automatically learn effective feature representation encapsulating geometric information through convolutional neural networks, but also implicitly model sequential dynamics and relation for VO using deep recurrent neural networks. Uncertainty is also derived along with the VO estimation without introducing much extra computation. Extensive experiments on several datasets representing driving, flying and walking scenarios show competitive performance of the proposed ESP-VO to the state-of-the-art methods, demonstrating a promising potential of the deep learning technique for VO and verifying that it can be a viable complement to current VO systems.

Visual Odometry with Deep Bidirectional Recurrent Neural Networks.

Self-supervised Visual-LiDAR Odometry with Flip Consistency

CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth

Deep Visual Odometry with Adaptive Memory

Twinvo: Unsupervised Learning of Monocular Visual Odometry Using Bi-Direction Twin Network

End-to-end, sequence-to-sequence probabilistic visual odometry through deep neural networks

Leveraging Deep Learning for Visual Odometry Using Optical Flow

Guided Feature Selection for Deep Visual Odometry

Learning-based Image Enhancement for Visual Odometry in Challenging HDR Environments

Robust self-supervised monocular visual odometry based on prediction-update pose estimation network.

Self-Supervised Deep Visual Odometry with Online Adaptation

Spatio-temporal and geometry constrained network for automobile visual odometry

Self-Supervised Deep Visual Odometry Based on Geometric Attention Model

DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry

Salient Sparse Visual Odometry With Pose-Only Supervision

LSTM Pose Machines.

Unsupervised Monocular Visual-Inertial Odometry Network

XVO: Generalized Visual Odometry via Cross-Modal Self-Training

DF-VO: What Should Be Learnt for Visual Odometry?

DeepVO: A Deep Learning approach for Monocular Visual Odometry

An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation