VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors

Thien Hoang Nguyen, Shenghai Yuan, Lihua Xie
2023-03-20
Abstract: In this work, we propose a simultaneous localization and mapping (SLAM) system using a monocular camera and Ultra-wideband (UWB) sensors. Our system, referred to as VR-SLAM, is a multi-stage framework that leverages the strengths and compensates for the weaknesses of each sensor. Firstly, we introduce a UWB-aided 7-degree-of-freedom (scale factor, 3D position, and 3D orientation) global alignment module to initialize the visual odometry (VO) system in the world frame defined by the UWB anchors. This module loosely fuses up-to-scale VO and ranging data using either a quadratically constrained quadratic programming (QCQP) or nonlinear least squares (NLS) algorithm, depending on whether a good initial guess is available. Secondly, we provide an accompanying theoretical analysis that includes the derivation and interpretation of the Fisher Information Matrix (FIM) and its determinant. Thirdly, we present UWB-aided bundle adjustment (UBA) and UWB-aided pose graph optimization (UPGO) modules to improve short-term odometry accuracy, reduce long-term drift, and correct any alignment and scale errors. Extensive simulations and experiments show that our solution outperforms UWB/camera-only and previous approaches, can quickly recover from tracking failure without relying on visual relocalization, and can effortlessly obtain a global map even if there are no loop closures.
Robotics
What problem does this paper attempt to address?
The paper aims to achieve accurate, reliable and consistent localization for autonomous mobile robots, in both the short term and the long term. It proposes a Visual-Range Simultaneous Localization and Mapping (SLAM) system, called VR-SLAM, that uses a monocular camera and Ultra-wideband (UWB) sensors. The system is a multi-stage framework that combines the strengths of each sensor and compensates for their respective weaknesses in order to solve three main problems:

1. **Global Affine Transformation (GAT) Problem**:
   - Find the 3D affine transformation matrix that scales, rotates and translates the data of the Visual Odometry (VO) system (reference frames, keyframes and map points) into the world coordinate system. This is a crucial step in the initialization phase, because it places the visual data and the UWB data in the same coordinate system.
2. **Visual-Range Odometry (VRO) Problem**:
   - VO estimates drift over time while UWB ranging does not, so visual and ranging measurements are fused in a tightly coupled manner to reduce accumulated error and improve overall accuracy. This is the core problem during system operation, and solving it maintains long-term localization accuracy.
3. **UWB-Assisted Visual Relocalization (UVR) Problem**:
   - Loop closure is a useful tool for correcting long-term drift or relocalizing when the tracking front-end fails, but it is vulnerable to perceptual aliasing. The proposed method combines visual and ranging data to avoid accepting wrong loop closure candidates and to relocalize accurately when tracking is lost, or at least to re-initialize the system in the world coordinate system as soon as possible.
By solving the above problems, the VR-SLAM system can generate a globally consistent map without relying on loop closure, even across multiple runs that share no direct loop closure. In addition, the system can quickly recover from tracking failures when visual relocalization is not feasible, thus providing more extensive, accurate and reliable localization and mapping results.
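
To make the global alignment step more concrete, the following is a minimal sketch of the nonlinear least squares (NLS) variant, not the authors' implementation. It assumes known anchor positions in the world frame, one range per (pose, anchor) pair, and no offset between the camera and the UWB antenna. The function names (`transform`, `range_residuals`, `align`) and the log-scale plus rotation-vector parameterization are illustrative choices that do not come from the paper.

```python
import numpy as np
from scipy.optimize import least_squares
from scipy.spatial.transform import Rotation


def transform(params, points):
    """Apply a 7-DoF transform to (N, 3) points.

    params = [log_scale, rotvec_x, rotvec_y, rotvec_z, t_x, t_y, t_z]
    (hypothetical parameterization; log-scale keeps the scale positive).
    """
    scale = np.exp(params[0])
    rotation = Rotation.from_rotvec(params[1:4])
    return scale * rotation.apply(points) + params[4:7]


def range_residuals(params, vo_points, anchors, ranges):
    """Predicted minus measured range for every (pose, anchor) pair."""
    world_points = transform(params, vo_points)
    # Pairwise distances between N transformed poses and M anchors: (N, M).
    predicted = np.linalg.norm(
        world_points[:, None, :] - anchors[None, :, :], axis=2
    )
    return (predicted - ranges).ravel()


def align(vo_points, anchors, ranges, initial_guess):
    """Refine the 7-DoF alignment from an initial guess with NLS."""
    result = least_squares(
        range_residuals, initial_guess, args=(vo_points, anchors, ranges)
    )
    return result.x
```

Like any local NLS solver, this sketch only converges reliably from a reasonable initial guess, which is consistent with the abstract's statement that the choice between QCQP and NLS depends on whether a good initial guess is available.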