Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset

Fuzhang Han,Yufei Wei,Yanmei Jiao,Zhuqing Zhang,Yiyuan Pan,Wenjun Huang,Li Tang,Huan Yin,Xiaqing Ding,Rong Xiong,Yue Wang
2024-12-06
Abstract:Map-based localization is crucial for the autonomous movement of robots as it provides real-time positional feedback. However, existing VINS and SLAM systems cannot be directly integrated into the robot's control loop. Although VINS offers high-frequency position estimates, it suffers from drift in long-term operation. And the drift-free trajectory output by SLAM is post-processed with loop correction, which is non-causal. In practical control, it is impossible to update the current pose with future information. Furthermore, existing SLAM evaluation systems measure accuracy after aligning the entire trajectory, which overlooks the transformation error between the odometry start frame and the ground truth frame. To address these issues, we propose a multi-cam multi-map visual inertial localization system, which provides real-time, causal and drift-free position feedback to the robot control loop. Additionally, we analyze the error composition of map-based localization systems and propose a set of evaluation metric suitable for measuring causal localization performance. To validate our system, we design a multi-camera IMU hardware setup and collect a long-term challenging campus dataset. Experimental results demonstrate the higher real-time localization accuracy of the proposed system. To foster community development, both the system and the dataset have been made open source <a class="link-external link-https" href="https://github.com/zoeylove/Multi-cam-Multi-map-VILO/tree/main" rel="external noopener nofollow">this https URL</a>.
Robotics
What problem does this paper attempt to address?
The problems that this paper attempts to solve are the limitations of existing visual - inertial navigation systems (VINS) and simultaneous localization and mapping (SLAM) systems in the application of robot control loops. Specifically: 1. **Real - time and Drift - free**: Although VINS can provide high - frequency position estimates, it will accumulate drift during long - term operation. Although SLAM can eliminate drift through loop - closure correction, this correction is non - causal, that is, it needs future data to update the current pose, which is impossible to achieve in actual control. 2. **Insufficiency of Evaluation Metrics**: Existing VINS and SLAM evaluation systems usually measure accuracy after aligning the entire trajectory, which ignores the transformation error between the odometry start frame and the ground - truth frame. In addition, metrics such as ATE (Absolute Trajectory Error) measure the final trajectory after multiple loop - closure corrections, which makes the performance evaluation too optimistic and does not conform to the feedback position used in actual navigation. 3. **Challenges of Multi - camera and Multi - map**: Existing systems perform poorly under long - term changes in large - scale environments, especially in multi - map fusion. These systems usually require a globally consistent map or need to merge multiple sub - maps online, which requires overlap between sub - maps. However, directly using multiple isolated maps and switching the corresponding sub - maps in different areas is a more flexible and reasonable method. To solve these problems, the author proposes a multi - camera and multi - map visual - inertial localization system (VILO), which can provide real - time, causal and drift - free position feedback and is suitable for the position control loop of robots. In addition, the author also analyzes the error composition of map - based localization systems and proposes a set of evaluation metrics suitable for evaluating causal localization performance. To verify the system performance, the author designs a multi - sensor hardware platform and collects a campus data set containing long - term appearance and structural changes. The experimental results show that the system has high accuracy and robustness in real - time localization accuracy.