Advancements in Translation Accuracy for Stereo Visual-Inertial Initialization

Han Song,Zhongche Qu,Zhi Zhang,Zihan Ye,Cong Liu
2024-08-18
Abstract:As the current initialization method in the state-of-the-art Stereo Visual-Inertial SLAM framework, ORB-SLAM3 has limitations. Its success depends on the performance of the pure stereo SLAM system and is based on the underlying assumption that pure visual SLAM can accurately estimate the camera trajectory, which is essential for inertial parameter estimation. Meanwhile, the further improved initialization method for ORB-SLAM3, known as Stereo-NEC, is time-consuming due to applying keypoint tracking to estimate gyroscope bias with normal epipolar constraints. To address the limitations of previous methods, this paper proposes a method aimed at enhancing translation accuracy during the initialization stage. The fundamental concept of our method is to improve the translation estimate with a 3 Degree-of-Freedom (DoF) Bundle Adjustment (BA), independently, while the rotation estimate is fixed, instead of using ORB-SLAM3's 6-DoF BA. Additionally, the rotation estimate will be updated by considering IMU measurements and gyroscope bias, unlike ORB-SLAM3's rotation, which is directly obtained from stereo visual odometry and may yield inferior results when operating in challenging scenarios. We also conduct extensive evaluations on the public benchmark, the EuRoC dataset, demonstrating that our method excels in accuracy.
Robotics
What problem does this paper attempt to address?
This paper aims to address the limitations in accuracy and efficiency of current stereo - visual - inertial initialization methods. Specifically, the paper points out: 1. **Limitations of ORB - SLAM3**: - The initialization of ORB - SLAM3 depends on the performance of the pure stereo - visual SLAM system. It is assumed that pure visual SLAM can accurately estimate the camera trajectory, which is crucial for inertial parameter estimation. - In challenging scenarios, the rotation estimation of ORB - SLAM3 is directly obtained from stereo - visual odometry, which may lead to poor results. 2. **Limitations of Stereo - NEC**: - Although Stereo - NEC estimates the gyroscope bias by using stereo normal epipolar constraints and further optimizes inertial parameters, its feature - matching process is time - consuming, resulting in a running - time delay. To overcome these limitations, the paper proposes a new initialization method called ETA (Enhanced Translation Accuracy), with the main goal of improving the translation accuracy in the initialization stage. The core ideas of this method include: - **3 - Degrees - of - Freedom (3 - DoF) Bundle Adjustment (BA)**: Independently optimize the translation estimate while fixing the rotation estimate, instead of using the 6 - DoF BA of ORB - SLAM3. - **IMU Measurements and Gyroscope Bias**: Update the rotation estimate by considering IMU measurements and gyroscope bias, thereby improving the accuracy of rotation estimation in challenging scenarios. The paper demonstrates the superiority of this method in accuracy through extensive evaluation on the public benchmark dataset EuRoC, especially on the V2_03_difficult dataset, which contains complex factors such as motion blur and illumination changes and imposes high requirements on state estimation.