A Real-Time Dynamic SLAM Algorithm Based on the Fusion of Visual, Inertial, and Semantic Information

Yinglong Wang,Xiaoxiong Liu,Minkun Zhao,Xinlong Xu
DOI: https://doi.org/10.3390/ijgi13050163
IF: 3.4
2024-05-14
ISPRS International Journal of Geo-Information
Abstract:A deep learning-based Visual Inertial SLAM technique is proposed in this paper to ensure accurate autonomous localization of mobile robots in environments with dynamic objects. Addressing the limitations of real-time performance in deep learning algorithms and the poor robustness of pure visual geometry algorithms, this paper presents a deep learning-based Visual Inertial SLAM technique. Firstly, a non-blocking model is designed to extract semantic information from images. Then, a motion probability hierarchy model is proposed to obtain prior motion probabilities of feature points. For image frames without semantic information, a motion probability propagation model is designed to determine the prior motion probabilities of feature points. Furthermore, considering that the output of inertial measurements is unaffected by dynamic objects, this paper integrates inertial measurement information to improve the estimation accuracy of feature point motion probabilities. An adaptive threshold-based motion probability estimation method is proposed, and finally, the positioning accuracy is enhanced by eliminating feature points with excessively high motion probabilities. Experimental results demonstrate that the proposed algorithm achieves accurate localization in dynamic environments while maintaining real-time performance.
geography, physical,remote sensing,computer science, information systems
What problem does this paper attempt to address?
This paper attempts to address the problem of real-time autonomous localization of mobile robots in dynamic environments. Specifically, existing SLAM technologies typically assume that the environment is static, but in the real world, there are many dynamic objects (such as pedestrians, fast-moving vehicles, etc.) that adversely affect pose estimation. Therefore, the paper proposes a real-time dynamic SLAM algorithm based on the fusion of visual, inertial, and semantic information (VIS-SLAM) to improve localization accuracy and real-time performance in dynamic environments. ### Main Issues 1. **Limitations of Existing SLAM Technologies**: - **Static Environment Assumption**: Most existing SLAM technologies assume that the environment is static, which is unrealistic in practical applications. - **Impact of Dynamic Objects**: Dynamic objects (such as pedestrians, vehicles, etc.) severely affect the accuracy of pose estimation. - **Insufficient Real-time Performance**: Although deep learning algorithms are powerful, their real-time performance is poor, while pure visual geometric algorithms have poor robustness. 2. **Challenges of Localization in Dynamic Environments**: - **Identification and Handling of Dynamic Feature Points**: It is necessary to effectively identify and exclude dynamic feature points to avoid their interference with localization. - **Fusion of Multi-sensor Information**: Utilizing information from multiple sensors (such as visual, inertial, etc.) to improve the accuracy and robustness of localization. ### Solutions The paper proposes the following methods to address the above issues: 1. **Non-blocking Model Design**: - Designed a non-blocking model to extract semantic information from images, reducing the impact of the deep learning module on the real-time performance of the system and improving the execution speed of the algorithm. 2. **Motion Probability Grading and Propagation Model**: - Proposed a motion probability grading model that assigns different prior motion probabilities to objects based on different semantic types. - Designed a motion probability propagation model to estimate the prior motion probability of feature points for image frames without extracted semantic information through the feature matching relationship and semantic information of adjacent frames. 3. **Visual-Inertial Tight Coupling Feature Optimization Algorithm**: - Utilizing the tight coupling of visual and inertial measurement units (IMU), combined with prior semantic information, designed an adaptive threshold motion consistency detection algorithm to further optimize the motion probability estimation of feature points and improve localization accuracy. ### Experimental Results Experimental results show that the proposed algorithm achieves good localization accuracy in highly dynamic environments while maintaining real-time performance. ### Conclusion By fusing visual, inertial, and semantic information, this paper proposes a real-time dynamic SLAM algorithm that effectively solves the problem of autonomous localization of mobile robots in dynamic environments, improving localization accuracy and real-time performance.