Visual SLAM in dynamic environments based on object detection

Yong-bao Ai,Ting Rui,Xiao-qiang Yang,Jia-lin He,Lei Fu,Jian-bin Li,Ming Lu
DOI: https://doi.org/10.1016/j.dt.2020.09.012
IF: 4.035
2021-10-01
Defence Technology
Abstract:A great number of visual simultaneous localization and mapping (VSLAM) systems need to assume static features in the environment. However, moving objects can vastly impair the performance of a VSLAM system which relies on the static-world assumption. To cope with this challenging topic, a real-time and robust VSLAM system based on ORB-SLAM2 for dynamic environments was proposed. To reduce the influence of dynamic content, we incorporate the deep-learning-based object detection method in the visual odometry, then the dynamic object probability model is added to raise the efficiency of object detection deep neural network and enhance the real-time performance of our system. Experiment with both on the TUM and KITTI benchmark dataset, as well as in a real-world environment, the results clarify that our method can significantly reduce the tracking error or drift, enhance the robustness, accuracy and stability of the VSLAM system in dynamic scenes.
engineering, multidisciplinary
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper primarily addresses the problem of Visual Simultaneous Localization and Mapping (VSLAM) in dynamic environments. Specifically: 1. **Existing Issues**: - Most existing VSLAM systems rely on the assumption of a static environment, i.e., they assume that the environment contains only static objects. - The presence of moving objects can severely affect the performance of these systems. 2. **Solution**: - A new method based on the ORB-SLAM2 framework combined with deep learning object detection technology is proposed. - A Dynamic Object Probability Model (DOP) is introduced to distinguish between dynamic and static regions in the scene. - Experiments have validated that this method improves robustness and accuracy in dynamic environments. 3. **Main Contributions**: - A new SLAM framework is proposed, which combines object detection to reduce the impact of moving objects on camera pose estimation and dense 3D point cloud mapping. - A new Dynamic Object Probability Model is developed to enhance the system's ability to separate dynamic objects in VSLAM. Through experiments on the TUM and KITTI benchmark datasets, as well as tests in real-world environments, it has been demonstrated that this method can significantly reduce tracking errors and drift, improving the robustness, accuracy, and stability of VSLAM systems.