D2SLAM: Semantic visual SLAM based on the Depth-related influence on object interactions for Dynamic environments

Ayman Beghdadi,Malik Mallem,Lotfi Beji
DOI: https://doi.org/10.48550/arXiv.2210.08647
2023-02-28
Abstract:Considering the scene's dynamics is the most effective solution to obtain an accurate perception of unknown environments for real vSLAM applications. Most existing methods attempt to address the non-rigid scene assumption by combining geometric and semantic approaches to determine dynamic elements that lack generalization and scene awareness. We propose a novel approach that overcomes these limitations by using scene-depth information to improve the accuracy of the localization from geometric and semantic modules. In addition, we use depth information to determine an area of influence of dynamic objects through an Object Interaction Module that estimates the state of both non-matched and non-segmented key points. The obtained results on TUM-RGBD dataset clearly demonstrate that the proposed method outperforms the state-of-the-art.
Robotics,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main objective of this paper is to address the accuracy issues of visual Simultaneous Localization and Mapping (vSLAM) in dynamic environments. Specifically, existing vSLAM methods typically rely on geometric and semantic approaches to identify dynamic elements when dealing with dynamic scenes. However, these methods have certain limitations, such as a lack of generalization ability and scene awareness. This paper proposes a new method, D2SLAM, which improves the estimation of keypoint states in geometric and semantic modules by incorporating depth information and introduces an Object Interaction Module (OIM) to evaluate the states of unmatched and unsegmented keypoints within the influence regions of dynamic objects. The main contributions are as follows: 1. **Proposed an efficient method**: This method outperforms the state-of-the-art on the dedicated TUM-RGBD dataset. 2. **Refinement of the probability function**: By introducing a probability function for keypoint state estimation in the geometric and semantic modules through adaptive depth-related thresholds and influence factors. 3. **Object Interaction Module (OIM)**: This module considers the influence regions of dynamic objects and can estimate the states of unmatched and unsegmented keypoints without requiring additional semantic categories. Experimental results show that this method significantly outperforms the state-of-the-art in terms of relative pose error (RPE) in dynamic environments and also achieves notable improvements in absolute trajectory error (ATE). Additionally, this method does not rely on extra semantic information and has a lower execution time.