D2SLAM: Semantic visual SLAM based on the Depth-related influence on object interactions for Dynamic environments

Ayman Beghdadi,Malik Mallem,Lotfi Beji

DOI: https://doi.org/10.48550/arXiv.2210.08647

2023-02-28

Abstract:Considering the scene's dynamics is the most effective solution to obtain an accurate perception of unknown environments for real vSLAM applications. Most existing methods attempt to address the non-rigid scene assumption by combining geometric and semantic approaches to determine dynamic elements that lack generalization and scene awareness. We propose a novel approach that overcomes these limitations by using scene-depth information to improve the accuracy of the localization from geometric and semantic modules. In addition, we use depth information to determine an area of influence of dynamic objects through an Object Interaction Module that estimates the state of both non-matched and non-segmented key points. The obtained results on TUM-RGBD dataset clearly demonstrate that the proposed method outperforms the state-of-the-art.

Robotics,Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The main objective of this paper is to address the accuracy issues of visual Simultaneous Localization and Mapping (vSLAM) in dynamic environments. Specifically, existing vSLAM methods typically rely on geometric and semantic approaches to identify dynamic elements when dealing with dynamic scenes. However, these methods have certain limitations, such as a lack of generalization ability and scene awareness. This paper proposes a new method, D2SLAM, which improves the estimation of keypoint states in geometric and semantic modules by incorporating depth information and introduces an Object Interaction Module (OIM) to evaluate the states of unmatched and unsegmented keypoints within the influence regions of dynamic objects. The main contributions are as follows: 1. **Proposed an efficient method**: This method outperforms the state-of-the-art on the dedicated TUM-RGBD dataset. 2. **Refinement of the probability function**: By introducing a probability function for keypoint state estimation in the geometric and semantic modules through adaptive depth-related thresholds and influence factors. 3. **Object Interaction Module (OIM)**: This module considers the influence regions of dynamic objects and can estimate the states of unmatched and unsegmented keypoints without requiring additional semantic categories. Experimental results show that this method significantly outperforms the state-of-the-art in terms of relative pose error (RPE) in dynamic environments and also achieves notable improvements in absolute trajectory error (ATE). Additionally, this method does not rely on extra semantic information and has a lower execution time.

D2SLAM: Semantic visual SLAM based on the Depth-related influence on object interactions for Dynamic environments

DAM-SLAM: depth attention module in a semantic visual SLAM based on objects interaction for dynamic environments

Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes

Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2

DM-SLAM: A Feature-Based SLAM System for Rigid Dynamic Scenes

DGS-SLAM: A Fast and Robust RGBD SLAM in Dynamic Environments Combined by Geometric and Semantic Information

Ds-Slam: A Semantic Visual Slam Towards Dynamic Environments

DMOT-SLAM: Visual SLAM in Dynamic Environments with Moving Object Tracking

Semantic stereo visual SLAM toward outdoor dynamic environments based on ORB-SLAM2

A visual dynamic-SLAM method based semantic segmentation and multi-view geometry

VDO-SLAM: A Visual Dynamic Object-aware SLAM System

SOF-SLAM: A Semantic Visual SLAM for Dynamic Environments

Towards Real-time Semantic RGB-D SLAM in Dynamic Environments

Semantics Aware Dynamic SLAM Based on 3D MODT

BY-SLAM: Dynamic Visual SLAM System Based on BEBLID and Semantic Information Extraction

MSeg-SLAM: A Semantic Visual SLAM System for Dynamic Scenes.

ESD-SLAM: An efficient semantic visual SLAM towards dynamic environments

V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting

Semantic visual SLAM in dynamic environment

RVD-SLAM: A Real-Time Visual SLAM Toward Dynamic Environments Based on Sparsely Semantic Segmentation and Outlier Prior