Abstract:Simultaneous localization and mapping (SLAM) has emerged as a critical technology enabling robots to navigate in unknown environments, drawing extensive attention within the robotics research community. However, traditional visual SLAM ignores the presence of dynamic objects in indoor scenes, and dynamic point features of dynamic objects can lead to incorrect data correlation, making the traditional visual SLAM is difficult to accurately estimate the camera's pose when the objects in the scenes are moving. Using only point features cannot fully extract geometric information in dynamic indoor scenes, reducing the system's robustness. To solve this problem, we develop a RGB-D SLAM system called DIG-SLAM. Firstly, the objects' contour regions are extracted using the YOLOv7 instance segmentation method, serving as a prerequisite for determining dynamic objects and constructing a semantic information map. Meanwhile, the line features are extracted using the line segment detector (LSD) algorithm, and the redundant line features are optimized via K-means clustering. Secondly, moving consistency checks combined with instance partitioning determine dynamic regions, and the point and line features of the dynamic regions are removed. Finally, the combination of static line features and point features optimizes the camera pose. Meanwhile, a static semantic octree map is created to provide richer and higher-level scene understanding and perception capabilities for robots or autonomous systems. The experimental results on the Technische Universität München (TUM) dataset show that the average absolute trajectory error of the developed DIG-SLAM is reduced by 28.68% compared with the dynamic semantic SLAM (DS-SLAM). Compared with other dynamic SLAM methods, the proposed system shows better camera pose estimation accuracy and system's robustness in dynamic indoor environments and better map building in real indoor scenes.

ATY-SLAM: A Visual Semantic SLAM for Dynamic Indoor Environments.

VSLAM Optimization Method in Dynamic Scenes Based on YOLO-Fastest

YPD-SLAM: A Real-Time VSLAM System for Handling Dynamic Indoor Environments

YDD-SLAM: Indoor Dynamic Visual SLAM Fusing YOLOv5 with Depth Information

Study on Slam Algorithm Based on Object Detection in Dynamic Scene

AHY-SLAM: Toward Faster and More Accurate Visual SLAM in Dynamic Scenes Using Homogenized Feature Extraction and Object Detection Method

YG-SLAM: GPU-Accelerated RGBD-SLAM Using YOLOv5 in a Dynamic Environment

ADM-SLAM: Accurate and Fast Dynamic Visual SLAM with Adaptive Feature Point Extraction, Deeplabv3pro, and Multi-View Geometry

Dynamic SLAM: A Visual SLAM in Outdoor Dynamic Scenes

A Real-Time VSLAM Based on Deep Features and Object Detection for Dynamic Environments

Real-time Visual SLAM based YOLO-Fastest for Dynamic Scenes

RGB-D Visual SLAM Based on Yolov4-Tiny in Indoor Dynamic Environment

AFO-SLAM: an improved visual SLAM in dynamic scenes using acceleration of feature extraction and object detection

Real-Time SLAM Based on Dynamic Feature Point Elimination in Dynamic Environment

DIG-SLAM: An Accurate RGB-D SLAM Based on Instance Segmentation and Geometric Clustering for Dynamic Indoor Scenes

DOTF-SLAM: Real-Time Dynamic SLAM Using Dynamic Odject Tracking and Key-Point Filtering

Visual Semantic SLAM Based on Examination of Moving Consistency in Dynamic Scenes.

RVD-SLAM: A Real-Time Visual SLAM Toward Dynamic Environments Based on Sparsely Semantic Segmentation and Outlier Prior

A Lightweight Visual Simultaneous Localization and Mapping Method with a High Precision in Dynamic Scenes

Visual SLAM in dynamic environments based on object detection

YVG‐SLAM: Dynamic Feature Removal SLAM Algorithm Without A Priori Assumptions Based on Object Detection and View Geometry