Abstract:Simultaneous localization and mapping (SLAM) is a fundamental problem in robotics and computer vision. It involves the task of a robot or an autonomous system navigating an unknown environment, simultaneously creating a map of the surroundings, and accurately estimating its position within that map. While significant progress has been made in SLAM over the years, challenges still need to be addressed. One prominent issue is robustness and accuracy in dynamic environments, which can cause uncertainties and errors in the estimation process. Traditional methods using temporal information to differentiate static and dynamic objects have limitations in accuracy and applicability. Nowadays, many research trends have leaned towards utilizing deep learning-based methods which leverage the capabilities to handle dynamic objects, semantic segmentation, and motion estimation, aiming to improve accuracy and adaptability in complex scenes. This article proposed an approach to enhance monocular visual odometry's robustness and precision in dynamic environments. An enhanced algorithm using the semantic segmentation algorithm DeeplabV3+ is used to identify dynamic objects in the image and then apply the motion consistency check to remove feature points belonging to dynamic objects. The remaining static feature points are then used for feature matching and pose estimation based on ORB-SLAM2 using the Technical University of Munich (TUM) dataset. Experimental results show that our method outperforms traditional visual odometry methods in accuracy and robustness, especially in dynamic environments. By eliminating the influence of moving objects, our method improves the accuracy and robustness of visual odometry in dynamic environments. Compared to the traditional ORB-SLAM2, the results show that the system significantly reduces the absolute trajectory error and the relative pose error in dynamic scenes. Our approach has significantly improved the accuracy and robustness of the SLAM system's pose estimation.

USD-SLAM: A Universal Visual SLAM Based on Large Segmentation Model in Dynamic Environments

DM-SLAM: A Feature-Based SLAM System for Rigid Dynamic Scenes

RGB‐D SLAM with Moving Object Tracking in Dynamic Environments

Real-Time Visual-Inertial Localization Using Semantic Segmentation Towards Dynamic Environments

DGS-SLAM: A Fast and Robust RGBD SLAM in Dynamic Environments Combined by Geometric and Semantic Information

Dynamic SLAM: A Visual SLAM in Outdoor Dynamic Scenes

ADM-SLAM: Accurate and Fast Dynamic Visual SLAM with Adaptive Feature Point Extraction, Deeplabv3pro, and Multi-View Geometry

Visual SLAM in dynamic environments based on object detection

DMS-SLAM: A General Visual SLAM System for Dynamic Scenes with Multiple Sensors

DRV-SLAM: An Adaptive Real-Time Semantic Visual SLAM Based on Instance Segmentation Toward Dynamic Environments

CD-SLAM:A Real-Time Stereo Visual-Inertial SLAM for Complex Dynamic Environments with Semantic and Geometric Information

MISD-SLAM: Multimodal Semantic SLAM for Dynamic Environments

Ds-Slam: A Semantic Visual Slam Towards Dynamic Environments

LOCALIZATION AND MAPPING IN DYNAMIC ENVIRONMENT USING MOVING OBJECTS SEGMENTATION FOR AUTONOMOUS DRIVING

DMOT-SLAM: Visual SLAM in Dynamic Environments with Moving Object Tracking

VDO-SLAM: A Visual Dynamic Object-aware SLAM System

RLD-SLAM: A Robust Lightweight VI-SLAM for Dynamic Environments Leveraging Semantics and Motion Information

A visual dynamic-SLAM method based semantic segmentation and multi-view geometry

SIIS-SLAM: A Vision SLAM Based on Sequential Image Instance Segmentation

Semantic visual simultaneous localization and mapping (SLAM) using deep learning for dynamic scenes