USD-SLAM: A Universal Visual SLAM Based on Large Segmentation Model in Dynamic Environments

Jingwei Wang,Yizhang Ren,Zhiwei Li,Xiaoming Xie,Zilong Chen,Tianyu Shen,Huaping Liu,Kunfeng Wang
DOI: https://doi.org/10.1109/lra.2024.3498781
IF: 5.2
2024-01-01
IEEE Robotics and Automation Letters
Abstract:Visual Simultaneous Localization and Mapping (SLAM) has been widely adopted in autonomous driving and robotics. While most SLAM systems operate effectively in static or low-dynamic environments, achieving precise pose estimation in diverse unknown dynamic environments continues to pose a significant challenge. This paper introduces an advanced universal visual SLAM system (USD-SLAM) that combines a universal large segmentation model with a 3D spatial motion state constraint module to accurately handle any dynamic objects present in the environment. Our system first employs a large segmentation model guided by precise prompts to identify movable regions accurately. Based on the identified movable object regions, 3D spatial motion state constraints are exploited to remove the moving object regions. Finally, the moving object regions are excluded for subsequent tracking, localization, and mapping, ensuring stable and high-precision pose estimation. Experimental results demonstrate that our method can robustly operate in various dynamic and static environments without additional training, providing higher localization accuracy compared to other advanced dynamic SLAM systems.
What problem does this paper attempt to address?