RU-SLAM: A Robust Deep-Learning Visual Simultaneous Localization and Mapping (SLAM) System for Weakly Textured Underwater Environments

Zhuo Wang, Qin Cheng, Xiaokai Mu
DOI: https://doi.org/10.3390/s24061937
IF: 3.9
2024-03-19
Sensors
Abstract: Accurate and robust simultaneous localization and mapping (SLAM) systems are crucial for autonomous underwater vehicles (AUVs) to perform missions in unknown environments. However, directly applying deep learning-based SLAM methods to underwater environments poses challenges due to weak textures, image degradation, and the inability to accurately annotate keypoints. In this paper, a robust deep-learning visual SLAM system is proposed. First, a feature generator named UWNet is designed to address weak texture and image degradation problems and extract more accurate keypoint features and their descriptors. Further, the idea of knowledge distillation is introduced based on an improved underwater imaging physical model to train the network in a self-supervised manner. Finally, UWNet is integrated into ORB-SLAM3 to replace the traditional feature extractor. The extracted local and global features are utilized in the feature tracking and closed-loop detection modules, respectively. Experimental results on public datasets and self-collected pool datasets verify that the proposed system maintains high accuracy and robustness in complex scenarios.
Engineering, Electrical & Electronic; Chemistry, Analytical; Instruments & Instrumentation
What problem does this paper attempt to address?
### Problems the paper attempts to solve

This paper aims to build an accurate and robust visual SLAM (simultaneous localization and mapping) system for weakly textured underwater environments. Specifically, it proposes a deep-learning-based visual SLAM system named RU-SLAM to address the following challenges:

1. **Weak texture**: Underwater scenes usually contain few texture features, which makes it difficult for methods based on hand-crafted features (such as SIFT, ORB, and Shi-Tomasi) to extract consistent feature points and descriptors.
2. **Image degradation**: Underwater images are often degraded by turbidity, insufficient illumination, and similar effects, which impairs the detection and matching of feature points.
3. **Difficulty of keypoint annotation**: Because underwater image quality is poor, keypoints are hard to annotate accurately, which limits the use of supervised learning methods.

To solve these problems, the paper proposes the following innovations:

1. **The UWNet feature generator**: UWNet combines Channel Attention (CA) and Spatial Attention (SA) modules with a Deformable Convolution (DCN) module to improve the accuracy of local feature points and descriptors.
2. **Knowledge distillation and self-supervised learning**: Pseudo-underwater images are generated from an improved underwater imaging physical model, and UWNet is trained on them in a self-supervised manner, which removes the need for accurate manual keypoint annotation.
3. **Integration into ORB-SLAM3**: UWNet replaces the traditional feature extractor in ORB-SLAM3, with the aim of maintaining high accuracy and robustness in complex scenarios.

### Experimental verification

The paper reports experiments on multiple public datasets and a self-collected pool dataset to verify the performance of the RU-SLAM system. The results indicate that the system maintains high accuracy and robustness in low-light and weakly textured underwater environments.

### Summary

With RU-SLAM, the paper addresses the weak texture, image degradation, and keypoint annotation problems that visual SLAM faces underwater, supporting mission execution by autonomous underwater vehicles (AUVs) in unknown environments.
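
### Illustrative sketch: synthesizing a pseudo-underwater image

The summary above says that pseudo-underwater images are generated from an improved underwater imaging physical model, but it does not give the model itself. As a rough illustration only, the sketch below uses the widely cited simplified image formation model, I_c = J_c · t_c + B_c · (1 − t_c) with t_c = exp(−β_c · d), which may differ from the paper's improved version. The function names `attenuate` and `synthesize_underwater`, the attenuation coefficients `beta`, and the background light `background` are all hypothetical and chosen for demonstration.

```python
import math


def attenuate(clean_rgb, depth_m, beta, background):
    """Degrade one RGB pixel with the simplified underwater formation model.

    I_c = J_c * t_c + B_c * (1 - t_c), where t_c = exp(-beta_c * depth_m).
    All colour values are floats in [0, 1]; depth_m is the scene distance in metres.
    """
    degraded = []
    for j, beta_c, bg_c in zip(clean_rgb, beta, background):
        t = math.exp(-beta_c * depth_m)  # per-channel transmission
        degraded.append(j * t + bg_c * (1.0 - t))
    return tuple(degraded)


def synthesize_underwater(
    image,
    depth,
    beta=(0.40, 0.12, 0.08),        # assumed values: red attenuates fastest
    background=(0.05, 0.35, 0.45),  # assumed blue-green background light
):
    """Apply the model to an image given as rows of (r, g, b) tuples.

    `depth` has the same row/column layout and holds one distance per pixel.
    """
    return [
        [attenuate(px, d, beta, background) for px, d in zip(img_row, depth_row)]
        for img_row, depth_row in zip(image, depth)
    ]


if __name__ == "__main__":
    # A 1x2 white image: one pixel close to the camera, one farther away.
    clean = [[(1.0, 1.0, 1.0), (1.0, 1.0, 1.0)]]
    distances = [[0.5, 4.0]]
    for row in synthesize_underwater(clean, distances):
        print([tuple(round(c, 3) for c in px) for px in row])
```

Under these assumptions, a clean image paired with its degraded copy shares the same keypoint locations, which is the kind of correspondence a self-supervised or distillation-style training setup can exploit without manual annotation.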