Abstract: Accurate and robust simultaneous localization and mapping (SLAM) systems are crucial for autonomous underwater vehicles (AUVs) to perform missions in unknown environments. However, directly applying deep learning-based SLAM methods to underwater environments poses challenges due to weak textures, image degradation, and the inability to accurately annotate keypoints. In this paper, a robust deep-learning visual SLAM system is proposed. First, a feature generator named UWNet is designed to address weak texture and image degradation problems and extract more accurate keypoint features and their descriptors. Further, the idea of knowledge distillation is introduced based on an improved underwater imaging physical model to train the network in a self-supervised manner. Finally, UWNet is integrated into ORB-SLAM3 to replace the traditional feature extractor. The extracted local and global features are utilized in the feature tracking and closed-loop detection modules, respectively. Experimental results on public datasets and self-collected pool datasets verify that the proposed system maintains high accuracy and robustness in complex scenarios.

What problem does this paper attempt to address?

### Problems the paper attempts to solve

This paper aims to achieve an accurate and robust visual SLAM (Simultaneous Localization and Mapping) system in weakly textured underwater environments. Specifically, it proposes a deep-learning-based visual SLAM system named RU-SLAM to address the following challenges:

1. **Weak-texture problem**: Underwater environments usually have few texture features, which makes it difficult for traditional hand-crafted feature methods (such as SIFT, ORB, and Shi-Tomasi) to extract consistent feature points and descriptors.
2. **Image degradation problem**: Underwater images often suffer from degradation due to turbidity, insufficient illumination, and similar effects, which impairs the detection and matching of feature points.
3. **Difficulty in keypoint annotation**: Poor underwater image quality makes it difficult to annotate keypoints accurately, which limits the application of supervised learning methods.

To solve these problems, the paper proposes the following innovations:

1. **Designed the UWNet feature generator**: UWNet combines Channel Attention (CA) and Spatial Attention (SA) modules with a Deformable Convolution (DCN) module to improve the accuracy of local feature points and descriptors.
2. **Introduced knowledge distillation and self-supervised learning**: Pseudo-underwater images are generated from an improved underwater imaging physical model, and UWNet is trained on them in a self-supervised manner, which sidesteps the need for accurate manual keypoint annotation.
3. **Integrated into ORB-SLAM3**: UWNet replaces the traditional feature extractor in ORB-SLAM3, achieving high accuracy and robustness in complex scenarios.

### Experimental verification

The paper reports experiments on public datasets and self-collected pool datasets to verify the performance of the RU-SLAM system. The results show that the system maintains high accuracy and robustness in low-light and weakly textured underwater environments.

### Summary

By proposing the RU-SLAM system, the paper addresses the problems of weak texture, image degradation, and difficult keypoint annotation that visual SLAM faces underwater, supporting the mission execution of Autonomous Underwater Vehicles (AUVs) in unknown environments.
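The pseudo-underwater image generation mentioned above can be illustrated with a minimal sketch. The paper's improved imaging model is not given in this summary, so the code below uses the widely used simplified underwater image formation model, I_c = J_c · t_c + B_c · (1 − t_c) with t_c = exp(−β_c · d), as a stand-in. The function names, the attenuation coefficients `beta`, and the background light `backlight` are illustrative assumptions, not values from the paper.

```python
import math

# Hypothetical per-channel (R, G, B) parameters, chosen only for illustration.
# Red is attenuated fastest in water, so it gets the largest coefficient.
DEFAULT_BETA = (0.40, 0.10, 0.05)       # attenuation per metre (assumed)
DEFAULT_BACKLIGHT = (0.05, 0.35, 0.45)  # bluish-green veiling light (assumed)


def attenuate_pixel(clear_rgb, depth_m, beta=DEFAULT_BETA, backlight=DEFAULT_BACKLIGHT):
    """Apply I_c = J_c * t_c + B_c * (1 - t_c), with t_c = exp(-beta_c * depth)."""
    degraded = []
    for j, beta_c, b_c in zip(clear_rgb, beta, backlight):
        t = math.exp(-beta_c * depth_m)  # transmission for this channel
        degraded.append(j * t + b_c * (1.0 - t))
    return tuple(degraded)


def synthesize_underwater(image, depth_map, beta=DEFAULT_BETA, backlight=DEFAULT_BACKLIGHT):
    """Degrade a clear image (rows of RGB tuples in [0, 1]) using a per-pixel depth map."""
    return [
        [attenuate_pixel(px, d, beta, backlight) for px, d in zip(row, depth_row)]
        for row, depth_row in zip(image, depth_map)
    ]


if __name__ == "__main__":
    clear = [[(1.0, 1.0, 1.0), (0.5, 0.5, 0.5)]]
    depth = [[5.0, 5.0]]  # metres from the camera
    print(synthesize_underwater(clear, depth))
```

Pairing each clear image with its synthetically degraded copy gives two views of the same scene with known correspondence. This is the kind of data a self-supervised or distillation-style training scheme could use without manual keypoint labels. The exact scheme RU-SLAM uses is not specified here.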

RU-SLAM: A Robust Deep-Learning Visual Simultaneous Localization and Mapping (SLAM) System for Weakly Textured Underwater Environments

Performance Analysis of Feature Extraction Methods Towards Underwater Vslam

Robust Indoor Localization and Map Matching Algorithm with Visual SLAM

Robust Inertial-aided Underwater Localization and Navigation based on Imaging Sonar Keyframes

An Underwater Multisensor Fusion Simultaneous Localization and Mapping System Based on Image Enhancement

An Underwater Localization Method Based on Visual SLAM for the Near-Bottom Environment

Enhancing Underwater SLAM Navigation and Perception: A Comprehensive Review of Deep Learning Integration

Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle

View-based Underwater SLAM Using a Stereo Camera

Underwater SLAM Algorithm Based on Image Sonar Salient Target Detection

A Visual–Inertial Pressure Fusion-Based Underwater Simultaneous Localization and Mapping System

Opti-Acoustic Semantic SLAM with Unknown Objects in Underwater Environments

SVIn2: An Underwater SLAM System using Sonar, Visual, Inertial, and Depth Sensor

BASL-AD SLAM: A Robust Deep-Learning Feature-Based Visual SLAM System With Adaptive Motion Model

Real-Time Visual SLAM for Autonomous Underwater Hull Inspection Using Visual Saliency

Changing management trends in penetrating colon trauma

A Robust Deep Learning Enhanced Monocular SLAM System for Dynamic Environments

DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features

An Improved Underwater Visual SLAM through Image Enhancement and Sonar Fusion

RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments