An Effective Stereo SLAM with High-Level Primitives in Underwater Environment

Shuo Xu,Teng Ma,Ye Li,Shuoshuo Ding,Jingxuan Gao,Jiahao Xia,Haodong Qi,Huayu Gu
DOI: https://doi.org/10.1088/1361-6501/ace645
2023-01-01
Abstract:Visual simultaneous localization and mapping (SLAM) algorithms face challenges in complex underwater scenarios, such as turbidity, dynamism, and low texture, where point features are unreliable and can lead to weakened or even failed systems. To overcome these issues, high-level object features are considered due to their accuracy and robustness. In this paper, we introduce an effective object-level SLAM method that employs a stereo camera to enhance the navigation robustness of autonomous underwater vehicles and generates a detailed semantic map. Point features and object features are integrated to serve the proposed approach. We begin by detecting 2D objects in images using a state-of-the-art neural network, followed by obtaining 3D objects described by the general model through the principle of multi-view geometry and eventually constructing semantic landmarks. To account for object data association, we present an object match method that takes into consideration the stereo camera characteristics in a single stereo frame and a filter-based approach to track the landmarks in odometry. Experiments are also conducted using the KITTI dataset and our sequences collected from the pool and coast. The evaluation results indicate that the proposed method can improve the performance of ORBSLAM2 in terms of both navigation robustness and mapping information in underwater scenarios.
What problem does this paper attempt to address?