High-Speed Stereo Visual SLAM for Low-Powered Computing Devices

Ashish Kumar, Jaesik Park, Laxmidhar Behera
2024-10-05
Abstract: We present an accurate and GPU-accelerated Stereo Visual SLAM design called Jetson-SLAM. It exhibits frame-processing rates above 60FPS on NVIDIA's low-powered 10W Jetson-NX embedded computer and above 200FPS on desktop-grade 200W GPUs, even in stereo configuration and in the multiscale setting. Our contributions are threefold: (i) a Bounded Rectification technique that prevents many non-corner points from being tagged as corners in FAST detection, improving SLAM accuracy; (ii) a novel Pyramidal Culling and Aggregation (PyCA) technique that yields robust features while suppressing redundant ones at high speed by harnessing a GPU device. PyCA uses our new Multi-Location Per Thread (MLPT) culling strategy and Thread-Efficient Warp-Allocation (TEWA) scheme for the GPU, enabling Jetson-SLAM to achieve high accuracy and speed on embedded devices; (iii) the Jetson-SLAM library achieves resource efficiency through a data-sharing mechanism. Our experiments on three challenging datasets (KITTI, EuRoC, and KAIST-VIO) and two highly accurate SLAM backends (Full-BA and ICE-BA) show that Jetson-SLAM is the fastest available accurate and GPU-accelerated SLAM system (Fig. 1).
Robotics, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper aims to achieve high-speed, high-precision stereo visual SLAM (Simultaneous Localization and Mapping) on low-power computing devices. Specifically, it addresses the following challenges:

1. **Limited computing resources**: Low-power embedded devices (such as the NVIDIA Jetson-NX) have limited computing power, so existing SLAM systems perform poorly on them and are prone to frame-rate drops and tracking failures.
2. **Slow front-end processing**: In stereo visual SLAM, front-end processing (feature detection, extraction, matching, and stereo matching) is very time-consuming, especially in multi-scale settings. This limits the overall speed and real-time performance of the system.
3. **Low-quality feature points**: Existing feature detection methods (such as FAST) may misdetect non-corner points as corners, which reduces the precision of the SLAM system.

To solve these problems, the paper proposes a system named Jetson-SLAM. Its main contributions are:

1. **Bounded Rectification**: A modification of FAST feature detection that prevents non-corner points from being misdetected as corner points, thereby improving SLAM precision.
2. **Pyramidal Culling and Aggregation (PyCA)**: A new pyramid-based feature culling and aggregation technique that efficiently generates high-quality multi-scale features on the GPU while reducing the number of redundant features and improving processing speed.
3. **Front-end, intermediate-end, and back-end design**: A new intermediate-end component is introduced that is responsible for stereo matching, feature matching, and tracking. A data-sharing mechanism avoids memory-transfer overhead between the CPU and GPU, improving resource efficiency.
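To make the corner-misdetection issue concrete, the following is a minimal sketch of the standard FAST segment test that Bounded Rectification builds on. It is not the paper's rectified detector: the summary does not give those details. The function names, the default `threshold`, and the list-of-rows image format are assumptions made for illustration.

```python
# Offsets (dx, dy) of the 16-pixel Bresenham circle of radius 3 used by FAST.
CIRCLE = [(0, -3), (1, -3), (2, -2), (3, -1), (3, 0), (3, 1), (2, 2), (1, 3),
          (0, 3), (-1, 3), (-2, 2), (-3, 1), (-3, 0), (-3, -1), (-2, -2), (-1, -3)]


def longest_run(flags):
    """Length of the longest circular run of True values in `flags`."""
    if all(flags):
        return len(flags)
    best = run = 0
    for f in flags + flags:  # doubling the ring handles wrap-around
        run = run + 1 if f else 0
        best = max(best, run)
    return best


def is_fast_corner(img, x, y, threshold=20, arc=9):
    """Plain FAST segment test at (x, y).

    `img` is a list of rows of intensities (hypothetical format); the caller
    must keep (x, y) at least 3 pixels away from the image border.
    """
    center = img[y][x]
    ring = [img[y + dy][x + dx] for dx, dy in CIRCLE]
    brighter = [v > center + threshold for v in ring]
    darker = [v < center - threshold for v in ring]
    # A corner needs `arc` contiguous ring pixels all brighter or all darker.
    return longest_run(brighter) >= arc or longest_run(darker) >= arc
```

The test accepts any pixel with a long enough bright or dark arc around it, which is the criterion the paper's rectification step is described as tightening.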
Through these innovations, Jetson-SLAM can achieve a processing speed of over 60FPS on low-power embedded devices while maintaining high precision. Experimental results show that Jetson-SLAM performs better than existing SLAM systems on datasets such as KITTI, EuRoC, and KAIST-VIO.
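The general idea of suppressing redundant features, which the summary lists among the contributions, can be illustrated with a simple sequential sketch that keeps only the strongest detection in each image cell. This is a generic grid-culling example, not the paper's PyCA, MLPT, or TEWA schemes, which run on the GPU and across pyramid levels. The function name `cull_per_cell`, the `(x, y, score)` tuple format, and the `cell_size` default are assumptions.

```python
def cull_per_cell(keypoints, cell_size=32):
    """Keep the highest-scoring keypoint in each cell_size x cell_size cell.

    `keypoints` is an iterable of (x, y, score) tuples (hypothetical format).
    Returns the surviving keypoints sorted by (x, y, score).
    """
    best = {}
    for x, y, score in keypoints:
        cell = (int(x) // cell_size, int(y) // cell_size)
        # Replace the current winner only if this detection is stronger.
        if cell not in best or score > best[cell][2]:
            best[cell] = (x, y, score)
    return sorted(best.values())


# Two detections fall in cell (0, 0) and two in cell (1, 0);
# one survivor remains per cell.
detections = [(1, 1, 0.5), (5, 7, 0.9), (40, 3, 0.2), (33, 30, 0.1)]
survivors = cull_per_cell(detections)
```

Capping the output at one feature per cell keeps detections spread across the image and bounds the amount of matching work downstream, which is the kind of trade-off a culling stage targets.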