MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting

Yan Song Hu,Nicolas Abboud,Muhammad Qasim Ali,Adam Srebrnjak Yang,Imad Elhajj,Daniel Asmar,Yuhao Chen,John S. Zelek
2024-09-20
Abstract:Real-time SLAM with dense 3D mapping is computationally challenging, especially on resource-limited devices. The recent development of 3D Gaussian Splatting (3DGS) offers a promising approach for real-time dense 3D reconstruction. However, existing 3DGS-based SLAM systems struggle to balance hardware simplicity, speed, and map quality. Most systems excel in one or two of the aforementioned aspects but rarely achieve all. A key issue is the difficulty of initializing 3D Gaussians while concurrently conducting SLAM. To address these challenges, we present Monocular GSO (MGSO), a novel real-time SLAM system that integrates photometric SLAM with 3DGS. Photometric SLAM provides dense structured point clouds for 3DGS initialization, accelerating optimization and producing more efficient maps with fewer Gaussians. As a result, experiments show that our system generates reconstructions with a balance of quality, memory efficiency, and speed that outperforms the state-of-the-art. Furthermore, our system achieves all results using RGB inputs. We evaluate the Replica, TUM-RGBD, and EuRoC datasets against current live dense reconstruction systems. Not only do we surpass contemporary systems, but experiments also show that we maintain our performance on laptop hardware, making it a practical solution for robotics, A/R, and other real-time applications.
Robotics,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve real - time high - precision and high - memory - efficiency dense 3D reconstruction on devices with limited resources. Specifically, the paper proposes a new monocular real - time photometric SLAM system - MGSO (Monocular GSO), which combines photometric SLAM and 3D Gaussian Splatting (3DGS) techniques, aiming to balance hardware simplicity, speed and map quality. Existing SLAM systems based on 3DGS are difficult to achieve optimality simultaneously in these aspects, especially when initializing 3D Gaussian points, it is difficult to perform SLAM operations simultaneously. MGSO initializes 3DGS by using photometric SLAM to provide a dense and structured point cloud, thereby accelerating the optimization process, generating a more efficient map, reducing the number of required Gaussian points, and finally achieving high - quality, memory - efficient and fast 3D reconstruction. The main contributions of the paper are: 1. Proposing a real - time dense SLAM system that can utilize the synergy between photometric SLAM and 3DGS. 2. The system can operate with only a monocular camera. 3. Experiments show that this system outperforms other dense SLAM systems in terms of speed, map quality and memory efficiency.