Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM

Yan Song Hu, Dayou Mao, Yuhao Chen, John Zelek
2024-08-07
Abstract: Initial applications of 3D Gaussian Splatting (3DGS) in Visual Simultaneous Localization and Mapping (VSLAM) demonstrate the generation of high-quality volumetric reconstructions from monocular video streams. However, despite these promising advancements, current 3DGS integrations have reduced tracking performance and lower operating speeds compared to traditional VSLAM. To address these issues, we propose integrating 3DGS with Direct Sparse Odometry, a monocular photometric SLAM system. We have done preliminary experiments showing that using Direct Sparse Odometry point cloud outputs, as opposed to standard structure-from-motion methods, significantly shortens the training time needed to achieve high-quality renders. Reducing 3DGS training time enables the development of 3DGS-integrated SLAM systems that operate in real-time on mobile hardware. These promising initial findings suggest further exploration is warranted in combining traditional VSLAM systems with 3DGS.
Robotics, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses two weaknesses of 3D Gaussian Splatting (3DGS) when it is used in Visual Simultaneous Localization and Mapping (VSLAM): reduced tracking performance and lower operating speed. Although 3DGS can generate high-quality volumetric reconstructions from monocular video, current integrations fall short of traditional VSLAM methods in both tracking accuracy and real-time performance. To close this gap, the authors propose combining 3DGS with Direct Sparse Odometry (DSO), a monocular photometric SLAM system. DSO tracks high-gradient pixels instead of feature points and therefore produces denser point clouds, which are then used as input to 3DGS. The authors also modify DSO to output extra points that do not participate in pose estimation but increase the density of the point cloud. Preliminary experiments show that using the modified DSO output as input, rather than the output of standard structure-from-motion methods, significantly shortens the 3DGS training time needed to reach high-quality renders, especially in the early stages of training. Shorter training time is a step toward 3DGS-integrated VSLAM systems that run in real time on mobile hardware.
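The point-selection idea described above can be illustrated with a small sketch. It is not the authors' implementation and does not reproduce DSO's actual selection logic. It only shows, under simplifying assumptions, how pixels might be split into a high-gradient set used for tracking and a lower-gradient set of extra points that only densify the map. The function names and both thresholds (`track_thresh`, `extra_thresh`) are hypothetical.

```python
from typing import List, Tuple

Image = List[List[float]]   # grayscale intensities, indexed as img[y][x]
Pixel = Tuple[int, int]     # (x, y) coordinates


def gradient_magnitude(img: Image, x: int, y: int) -> float:
    """Central-difference gradient magnitude at an interior pixel (x, y)."""
    gx = (img[y][x + 1] - img[y][x - 1]) / 2.0
    gy = (img[y + 1][x] - img[y - 1][x]) / 2.0
    return (gx * gx + gy * gy) ** 0.5


def select_points(
    img: Image, track_thresh: float, extra_thresh: float
) -> Tuple[List[Pixel], List[Pixel]]:
    """Split interior pixels into tracking points and extra map-only points.

    Assumption for illustration: pixels at or above `track_thresh` would be
    used for pose estimation, while pixels between `extra_thresh` and
    `track_thresh` are kept only to make the point cloud denser.
    """
    tracked: List[Pixel] = []
    extra: List[Pixel] = []
    # Skip the one-pixel border, where central differences are undefined.
    for y in range(1, len(img) - 1):
        for x in range(1, len(img[0]) - 1):
            g = gradient_magnitude(img, x, y)
            if g >= track_thresh:
                tracked.append((x, y))
            elif g >= extra_thresh:
                extra.append((x, y))
    return tracked, extra


if __name__ == "__main__":
    # Synthetic image: a weak ramp followed by a strong vertical edge.
    image = [[0, 0, 10, 100, 100, 100] for _ in range(5)]
    tracked, extra = select_points(image, track_thresh=40.0, extra_thresh=4.0)
    print("tracking points:", tracked)
    print("extra map-only points:", extra)
```

In this toy image the strong edge yields the tracking points and the weak ramp yields the extra points. Both sets together would form the denser point cloud passed to 3DGS, while only the first set would influence pose estimation.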