GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians

Dasong Gao,Peter Zhi Xuan Li,Vivienne Sze,Sertac Karaman

2024-09-14

Abstract:Constructing a high-fidelity representation of the 3D scene using a monocular camera can enable a wide range of applications on mobile devices, such as micro-robots, smartphones, and AR/VR headsets. On these devices, memory is often limited in capacity and its access often dominates the consumption of compute energy. Although Gaussian Splatting (GS) allows for high-fidelity reconstruction of 3D scenes, current GS-based SLAM is not memory efficient as a large number of past images is stored to retrain Gaussians for reducing catastrophic forgetting. These images often require two-orders-of-magnitude higher memory than the map itself and thus dominate the total memory usage. In this work, we present GEVO, a GS-based monocular SLAM framework that achieves comparable fidelity as prior methods by rendering (instead of storing) them from the existing map. Novel Gaussian initialization and optimization techniques are proposed to remove artifacts from the map and delay the degradation of the rendered images over time. Across a variety of environments, GEVO achieves comparable map fidelity while reducing the memory overhead to around 58 MBs, which is up to 94x lower than prior works.

Robotics

What problem does this paper attempt to address?

The paper aims to address the memory efficiency issues encountered when using monocular cameras for high-fidelity 3D scene reconstruction on mobile devices (such as micro-robots, smartphones, and AR/VR headsets). Specifically, existing SLAM methods based on Gaussian Splatting (GS) consume a significant amount of memory resources when storing a large number of past images to reduce catastrophic forgetting. The memory required for these images is often more than twice that of the map itself. Therefore, the paper proposes GEVO (Memory-Efficient Monocular Visual Odometry Using Gaussians), a monocular SLAM framework based on Gaussian point clouds, which significantly reduces memory overhead by rendering rather than storing past images to guide the optimization process. The main contributions of GEVO are: 1. **Occupancy-Preserving Initialization**: Reducing erroneous occlusions by identifying Gaussian point clouds in free regions. 2. **Consistency-Aware Optimization**: Proposing a two-stage optimization process, first locally optimizing newly inserted Gaussian point clouds, and then selectively optimizing parts of the global map that are inconsistent and visible in the current window, thereby reducing the impact of Incomplete Occlusion (IRO). Experimental results show that GEVO can achieve comparable map fidelity to existing methods in various environments while reducing memory overhead to approximately 58MB, up to 94 times less than previous work.

GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians

Design of an Enhanced Visual Odometry by Building and Matching Compressive Panoramic Landmarks Online

MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting

GSL-VO: A Geometric-Semantic Information Enhanced Lightweight Visual Odometry in Dynamic Environments

Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM

Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting

Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis

Compact 3D Gaussian Splatting For Dense Visual SLAM

LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting

Gaussian Splatting SLAM

MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements

RGBD GS-ICP SLAM

ES-Gaussian: Gaussian Splatting Mapping via Error Space-Based Gaussian Completion

Deep Visual Odometry with Adaptive Memory

MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization

Monocular Gaussian SLAM with Language Extended Loop Closure

GMMap: Memory-Efficient Continuous Occupancy Map Using Gaussian Mixture Model

GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure