Abstract:In this work, we explore the possibility of training high-parameter 3D Gaussian splatting (3DGS) models on large-scale, high-resolution datasets. We design a general model parallel training method for 3DGS, named RetinaGS, which uses a proper rendering equation and can be applied to any scene and arbitrary distribution of Gaussian primitives. It enables us to explore the scaling behavior of 3DGS in terms of primitive numbers and training resolutions that were difficult to explore before and surpass previous state-of-the-art reconstruction quality. We observe a clear positive trend of increasing visual quality when increasing primitive numbers with our method. We also demonstrate the first attempt at training a 3DGS model with more than one billion primitives on the full MatrixCity dataset that attains a promising visual quality.

What problem does this paper attempt to address?

The main goal of this paper is to address the efficient training of high-parameter 3D Gaussian Splatting (3DGS) models on large-scale, high-resolution datasets. Specifically, the researchers designed a general model parallel training method called RetinaGS to overcome the computational and memory limitations encountered when processing large-scale scene reconstructions. The main contributions of the paper can be summarized as follows: 1. **Distributed Training Method**: The research team proposed a distributed training framework that can effectively allocate computational tasks to multiple GPUs, making it possible to train 3DGS models on large-scale, high-resolution datasets that are difficult to handle on a single GPU. 2. **RetinaGS Method**: By dividing the 3D space into a series of convex subspaces and assigning a portion of the Gaussian splatting (i.e., "splat") to each subspace, RetinaGS can achieve efficient distributed training while maintaining equivalence to single-GPU training. This method allows researchers to explore the training effects at larger scales of Gaussian splatting and higher resolutions. 3. **Scalability Verification**: The researchers demonstrated a significant improvement in reconstruction quality when increasing the number of Gaussian splats. Additionally, they showcased the first successful case of training on a city-level dataset (MatrixCity dataset) containing a large number of images using over one billion Gaussian splats. 4. **Experimental Results**: Validation on multiple high-resolution 3D reconstruction datasets showed that the 3DGS models trained with RetinaGS outperformed existing 3DGS methods on datasets such as Mip-NeRF360, Mega-NeRF, ScanNet++, and MatrixCity. 5. **Large-Scale Model Training**: The paper mentioned the use of 64 A100 GPUs for a 10-day training period, successfully training a 3DGS model with over one billion Gaussian splats on a dataset containing 141,652 images (totaling over 14.8 billion pixels). This scale had not been reported before. In summary, the method proposed in this paper addresses the challenges of training 3DGS models on large-scale, high-resolution datasets and achieves unprecedented visual quality and reconstruction accuracy.

RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians

CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians

EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation

GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

On Scaling Up 3D Gaussian Splatting Training

Taming 3DGS: High-Quality Radiance Fields with Limited Resources

A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets

HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting

CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes

Unbounded-GS: Extending 3D Gaussian Splatting with Hybrid Representation for Unbounded Large-Scale Scene Reconstruction

PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting

GaRField++: Reinforced Gaussian Radiance Fields for Large-Scale 3D Scene Reconstruction

VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction

DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus

LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors

Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering

GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module

GSDF: 3DGS Meets SDF for Improved Rendering and Reconstruction

SRGS: Super-Resolution 3D Gaussian Splatting