Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation

Minye Wu, Tinne Tuytelaars
2024-08-19
Abstract: Recent advancements in photo-realistic novel view synthesis have been significantly driven by 3D Gaussian Splatting (3DGS). Nevertheless, the explicit nature of 3DGS data entails considerable storage requirements, highlighting a pressing need for more efficient data representations. To address this, we present Implicit Gaussian Splatting (IGS), a hybrid model that integrates explicit point clouds with implicit feature embeddings through a multi-level tri-plane architecture. This architecture uses 2D feature grids at different resolutions across its levels, which provides a continuous representation of the spatial domain and strengthens spatial correlations among Gaussian primitives. Building upon this foundation, we introduce a level-based progressive training scheme that incorporates explicit spatial regularization. This scheme exploits spatial correlations to improve both the rendering quality and the compactness of the IGS representation. Furthermore, we propose a compression pipeline tailored to both point clouds and 2D feature grids, which accounts for the entropy variations across levels. Extensive experimental evaluations demonstrate that our algorithm can deliver high-quality rendering using only a few MB, effectively balancing storage efficiency and rendering fidelity, and yielding results that are competitive with the state of the art.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the problem of storing 3D scenes compactly while maintaining high-quality rendering. Existing 3D Gaussian Splatting (3DGS) methods perform well in real-time rendering, but their explicit data representation requires a large amount of storage. To solve this problem, the authors propose a hybrid representation, Implicit Gaussian Splatting (IGS), which combines explicit point clouds with implicit feature embeddings. A multi-level tri-plane architecture, built from 2D feature grids at several resolutions, strengthens the spatial correlation among Gaussian primitives. A level-based progressive training scheme with explicit spatial regularization and a dedicated compression pipeline for the point clouds and feature grids then reduce storage requirements while preserving rendering quality. According to the reported experiments, IGS delivers high-quality rendering using only a few MB and is competitive with the state of the art.
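The multi-level tri-plane idea can be illustrated with a small sketch. The NumPy code below shows one plausible way to look up an implicit feature vector for each explicit point: the point is projected onto three axis-aligned planes (xy, xz, yz), each plane is sampled by bilinear interpolation, and the results from several resolution levels are combined. The function names, the random grid initialization, the summation across planes, and the concatenation across levels are all assumptions made for illustration; the paper's actual aggregation, decoder, and training procedure are not reproduced here.

```python
import numpy as np


def bilinear_sample(grid, uv):
    """Sample an (H, W, C) grid at continuous coords uv in [0, 1]^2 -> (N, C)."""
    h, w, _ = grid.shape
    # Map normalized coordinates to continuous pixel positions.
    x = uv[:, 0] * (w - 1)
    y = uv[:, 1] * (h - 1)
    # Index of the top-left neighbour, clipped so x0 + 1 and y0 + 1 stay in range.
    x0 = np.clip(np.floor(x).astype(int), 0, w - 2)
    y0 = np.clip(np.floor(y).astype(int), 0, h - 2)
    fx = (x - x0)[:, None]
    fy = (y - y0)[:, None]
    top = grid[y0, x0] * (1 - fx) + grid[y0, x0 + 1] * fx
    bottom = grid[y0 + 1, x0] * (1 - fx) + grid[y0 + 1, x0 + 1] * fx
    return top * (1 - fy) + bottom * fy


def make_triplanes(resolutions, channels, seed=0):
    """Hypothetical helper: one random tri-plane (xy, xz, yz) per resolution level."""
    rng = np.random.default_rng(seed)
    return [
        {name: rng.normal(scale=0.1, size=(r, r, channels))
         for name in ("xy", "xz", "yz")}
        for r in resolutions
    ]


def query_features(levels, points):
    """Look up features for (N, 3) points in [0, 1]^3 -> (N, num_levels * C).

    Assumption: features are summed over the three planes of a level and
    concatenated across levels.
    """
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    per_level = []
    for planes in levels:
        feat = (
            bilinear_sample(planes["xy"], np.stack([x, y], axis=1))
            + bilinear_sample(planes["xz"], np.stack([x, z], axis=1))
            + bilinear_sample(planes["yz"], np.stack([y, z], axis=1))
        )
        per_level.append(feat)
    return np.concatenate(per_level, axis=1)


# Three levels (coarse to fine), four channels each.
levels = make_triplanes(resolutions=(16, 32, 64), channels=4)
points = np.random.default_rng(1).uniform(size=(5, 3))
features = query_features(levels, points)
print(features.shape)  # (5, 12)
```

Because every lookup interpolates between grid cells, nearby points receive similar features, which is one way to read the abstract's claim about a continuous spatial representation and stronger correlations among neighbouring Gaussians. In a full system, a decoder would map these features to per-Gaussian attributes; that step is omitted here.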