Lightweight Predictive 3D Gaussian Splats

Junli Cao,Vidit Goel,Chaoyang Wang,Anil Kag,Ju Hu,Sergei Korolev,Chenfanfu Jiang,Sergey Tulyakov,Jian Ren
2024-06-28
Abstract:Recent approaches representing 3D objects and scenes using Gaussian splats show increased rendering speed across a variety of platforms and devices. While rendering such representations is indeed extremely efficient, storing and transmitting them is often prohibitively expensive. To represent large-scale scenes, one often needs to store millions of 3D Gaussians, occupying gigabytes of disk space. This poses a very practical limitation, prohibiting widespread adoption.Several solutions have been proposed to strike a balance between disk size and rendering quality, noticeably reducing the visual quality. In this work, we propose a new representation that dramatically reduces the hard drive footprint while featuring similar or improved quality when compared to the standard 3D Gaussian splats. When compared to other compact solutions, ours offers higher quality renderings with significantly reduced storage, being able to efficiently run on a mobile device in real-time. Our key observation is that nearby points in the scene can share similar representations. Hence, only a small ratio of 3D points needs to be stored. We introduce an approach to identify such points which are called parent points. The discarded points called children points along with attributes can be efficiently predicted by tiny MLPs.
Graphics,Artificial Intelligence
What problem does this paper attempt to address?
This paper proposes a solution to the problem of low storage efficiency in 3D object and scene representation. Current methods, such as Gaussian Splats-based 3D representation, have improved rendering speed but come with high storage and transmission costs. In this paper, the authors introduce a lightweight predictive 3D Gaussian representation method that significantly reduces disk storage occupancy while maintaining or improving the quality compared to standard 3D Gaussian points. Compared to existing compact solutions, this method offers higher quality rendering with significantly reduced storage requirements, making it suitable for real-time operation on mobile devices. The authors observe that points in the scene can share similar representations, so only a small number of "parent points" need to be stored, while other "child points" can be predicted using small multi-layer perceptrons (MLPs). This approach allows for storing point clouds at lower densities while reducing attribute storage, thus reducing storage requirements. Through the optimization process, unimportant parent points can be removed along with all their child points to accommodate certain areas of complex geometry. Experimental results show significant reductions in storage occupancy compared to standard Gaussian points, compact Gaussian points, and other lightweight solutions, while also improving image reconstruction quality and peak signal-to-noise ratio (PSNR). The paper also compares with other methods, demonstrating that its model size is smaller while maintaining high quality, making it suitable for real-time operation on resource-limited devices.