MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes

Xinjie Zhang, Zhening Liu, Yifan Zhang, Xingtong Ge, Dailan He, Tongda Xu, Yan Wang, Zehong Lin, Shuicheng Yan, Jun Zhang
2024-10-17
Abstract: 4D Gaussian Splatting (4DGS) has recently emerged as a promising technique for capturing complex dynamic 3D scenes with high fidelity. It utilizes a 4D Gaussian representation and a GPU-friendly rasterizer, enabling rapid rendering speeds. Despite its advantages, 4DGS faces significant challenges, notably the requirement of millions of 4D Gaussians, each with extensive associated attributes, leading to substantial memory and storage cost. This paper introduces a memory-efficient framework for 4DGS. We streamline the color attribute by decomposing it into a per-Gaussian direct color component with only 3 parameters and a shared lightweight alternating current color predictor. This approach eliminates the need for spherical harmonics coefficients, which typically involve up to 144 parameters in classic 4DGS, thereby creating a memory-efficient 4D Gaussian representation. Furthermore, we introduce an entropy-constrained Gaussian deformation technique that uses a deformation field to expand the action range of each Gaussian and integrates an opacity-based entropy loss to limit the number of Gaussians, thus forcing our model to use as few Gaussians as possible to fit a dynamic scene well. With simple half-precision storage and zip compression, our framework achieves a storage reduction of approximately 190$\times$ and 125$\times$ on the Technicolor and Neural 3D Video datasets, respectively, compared to the original 4DGS. Meanwhile, it maintains comparable rendering speeds and scene representation quality, setting a new standard in the field.
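The "half-precision storage and zip compression" step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the attribute layout (1000 Gaussians with 10 values each), the random values, and the helper name `pack_attributes` are hypothetical, and zlib's DEFLATE stands in for whatever zip tool was actually used.

```python
import random
import struct
import zlib


def pack_attributes(values, fmt_char):
    """Pack floats as little-endian float32 ('f') or float16 ('e') bytes."""
    return struct.pack(f"<{len(values)}{fmt_char}", *values)


random.seed(0)
# Hypothetical attribute buffer: 1000 Gaussians x 10 values, all in [-1, 1].
values = [random.uniform(-1.0, 1.0) for _ in range(1000 * 10)]

raw_fp32 = pack_attributes(values, "f")   # 4 bytes per value
raw_fp16 = pack_attributes(values, "e")   # 2 bytes per value (lossy rounding)
zipped_fp16 = zlib.compress(raw_fp16, 9)  # lossless DEFLATE on top of fp16

print("float32 bytes:      ", len(raw_fp32))
print("float16 bytes:      ", len(raw_fp16))
print("float16 + zlib bytes:", len(zipped_fp16))
```

Casting to half precision halves the raw size by construction. The extra gain from compression depends on how regular the stored values are, so the random data here says nothing about the ratios reported in the abstract.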
Computer Vision and Pattern Recognition, Graphics
What problem does this paper attempt to address?
The paper addresses the storage and memory overhead of 4D Gaussian Splatting (4DGS) in dynamic scene reconstruction. 4DGS captures complex dynamic 3D scenes with high fidelity and renders them quickly by combining a 4D Gaussian representation with a GPU-friendly rasterizer, but it needs millions of 4D Gaussians to represent a dynamic scene, which leads to very large memory and storage costs. For example, a birthday scene requires as many as 13 million Gaussians and occupies approximately 7.79 GB of storage, a serious obstacle for practical applications, especially on resource-constrained devices. The paper therefore proposes a memory-efficient 4D Gaussian Splatting framework (MEGA) that reduces storage requirements in two ways: it lowers the number of Gaussians needed, and it shrinks the attributes stored for each Gaussian, while keeping rendering quality and speed comparable to the original 4DGS.
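The figures above can be turned into a rough per-Gaussian budget. The sketch below uses only numbers stated in the text (13 million Gaussians, 7.79 GB, 144 spherical harmonics parameters, 3 direct color parameters). Two points are assumptions: that the original model stores attributes as 4-byte floats, and whether "GB" means 2^30 or 10^9 bytes. The function name `bytes_per_gaussian` is illustrative.

```python
NUM_GAUSSIANS = 13_000_000  # from the text
STORAGE_GB = 7.79           # from the text
SH_PARAMS = 144             # spherical harmonics color parameters in classic 4DGS
DC_PARAMS = 3               # per-Gaussian direct color parameters in MEGA


def bytes_per_gaussian(total_gb, count, binary_units=True):
    """Average storage per Gaussian; binary_units selects 2**30 vs 10**9 bytes."""
    unit = 1024 ** 3 if binary_units else 10 ** 9
    return total_gb * unit / count


# Assumption: float32 (4 bytes) for the original model, float16 (2 bytes) for MEGA.
sh_bytes_fp32 = SH_PARAMS * 4
dc_bytes_fp16 = DC_PARAMS * 2

print("bytes per Gaussian (GiB):", round(bytes_per_gaussian(STORAGE_GB, NUM_GAUSSIANS)))
print("bytes per Gaussian (GB): ", round(bytes_per_gaussian(STORAGE_GB, NUM_GAUSSIANS, False)))
print("color bytes, 144 x fp32: ", sh_bytes_fp32)
print("color bytes, 3 x fp16:   ", dc_bytes_fp16)
```

Under either unit convention the average comes to roughly 600 to 640 bytes per Gaussian, so 576 bytes of float32 color coefficients would account for most of it. The per-Gaussian color cost drops to 6 bytes in this sketch, but that figure leaves out the shared color predictor, whose fixed size is not given in the text.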