SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field

Zetian Song,Wenhong Duan,Yuhuai Zhang,Shiqi Wang,Siwei Ma,Wen Gao
2024-02-26
Abstract:Representing the Neural Radiance Field (NeRF) with the explicit voxel grid (EVG) is a promising direction for improving NeRFs. However, the EVG representation is not efficient for storage and transmission because of the terrific memory cost. Current methods for compressing EVG mainly inherit the methods designed for neural network compression, such as pruning and quantization, which do not take full advantage of the spatial correlation of voxels. Inspired by prosperous digital image compression techniques, this paper proposes SPC-NeRF, a novel framework applying spatial predictive coding in EVG compression. The proposed framework can remove spatial redundancy efficiently for better compression performance.Moreover, we model the bitrate and design a novel form of the loss function, where we can jointly optimize compression ratio and distortion to achieve higher coding efficiency. Extensive experiments demonstrate that our method can achieve 32% bit saving compared to the state-of-the-art method VQRF on multiple representative test datasets, with comparable training time.
Computer Vision and Pattern Recognition,Multimedia
What problem does this paper attempt to address?
The paper primarily addresses the storage and transmission efficiency issues of Neural Radiance Fields (NeRF) models based on Explicit Voxel Grid (EVG) by proposing a new method called SPC-NeRF (Spatial Predictive Compression for Voxel Based Radiance Field). Specifically, the paper points out that although the EVG representation has the potential to improve the training and rendering speed of NeRF models, this representation generates a large number of parameters, leading to increased storage and computational burdens. Existing compression methods such as pruning and quantization fail to fully exploit the spatial correlation between voxels. To address the above issues, the authors propose the SPC-NeRF framework, which combines spatial predictive coding techniques to remove spatial redundancy in the voxel grid, thereby achieving more efficient compression performance. Additionally, the authors design a new loss function to model the bit rate, balancing model size and rendering distortion to achieve higher coding efficiency. Experimental results show that compared to the state-of-the-art method VQRF, SPC-NeRF can achieve a 32% bit saving on multiple representative test datasets while maintaining comparable training time. Through detailed analysis and comparison, the paper demonstrates the effectiveness and superiority of the proposed method.