GaRField++: Reinforced Gaussian Radiance Fields for Large-Scale 3D Scene Reconstruction

Hanyue Zhang, Zhiliu Yang, Xinhe Zuo, Yuxin Tong, Ying Long, Chen Liu
2024-09-24
Abstract: This paper proposes a novel framework for large-scale scene reconstruction based on 3D Gaussian splatting (3DGS), aiming to address the scalability and accuracy challenges faced by existing methods. To tackle the scalability issue, we split the large scene into multiple cells, and the candidate point cloud and camera views of each cell are correlated through visibility-based camera selection and progressive point-cloud extension. To reinforce the rendering quality, three improvements are made over vanilla 3DGS: a ray-Gaussian intersection strategy and a novel Gaussian density control for learning efficiency; an appearance decoupling module based on a ConvKAN network to handle uneven lighting conditions in large-scale scenes; and a refined final loss combining the color loss, the depth distortion loss, and the normal consistency loss. Finally, a seamless stitching procedure merges the individual Gaussian radiance fields for novel view synthesis across different cells. Evaluation on the Mill19, Urban3D, and MatrixCity datasets shows that our method consistently generates higher-fidelity rendering results than state-of-the-art methods for large-scale scene reconstruction. We further validate the generalizability of the proposed approach by rendering self-collected video clips recorded by a commercial drone.
Computer Vision and Pattern Recognition, Artificial Intelligence, Robotics
What problem does this paper attempt to address?
The paper aims to address the challenges of scalability and accuracy in large-scale 3D scene reconstruction. Specifically:

- **Scalability Issue**: Existing methods struggle to scale to large scenes. To address this, the paper builds on 3D Gaussian Splatting (3DGS): it divides a large scene into multiple cells and uses visibility-based camera selection and progressive point-cloud extension to associate candidate point clouds and camera views with each cell.
- **Rendering Quality Enhancement**: The paper proposes three improvements over vanilla 3DGS:
  - A ray-Gaussian intersection strategy and a novel Gaussian density control strategy, aimed at learning efficiency;
  - An appearance decoupling module based on a convolutional Kolmogorov-Arnold network (ConvKAN) to handle appearance differences caused by uneven lighting conditions in large-scale scenes;
  - A refined final loss function that combines a color loss, a depth distortion loss, and a normal consistency loss.
- **Seamless Stitching**: The Gaussian radiance fields of the individual cells are stitched together seamlessly, which enables novel view synthesis across cells.

Experiments on the Mill19, Urban3D, and MatrixCity datasets show higher-fidelity rendering than state-of-the-art methods for large-scale scene reconstruction, and the method's generalization is demonstrated on self-collected video clips recorded by a commercial drone.
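
To make the cell partitioning and visibility-based camera selection described above more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a regular grid over the x-y extent of the point cloud, pinhole cameras given as hypothetical `(K, R, t)` tuples with a world-to-camera convention, and a made-up rule that a camera is kept for a cell when at least `threshold` of the cell's points project inside its image. The function names, the grid layout, and the threshold value are all illustrative assumptions; the summary does not specify the paper's actual criteria.

```python
import numpy as np


def assign_points_to_cells(points, grid=(2, 2)):
    """Return a flat cell id per point for a regular grid over the x-y extent.

    The regular grid is an assumption made for illustration only.
    """
    xy = points[:, :2]
    lo, hi = xy.min(axis=0), xy.max(axis=0)
    span = np.maximum(hi - lo, 1e-9)  # avoid division by zero on flat extents
    shape = np.asarray(grid)
    idx = np.floor((xy - lo) / span * shape).astype(int)
    idx = np.minimum(idx, shape - 1)  # points on the upper boundary join the last cell
    return idx[:, 0] * grid[1] + idx[:, 1]


def visible_fraction(points, K, R, t, width, height):
    """Fraction of points that lie in front of the camera and project inside the image."""
    if len(points) == 0:
        return 0.0
    cam = points @ R.T + t            # world -> camera coordinates
    front = cam[:, 2] > 1e-6          # keep only points with positive depth
    uvw = cam[front] @ K.T            # pinhole projection (homogeneous pixels)
    uv = uvw[:, :2] / uvw[:, 2:3]
    inside = (
        (uv[:, 0] >= 0) & (uv[:, 0] < width)
        & (uv[:, 1] >= 0) & (uv[:, 1] < height)
    )
    return float(inside.sum() / len(points))


def select_cameras(cell_points, cameras, width, height, threshold=0.25):
    """Indices of cameras that see at least `threshold` of the cell's points.

    `threshold` is a hypothetical parameter, not a value from the paper.
    """
    return [
        i for i, (K, R, t) in enumerate(cameras)
        if visible_fraction(cell_points, K, R, t, width, height) >= threshold
    ]
```

In use, one would call `assign_points_to_cells` once on the sparse point cloud and then run `select_cameras` per cell on the points carrying that cell id. A progressive point-cloud extension step, as mentioned in the summary, would then enlarge each cell's point set based on the selected cameras; that step is not sketched here because the summary gives no detail on it.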