NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation

Menglin Zhang, Xin Luo, Yunwei Lan, Chang Liu, Rui Li, Kaidong Zhang, Ganlin Yang, Dong Liu
2024-11-23
Abstract: Recent advances in NeRF inpainting have leveraged pretrained diffusion models to enhance performance. However, these methods often yield suboptimal results due to their ineffective utilization of 2D diffusion priors. The limitations manifest in two critical aspects: the inadequate capture of geometric information by pretrained diffusion models and the suboptimal guidance provided by existing Score Distillation Sampling (SDS) methods. To address these problems, we introduce GB-NeRF, a novel framework that enhances NeRF inpainting through improved utilization of 2D diffusion priors. Our approach incorporates two key innovations: a fine-tuning strategy that simultaneously learns appearance and geometric priors and a specialized normal distillation loss that integrates these geometric priors into NeRF inpainting. We propose a technique called Balanced Score Distillation (BSD) that surpasses existing methods such as Score Distillation Sampling (SDS) and the improved version, Conditional Score Distillation (CSD). BSD offers improved inpainting quality in appearance and geometric aspects. Extensive experiments show that our method provides superior appearance fidelity and geometric consistency compared to existing approaches.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the poor use of 2D diffusion model priors in existing NeRF (Neural Radiance Field) inpainting methods. These methods capture too little geometric information and provide weak optimization guidance, so the inpainted NeRF models fall short in geometric accuracy and appearance consistency. The paper identifies two main problems:

1. **Insufficient geometric information capture**: Pretrained diffusion models do not capture geometric information effectively, especially when dealing with normal maps.
2. **Inadequate optimization guidance**: Existing Score Distillation Sampling (SDS) and Conditional Score Distillation (CSD) methods contain random noise and unconditional noise prediction terms, which cause unnecessary changes during optimization and degrade the quality of the inpainted area.

To solve these problems, the authors propose the GB-NeRF framework, which improves NeRF inpainting through two key innovations:

1. **Fine-tuning strategy**: A dedicated fine-tuning strategy enables the diffusion model to learn appearance and geometric priors simultaneously. Training on pairs of high-quality RGB images and their corresponding normal maps strengthens the model's geometric understanding.
2. **Balanced Score Distillation (BSD)**: A new optimization technique, BSD, eliminates high-variability terms to provide a more stable and consistent supervision signal, improving optimization efficiency and reconstruction quality.

Together, these improvements let GB-NeRF achieve higher geometric accuracy and appearance fidelity in occluded areas, yielding a significant performance gain on the NeRF inpainting task.
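To make the point about high-variability terms concrete, here is a minimal NumPy sketch of the SDS and CSD residuals as they are commonly written in the score distillation literature. It is an illustration under assumptions, not the authors' implementation: the function names `sds_residual` and `csd_residual`, the toy latent shape, and the guidance scale of 7.5 are all hypothetical, and random arrays stand in for the diffusion model's noise predictions. The exact form of BSD is not stated in the text above, so it is not reproduced here.

```python
import numpy as np


def sds_residual(eps_cond, eps_uncond, noise, guidance_scale):
    """SDS-style residual: classifier-free-guided prediction minus the sampled noise."""
    eps_guided = eps_uncond + guidance_scale * (eps_cond - eps_uncond)
    return eps_guided - noise


def csd_residual(eps_cond, eps_uncond):
    """CSD-style residual: conditional minus unconditional prediction, no sampled noise."""
    return eps_cond - eps_uncond


rng = np.random.default_rng(0)
shape = (4, 8, 8)  # toy latent shape, chosen only for illustration

# Stand-ins for the model's noise predictions. A real pipeline would compute
# these from a noised render; they are held fixed here (an assumption) to
# isolate the effect of the explicit noise term.
eps_cond = rng.normal(size=shape)
eps_uncond = rng.normal(size=shape)

# Two independent noise draws, as would occur on two optimization steps.
noise_a = rng.normal(size=shape)
noise_b = rng.normal(size=shape)

sds_a = sds_residual(eps_cond, eps_uncond, noise_a, guidance_scale=7.5)
sds_b = sds_residual(eps_cond, eps_uncond, noise_b, guidance_scale=7.5)
csd = csd_residual(eps_cond, eps_uncond)

# With the predictions held fixed, the SDS residual still changes between
# draws by exactly the difference of the two noise samples.
print("mean |SDS_a - SDS_b|:", float(np.abs(sds_a - sds_b).mean()))
print("mean |CSD|:", float(np.abs(csd).mean()))
```

In this toy setting the SDS residual moves with every noise draw even though the model predictions are unchanged, while the CSD residual does not depend on the sampled noise but still carries the unconditional prediction term. These are the two terms the summary singles out as sources of unnecessary change during optimization.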