Learning a Diffusion Prior for NeRFs

Guandao Yang, Abhijit Kundu, Leonidas J. Guibas, Jonathan T. Barron, Ben Poole
DOI: https://doi.org/10.48550/arXiv.2304.14473
2023-04-27
Computer Vision and Pattern Recognition
Abstract: Neural Radiance Fields (NeRFs) have emerged as a powerful neural 3D representation for objects and scenes derived from 2D data. Generating NeRFs, however, remains difficult in many scenarios. For instance, training a NeRF with only a small number of views as supervision remains challenging since it is an under-constrained problem. In such settings, it calls for some inductive prior to filter out bad local minima. One way to introduce such inductive priors is to learn a generative model for NeRFs modeling a certain class of scenes. In this paper, we propose to use a diffusion model to generate NeRFs encoded on a regularized grid. We show that our model can sample realistic NeRFs, while at the same time allowing conditional generations, given a certain observation as guidance.
What problem does this paper attempt to address?
The paper addresses the difficulty of training Neural Radiance Fields (NeRFs) from only a small number of views. With few views, NeRF optimization is under-constrained: many different 3D scenes can reproduce the same handful of images, so training can settle in poor local minima. An inductive prior is needed to rule those solutions out. The paper obtains such a prior by learning a generative model for a class of scenes, specifically a diffusion model that generates NeRFs encoded on a regularized grid. The authors show that the model can sample realistic NeRFs and can also generate conditionally, using a given observation as guidance. The learned prior can then support downstream tasks such as single-view 3D reconstruction.
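As a rough illustration of what running a diffusion model over a grid-encoded NeRF involves, the sketch below applies the standard DDPM forward noising step to a voxel grid. The grid shape, the channel layout (density plus RGB), the linear noise schedule, and the function names `make_alpha_bar` and `noise_grid` are all assumptions made for illustration; none of them are taken from the paper.

```python
import numpy as np


def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for a linear beta schedule.

    The linear schedule and its endpoints are assumed here for illustration.
    """
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)


def noise_grid(x0, t, alpha_bar, rng):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(a_bar_t) * x_0, (1 - a_bar_t) * I).

    Returns the noised grid and the noise that was added, which is the usual
    regression target when training a denoising network.
    """
    eps = rng.standard_normal(x0.shape)
    a = alpha_bar[t]
    xt = np.sqrt(a) * x0 + np.sqrt(1.0 - a) * eps
    return xt, eps


rng = np.random.default_rng(0)

# Hypothetical grid-encoded NeRF: 4 channels (density + RGB) on a 32^3 voxel grid.
x0 = rng.uniform(-1.0, 1.0, size=(4, 32, 32, 32))

alpha_bar = make_alpha_bar()
xt, eps = noise_grid(x0, t=500, alpha_bar=alpha_bar, rng=rng)
```

A denoising network trained to predict `eps` from `xt` and `t` would then define the generative prior; conditional generation, as described in the abstract, additionally steers sampling with a guidance signal derived from the observation.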