ReconFusion: 3D Reconstruction with Diffusion Priors

Rundi Wu,Ben Mildenhall,Philipp Henzler,Keunhong Park,Ruiqi Gao,Daniel Watson,Pratul P. Srinivasan,Dor Verbin,Jonathan T. Barron,Ben Poole,Aleksander Holynski
2023-12-06
Abstract:3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at rendering photorealistic novel views of complex scenes. However, recovering a high-quality NeRF typically requires tens to hundreds of input images, resulting in a time-consuming capture process. We present ReconFusion to reconstruct real-world scenes using only a few photos. Our approach leverages a diffusion prior for novel view synthesis, trained on synthetic and multiview datasets, which regularizes a NeRF-based 3D reconstruction pipeline at novel camera poses beyond those captured by the set of input images. Our method synthesizes realistic geometry and texture in underconstrained regions while preserving the appearance of observed regions. We perform an extensive evaluation across various real-world datasets, including forward-facing and 360-degree scenes, demonstrating significant performance improvements over previous few-view NeRF reconstruction approaches.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the issue of 3D reconstruction when using a limited number of input views, particularly the artifacts commonly seen in Neural Radiance Field (NeRF) methods. Specifically, NeRF methods typically require dozens to hundreds of input images to ensure high-quality reconstruction, leading to a time-consuming capture process. To solve this problem, the paper proposes the ReconFusion method, which utilizes a diffusion model as a regularizer to enhance NeRF's reconstruction capability with a small number of views. In this way, even with a limited number of input views, it can generate more realistic and consistent geometry and textures, reduce artifacts such as "floaters," and maintain the appearance quality of observed regions even in sparse view conditions. Additionally, the method has been extensively evaluated on multiple real-world datasets, demonstrating significant performance improvements over existing methods in various scenarios.