Abstract:Synthesizing novel view images from a few views is a challenging but practical problem. Existing methods often struggle with producing high-quality results or necessitate per-object optimization in such few-view settings due to the insufficient information provided. In this work, we explore leveraging the strong 2D priors in pre-trained diffusion models for synthesizing novel view images. 2D diffusion models, nevertheless, lack 3D awareness, leading to distorted image synthesis and compromising the identity. To address these problems, we propose DreamSparse, a framework that enables the frozen pre-trained diffusion model to generate geometry and identity-consistent novel view image. Specifically, DreamSparse incorporates a geometry module designed to capture 3D features from sparse views as a 3D prior. Subsequently, a spatial guidance model is introduced to convert these 3D feature maps into spatial information for the generative process. This information is then used to guide the pre-trained diffusion model, enabling it to generate geometrically consistent images without tuning it. Leveraging the strong image priors in the pre-trained diffusion models, DreamSparse is capable of synthesizing high-quality novel views for both object and scene-level images and generalising to open-set images. Experimental results demonstrate that our framework can effectively synthesize novel view images from sparse views and outperforms baselines in both trained and open-set category images. More results can be found on our project page: <a class="link-external link-https" href="https://sites.google.com/view/dreamsparse-webpage" rel="external noopener nofollow">this https URL</a>.

Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views

SparseFusion: Distilling View-conditioned Diffusion for 3D Reconstruction

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

Efficient 3D View Synthesis from Single-Image Utilizing Diffusion Priors

How to Use Diffusion Priors under Sparse Views?

Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors

VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis

DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views

Deceptive-NeRF/3DGS: Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction

GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion

Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views

MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View

DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models

ReconFusion: 3D Reconstruction with Diffusion Priors

Generating Material-Aware 3D Models from Sparse Views

Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior

GeoGS3D: Single-view 3D Reconstruction via Geometric-aware Diffusion Model and Gaussian Splatting

CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction.

Explicit 3D Reconstruction from Images with Dynamic Graph Learning and Rendering-Guided Diffusion