Abstract: Recent works in volume rendering, e.g., NeRF and 3D Gaussian Splatting (3DGS), have significantly advanced rendering quality and efficiency with the help of a learned implicit neural radiance field or 3D Gaussians. Rendering on top of an explicit representation, the vanilla 3DGS and its variants deliver real-time efficiency, and they optimize the parametric model with single-view supervision per training iteration, a scheme adopted from NeRF. Consequently, certain views are overfitted, leading to unsatisfactory appearance in novel-view synthesis and imprecise 3D geometry. To address these problems, we propose a new 3DGS optimization method with four key contributions: 1) We transform the conventional single-view training paradigm into a multi-view training strategy. With our proposed multi-view regulation, 3D Gaussian attributes are further optimized without overfitting to particular training views. As a general solution, it improves overall accuracy across a variety of scenarios and different Gaussian variants. 2) Inspired by the benefit of additional views, we further propose a cross-intrinsic guidance scheme, leading to a coarse-to-fine training procedure across different resolutions. 3) Building on our multi-view regulated training, we further propose a cross-ray densification strategy, which densifies more Gaussian kernels in the ray-intersection regions of a selection of views. 4) By further investigating the densification strategy, we found that the effect of densification should be enhanced when certain views differ dramatically. As a solution, we propose a novel multi-view augmented densification strategy, in which 3D Gaussians are encouraged to densify to a sufficient number accordingly, resulting in improved reconstruction accuracy.
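The difference between single-view and multi-view supervision per iteration can be illustrated with a toy sketch. This is not the authors' implementation: the scalar parameter, the quadratic per-view loss, and the names `view_loss_grad`, `train`, and `views_per_iter` are all assumptions made for illustration, with each "view" reduced to a single target value. The sketch only shows that averaging gradients over several views per step pulls the parameter toward a solution that fits all views, whereas a single-view step chases whichever view was sampled.

```python
import random

def view_loss_grad(param, target):
    """Toy per-view loss 0.5 * (param - target)^2 and its gradient (hypothetical)."""
    diff = param - target
    return 0.5 * diff * diff, diff

def train(targets, views_per_iter, steps=200, lr=0.1, seed=0):
    """Gradient descent that averages the gradient over `views_per_iter` sampled views."""
    rng = random.Random(seed)
    param = 0.0
    for _ in range(steps):
        # Sample the views that supervise this iteration.
        batch = rng.sample(targets, views_per_iter)
        # Average the per-view gradients before updating the shared parameter.
        grad = sum(view_loss_grad(param, t)[1] for t in batch) / len(batch)
        param -= lr * grad
    return param

targets = [1.0, 2.0, 3.0, 6.0]  # one toy "view" per entry
single = train(targets, views_per_iter=1)            # single-view supervision
multi = train(targets, views_per_iter=len(targets))  # multi-view supervision
print(f"single-view: {single:.3f}, multi-view: {multi:.3f}")
```

In this toy setting the multi-view run settles at the mean of the targets, while the single-view result depends on which views were sampled last.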

Optimized View and Geometry Distillation from Multi-view Diffuser

VDN-NeRF: Resolving Shape-Radiance Ambiguity Via View-Dependence Normalization

Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion

MCGS: Multiview Consistency Enhancement for Sparse-View 3D Gaussian Radiance Fields

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis

EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion

Wonder3D: Single Image to 3D Using Cross-Domain Diffusion

MVDream: Multi-view Diffusion for 3D Generation

MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View

Progressive Radiance Distillation for Inverse Rendering with Gaussian Splatting

CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis

MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis

Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models

GeoGS3D: Single-view 3D Reconstruction via Geometric-aware Diffusion Model and Gaussian Splatting

HiFi-123: Towards High-fidelity One Image to 3D Content Generation

Efficient Multi-View Inverse Rendering Using a Hybrid Differentiable Rendering Method

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

Ray Conditioning: Trading Photo-consistency for Photo-realism in Multi-view Image Generation