SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians

Hiba Dahmani,Moussab Bennehar,Nathan Piasco,Luis Roldao,Dzmitry Tsishkou

2024-04-06

Abstract:Implicit neural representation methods have shown impressive advancements in learning 3D scenes from unstructured in-the-wild photo collections but are still limited by the large computational cost of volumetric rendering. More recently, 3D Gaussian Splatting emerged as a much faster alternative with superior rendering quality and training efficiency, especially for small-scale and object-centric scenarios. Nevertheless, this technique suffers from poor performance on unstructured in-the-wild data. To tackle this, we extend over 3D Gaussian Splatting to handle unstructured image collections. We achieve this by modeling appearance to seize photometric variations in the rendered images. Additionally, we introduce a new mechanism to train transient Gaussians to handle the presence of scene occluders in an unsupervised manner. Experiments on diverse photo collection scenes and multi-pass acquisition of outdoor landmarks show the effectiveness of our method over prior works achieving state-of-the-art results with improved efficiency.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

This paper mainly addresses the problem that the 3D Gaussian Splatting (3DGS) method does not perform well in dealing with lighting variations, dynamic objects, and occlusions when processing an unconstrained collection of natural environment photos. Although the existing Neural Radiance Fields (NeRF) method excels in rendering realistic novel views, it has poor performance in handling dynamic scenes. To address these issues, the paper proposes SWAG (Splatting in the Wild with Appearance-conditioned Gaussians), which is the first natural environment extension for 3DGS. SWAG improves 3DGS in the following ways: 1. It introduces a learning-based embedding space to capture the appearance of each image, adapting to the photometric variations in rendering images. 2. It learns opacity variations related to the images to better handle dynamic objects and enhance the precision of scene reconstruction. 3. It proposes an unsupervised mechanism to train transient Gaussian distributions for handling occlusions in the scene. Experimental results show that SWAG not only improves the performance of 3DGS in various scenes but also achieves state-of-the-art levels in training and rendering speeds, surpassing previous works in rendering quality. Additionally, SWAG can generate new images with smooth visual transitions and remove dynamic objects from the captured scene in an unsupervised manner.

SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians

AAGS: Appearance-Aware 3D Gaussian Splattingwith Unconstrained Photo Collections

WildGaussians: 3D Gaussian Splatting in the Wild

Gaussian Splatting in Style

Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections

SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction

Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians

Gaussian Splatting LK

Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting

Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections

3D-HGS: 3D Half-Gaussian Splatting

Unbounded-GS: Extending 3D Gaussian Splatting with Hybrid Representation for Unbounded Large-Scale Scene Reconstruction

latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction

3D Gaussian Splatting as Markov Chain Monte Carlo

Monocular Dynamic Gaussian Splatting is Fast and Brittle but Smooth Motion Helps

Gaussians on their Way: Wasserstein-Constrained 4D Gaussian Splatting with State-Space Modeling

SpecGaussian with Latent Features: A High-quality Modeling of the View-dependent Appearance for 3D Gaussian Splatting

Variational Bayes Gaussian Splatting

WE-GS: An In-the-wild Efficient 3D Gaussian Representation for Unconstrained Photo Collections

A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction

Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation