Text-to-3D Using Gaussian Splatting

Zilong Chen,Feng Wang,Yikai Wang,Huaping Liu
DOI: https://doi.org/10.1109/cvpr52733.2024.02022
2024-01-01
Computer Vision and Pattern Recognition
Abstract:Automatic text-to-3D generation that combines Score Distillation Sampling(SDS) with the optimization of volume rendering has achieved remarkableprogress in synthesizing realistic 3D objects. Yet most existing text-to-3Dmethods by SDS and volume rendering suffer from inaccurate geometry, e.g., theJanus issue, since it is hard to explicitly integrate 3D priors into implicit3D representations. Besides, it is usually time-consuming for them to generateelaborate 3D models with rich colors. In response, this paper proposes GSGEN, anovel method that adopts Gaussian Splatting, a recent state-of-the-artrepresentation, to text-to-3D generation. GSGEN aims at generating high-quality3D objects and addressing existing shortcomings by exploiting the explicitnature of Gaussian Splatting that enables the incorporation of 3D prior.Specifically, our method adopts a progressive optimization strategy, whichincludes a geometry optimization stage and an appearance refinement stage. Ingeometry optimization, a coarse representation is established under 3D pointcloud diffusion prior along with the ordinary 2D SDS optimization, ensuring asensible and 3D-consistent rough shape. Subsequently, the obtained Gaussiansundergo an iterative appearance refinement to enrich texture details. In thisstage, we increase the number of Gaussians by compactness-based densificationto enhance continuity and improve fidelity. With these designs, our approachcan generate 3D assets with delicate details and accurate geometry. Extensiveevaluations demonstrate the effectiveness of our method, especially forcapturing high-frequency components. Our code is available athttps://github.com/gsgen3d/gsgen
What problem does this paper attempt to address?