Learning Reconstruction Models of Textured 3D Mesh Using StyleGAN2

Fei Wang,Yangjie Cao,Zhenqiang Li,Jie Li
DOI: https://doi.org/10.1007/978-981-97-5666-7_35
2024-01-01
Abstract:The current field of 3D generation has made significant progress, yet achieving high-fidelity 3D object reconstruction from a single-view image remains a challenging task. However, we find that recent StyleGAN-based 3D GANs are primarily used for generating 2D images of different viewpoints, then employing a multi-view approach to reconstruct 3D object. In this work, we propose CIR, a category-based single-view image 3D reconstruction algorithm, utilizing an improved StyleGAN2 network to generate high-fidelity 3D object. CIR has several merits compared to prior methods: (1) Compared with other 3D GANs that require pre-training, we propose an end-to-end image 3D generation framework combining Variational Autoencoders (VAE) and StyleGAN2 to reconstruct textured 3D mesh from single-view image. (2) CIR maps image to feature space through VAE, and then reconstructs shape and surface 3D attributes from well-decoupled latent vectors through the improved StyleGAN2 network. We evaluate the effectiveness of our framework on different datasets. Experimental results demonstrate that our reconstruction framework significantly improves the fidelity of object. The generated models exhibit higher accuracy in 3D evaluation metrics, making them deployable in any traditional graphics engine for downstream tasks.
What problem does this paper attempt to address?