3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh Generation

Rohit Das,Tzung-Han Lin,Ko-Chih Wang
2024-10-21
Abstract:Geometry and texture estimation from a single face image is an ill-posed problem since there is very little information to work with. The problem further escalates when the face is rotated at a different angle. This paper tries to tackle this problem by introducing a novel method for texture estimation from a single image by first using StyleGAN and 3D Morphable Models. The method begins by generating multi-view faces using the latent space of GAN. Then 3DDFA trained on 3DMM estimates a 3D face mesh as well as a high-resolution texture map that is consistent with the estimated face shape. The result shows that the generated mesh is of high quality with near to accurate texture representation.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to solve the problem of generating high - quality multi - view images and 3D models from a single face image. Specifically, the paper focuses on the following aspects: 1. **Geometry and Texture Estimation**: Estimating the geometric structure and texture from a single face image is an ill - posed problem because the available information is very limited. When the face is rotated to different angles, this problem becomes more complex. The paper proposes a new method to solve this problem by using Style GAN and 3D Morphable Models (3DMM). 2. **Multi - view Image Generation**: The method proposed in the paper first uses the latent space of Style GAN to generate multi - view face images, and then uses the 3DMM trained by the 3D Dense Face Alignment (3DDFA) model to estimate the 3D face mesh and high - resolution texture maps that are consistent with the estimated face shape. 3. **3D Model Generation from a Single Image**: Generating high - quality 3D models from a single image is a key challenge in computer vision and graphics. Existing generative models have made significant progress in single - view face synthesis, but it is more complex to extend to multiple views. The 3D - GANTex framework proposed in the paper can generate multi - view images from a front - view image and estimate high - quality 3D geometric structures and textures. 4. **No Need for Annotated Data**: The method in the paper does not require annotated data, which makes it more adaptable in various applications, such as large - scale face recognition and virtual character creation. ### Main contributions of the paper: 1. **Proposing 3D - GANTex**: This is a new inference pipeline that embeds an image into the latent space and uses a parameterized mesh model to generate a front - view image and estimate the mesh and texture. 2. **Not Relying on Prior Information**: This method mainly relies on StyleGAN3 trained on the FFHQ dataset and 3DDFA v2 trained on 3DMM to handle self - occlusion problems, rather than detecting 2D landmarks. Through these methods, the paper has achieved remarkable results in generating high - quality multi - view images and 3D models, especially in dealing with facial expressions, non - rigid deformations and occlusions.