Abstract:Photorealistic multiview face synthesis from a single image is a challenging problem. Existing works mainly learn a texture mapping model from the source to the target faces. However, they rarely consider the geometric constraints on the internal deformation arising from pose variations, which causes a high level of uncertainty in face pose modeling, and hence, produces inferior results for large pose variations. Moreover, current methods typically suffer from undesired facial details loss due to the adoption of the de-facto standard encoder-decoder architecture without any skip connections (SCs). In this article, we directly learn and exploit geometric constraints and propose a fully deformable network to simultaneously model the deformations of both landmarks and faces for face synthesis. Specifically, our model consists of two parts: a deformable landmark learning network (DLLN) and a gated deformable face synthesis network (GDFSN). The DLLN converts an initial reference landmark to an individual-specific target landmark as delicate pose guidance for face rotation. The GDFSN adopts a dual-stream structure, with one stream estimating the deformation of two views in the form of convolution offsets according to the source pose and the converted target pose, and the other leveraging the predicted deformation offsets to create the target face. In this way, individual-aware pose changes are explicitly modeled in the face generator to cope with geometric transformation, by adaptively focusing on pertinent regions of the source face. To compensate for offset estimation errors, we introduce a soft-gating mechanism for adaptive fusion between deformable features and primitive features. Additionally, a pose-aligned SC (PASC) is tailored to propagate low-level input features to the appropriate positions in the output features for further enhancing the facial details and identity preservation. Extensive experiments on six benchmarks show that our approach performs favorably against the state-of-the-arts, especially with large pose changes. Code is available at https://github.com/cschengxu/FDFace.

3D Face Modeling via Weakly-supervised Disentanglement Network joint Identity-consistency Prior

Realistic Face Reenactment Via Self-Supervised Disentangling of Identity and Pose

Facial Landmark Disentangled Network with Variational Autoencoder

SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Learning Distribution Independent Latent Representation for 3D Face Disentanglement.

Semantically Disentangled Variational Autoencoder for Modeling 3D Facial Details

Disentangling Features in 3D Face Shapes for Joint Face Reconstruction and Recognition

Disentangled Representation Learning For 3d Face Shape

Disentangling Factors of Variation in Deep Representations Using Adversarial Training.

A Distribution Independence Based Method for 3D Face Shape Decomposition

Controllable Face Image Editing in a Disentanglement Way

Exploring Disentangled Feature Representation Beyond Face Identification

Reinforced Disentanglement for Face Swapping Without Skip Connection

DFIE3D: 3D-Aware Disentangled Face Inversion and Editing Via Facial-contrastive Learning

A Generative Framework for Self-Supervised Facial Representation Learning

3D Generative Model Latent Disentanglement via Local Eigenprojection

Achieving Privacy-Preserving Multi-View Consistency with Advanced 3D-Aware Face De-identification.

Semi-supervised 3D Face Representation Learning from Unconstrained Photo Collections.

VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment

Fully Deformable Network for Multiview Face Image Synthesis

Disjoint Pose and Shape for 3D Face Reconstruction