Abstract:Photorealistic multiview face synthesis from a single image is a challenging problem. Existing works mainly learn a texture mapping model from the source to the target faces. However, they rarely consider the geometric constraints on the internal deformation arising from pose variations, which causes a high level of uncertainty in face pose modeling, and hence, produces inferior results for large pose variations. Moreover, current methods typically suffer from undesired facial details loss due to the adoption of the de-facto standard encoder-decoder architecture without any skip connections (SCs). In this article, we directly learn and exploit geometric constraints and propose a fully deformable network to simultaneously model the deformations of both landmarks and faces for face synthesis. Specifically, our model consists of two parts: a deformable landmark learning network (DLLN) and a gated deformable face synthesis network (GDFSN). The DLLN converts an initial reference landmark to an individual-specific target landmark as delicate pose guidance for face rotation. The GDFSN adopts a dual-stream structure, with one stream estimating the deformation of two views in the form of convolution offsets according to the source pose and the converted target pose, and the other leveraging the predicted deformation offsets to create the target face. In this way, individual-aware pose changes are explicitly modeled in the face generator to cope with geometric transformation, by adaptively focusing on pertinent regions of the source face. To compensate for offset estimation errors, we introduce a soft-gating mechanism for adaptive fusion between deformable features and primitive features. Additionally, a pose-aligned SC (PASC) is tailored to propagate low-level input features to the appropriate positions in the output features for further enhancing the facial details and identity preservation. Extensive experiments on six benchmarks show that our approach performs favorably against the state-of-the-arts, especially with large pose changes. Code is available at https://github.com/cschengxu/FDFace.

A face template: Improving the face generation quality of multi-stage generative adversarial networks using coarse-grained facial priors

Two Birds with One Stone: Transforming and Generating Facial Images with Iterative GAN

Two Birds with One Stone: Iteratively Learn Facial Attributes with GANs.

Multi-Modal Face Stylization with a Generative Prior

An improved StyleGAN-based TextToFace model with Local-Global information Fusion

Audio-driven Talking Face Video Generation with Natural Head Pose

Fine-Granularity Face Sketch Synthesis

FaceChain: A Playground for Identity-Preserving Portrait Generation

Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network

DualG-GAN, a Dual-channel Generator based Generative Adversarial Network for text-to-face synthesis

HCGAN: hierarchical contrast generative adversarial network for unpaired sketch face synthesis

Fully Deformable Network for Multiview Face Image Synthesis

Face Synthesis from Visual Attributes via Sketch using Conditional VAEs and GANs

Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis.

Multi-Style Facial Sketch Synthesis through Masked Generative Modeling

FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset

Multi-Level Cycle-Consistent Adversarial Networks with Attention Mechanism for Face Sketch-Photo Synthesis

Composition-Aided Face Photo-Sketch Synthesis.

Robust Face Sketch Synthesis Via Generative Adversarial Fusion of Priors and Parametric Sigmoid

Adversarially Regularized U-Net-based GANs for Facial Attribute Modification and Generation.

CMAFGAN: A Cross-Modal Attention Fusion based Generative Adversarial Network for attribute word-to-face synthesis