Abstract: Modifying facial attributes without a paired dataset is a challenging task. Previous approaches either required supervision from a ground-truth transformed image or required training a separate model for every pair of attributes. These requirements limit how well the models scale to a larger set of attributes, since the number of models that must be trained grows exponentially. Another major drawback of previous approaches is that they unintentionally change the identity of the person while transforming the facial attributes. We propose a method that allows for controllable and identity-aware transformations across multiple facial attributes using only a single model. Our approach is to train a generative adversarial network (GAN) with a multitask conditional discriminator that recognizes the identity of the face, distinguishes real images from fake ones, and identifies the facial attributes present in an image. This guides the generator toward producing an output that is realistic while preserving the person's identity and facial attributes. Through this framework, our model also learns meaningful image representations in a lower-dimensional latent space and semantically associates separate parts of the encoded vector with the person's identity and with the facial attributes. This opens up the possibility of generating new faces and of other transformations, such as making the face thinner or chubbier. Furthermore, our model encodes the image only once and performs multiple transformations from the same encoded vector. This makes transformations faster, since the model does not need to reprocess the entire image for every transformation. We show the effectiveness of our proposed method through both qualitative and quantitative evaluations, including ablation studies, visual inspection, and face verification. Our results are competitive with the main competing method (CycleGAN), while using a single model yields large gains in storage and extensibility.
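
The encode-once idea described in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the latent layout (`ID_DIM` identity entries followed by one entry per attribute), the attribute names, and the stand-in `toy_encoder` and `toy_decoder` functions are hypothetical and are not the paper's networks. The sketch only shows how several transformations could reuse one encoded vector by editing its attribute part while leaving its identity part untouched.

```python
from typing import Dict, List, Sequence

# Hypothetical latent layout: the first ID_DIM entries carry identity,
# the remaining entries carry one value per attribute.
ID_DIM = 4
ATTRS = ["smiling", "eyeglasses", "blond_hair"]  # illustrative names only


def toy_encoder(image: Sequence[float]) -> List[float]:
    """Stand-in encoder; counts its calls so encode-once is observable."""
    toy_encoder.calls += 1
    identity = list(image[:ID_DIM])
    attributes = [0.0] * len(ATTRS)
    return identity + attributes


toy_encoder.calls = 0


def toy_decoder(latent: Sequence[float]) -> List[float]:
    """Stand-in decoder; returns a copy of the latent as the 'image'."""
    return list(latent)


def set_attributes(latent: Sequence[float], targets: Dict[str, float]) -> List[float]:
    """Overwrite only the attribute slice; the identity slice is unchanged."""
    edited = list(latent)
    for name, value in targets.items():
        edited[ID_DIM + ATTRS.index(name)] = value
    return edited


def transform_many(image: Sequence[float],
                   requests: Sequence[Dict[str, float]]) -> List[List[float]]:
    """Encode the image once, then decode one output per requested edit."""
    latent = toy_encoder(image)  # single encoding pass
    return [toy_decoder(set_attributes(latent, r)) for r in requests]


if __name__ == "__main__":
    outputs = transform_many(
        [0.1, 0.2, 0.3, 0.4, 0.5],
        [{"smiling": 1.0}, {"eyeglasses": 1.0, "blond_hair": 1.0}],
    )
    print(toy_encoder.calls, outputs)
```

In a real model the encoder and decoder would be learned networks, and the identity and attribute parts would be shaped by the discriminator's identity and attribute objectives rather than assigned by hand.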

Multimodal Face Synthesis From Visual Attributes

Facial Synthesis from Visual Attributes via Sketch using Multi-Scale Generators

Attribute-Guided Sketch Generation

Multimodal-driven Talking Face Generation, Face Swapping, Diffusion Model

Face Synthesis from Visual Attributes via Sketch using Conditional VAEs and GANs

Recognizing Facial Sketches by Generating Photorealistic Faces Guided by Descriptive Attributes

Towards Open-Set Identity Preserving Face Synthesis

Recognizing Minimal Facial Sketch by Generating Photorealistic Faces with the Guidance of Descriptive Attributes

Synthesis of High-Quality Visible Faces from Polarimetric Thermal Faces using Generative Adversarial Networks

CMOS-GAN: Semi-Supervised Generative Adversarial Model for Cross-Modality Face Image Synthesis

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning

Generative Adversarial Network-based Synthesis of Visible Faces from Polarimetric Thermal Faces

Quality Guided Sketch-to-Photo Image Synthesis

Two Birds with One Stone: Iteratively Learn Facial Attributes with GANs

Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis

Identity-aware Facial Expression Recognition via Deep Metric Learning based on Synthesized Images

Generating Face Images with Attributes for Free

High-Quality Facial Photo-Sketch Synthesis Using Multi-Adversarial Networks

Controllable and Identity-Aware Facial Attribute Transformation

Generation of Non-Deterministic Synthetic Face Datasets Guided by Identity Priors

Generating bimodal privacy-preserving data for face recognition