Abstract:Facial attribute editing aims to manipulate single or multiple attributes on a given face image, i.e., to generate a new face image with desired attributes while preserving other details. Recently, the generative adversarial net (GAN) and encoder–decoder architecture are usually incorporated to handle this task with promising results. Based on the encoder–decoder architecture, facial attribute editing is achieved by decoding the latent representation of a given face conditioned on the desired attributes. Some existing methods attempt to establish an attribute-independent latent representation for further attribute editing. However, such attribute-independent constraint on the latent representation is excessive because it restricts the capacity of the latent representation and may result in information loss, leading to over-smooth or distorted generation. Instead of imposing constraints on the latent representation, in this work, we propose to apply an <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">attribute classification constraint</italic> to the generated image to just guarantee the correct change of desired attributes, i.e., to change what you want. Meanwhile, the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">reconstruction learning</italic> is introduced to preserve attribute-excluding details, in other words, to only change what you want. Besides, the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">adversarial learning</italic> is employed for visually realistic editing. These three components cooperate with each other forming an effective framework for high quality facial attribute editing, referred as <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">AttGAN</italic> . Furthermore, the proposed method is extended for <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">attribute style manipulation</italic> in an unsupervised manner. Experiments on two wild datasets, CelebA and LFW, show that the proposed method outperforms the state-of-the-art on realistic attribute editing with other facial details well preserved.

Image Attribute Modification Based on Text Guidance.

Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge.

From External to Internal: Structuring Image for Text-to-Image Attributes Manipulation

Image manipulation with natural language using Two-sided Attentive Conditional Generative Adversarial Network

Face Attribute Editing Based on Generative Adversarial Networks

Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation.

Prominent Attribute Modification using Attribute Dependent Generative Adversarial Network

Controllable Text-to-Image Generation with Enhanced Text Encoder and Edge-Preserving Embedding

Image Manipulation with Natural Language using Two-sidedAttentive Conditional Generative Adversarial Network

AttGAN: Facial Attribute Editing by Only Changing What You Want

DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis

Generative Adversarial Network Including Referring Image Segmentation For Text-Guided Image Manipulation

Clothing image attribute editing based on generative adversarial network, with reference to an upper garment

Text-Guided Co-Modulated Generative Adversarial Network for Image Inpainting

Spatial Attention Guided Local Facial Attribute Editing

Text-driven Face Image Generation and Manipulation via Multi-level Residual Mapper

Core-attributes enhanced generative adversarial networks for robust image enhancement

PA-GAN: Progressive Attention Generative Adversarial Network for Facial Attribute Editing

LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation

A Prior-Guided Generative Adversarial Net for Semantically Strict Ultrasound Images Augmentation.

Object-driven Text-to-Image Synthesis via Adversarial Training