Abstract:Generative Adversarial Network (GAN) has been widely used for image-to-image translation-based facial attribute editing. Existing GAN networks are likely to generate samples with anomalies, which may be caused by the lack of consistency preservation and feature entanglement. For preserving image consistency, many studies resorted to the design of the network framework and loss functions, e.g. cycle-consistency loss. However, the generator with the cycle-consistency loss could not well preserve the attribute-irrelevant features, and its feature-level noises may possibly cause synthesis abnormalities. For feature disentanglement, previous works were devoted to mining the implicit semantics of feature spaces, while these semantics are not stable and intuitive enough. For consistency preservation, we propose a target consistency loss to complement the cycle-consistency loss, and enable the network to learn to preserve features of the image more directly. Meanwhile, we filter out outlier feature maps to reduce the synthesis abnormalities and propose a dynamic dropout to better preserve the attribute-irrelevant features. For feature disentanglement, we encode the image semantics more stably and intuitively and propose an entropy regularization to decouple these semantics to allow independent editing of different attributes. The proposed modules are general and can be easily integrated with available image-to-image-based GAN models like StarGAN, AttGAN, and STGAN. Extensive experiments on CelebA dataset show that the our strategy can largely reduce the artifacts and better preserve the subtle facial features, and thus significantly improve the facial editing performance of these mainstream GAN models, in terms of FID, PSNR and SSIM. Additional experiments on realistic expression editing show that our method outperforms StarGAN on RaFD, and achieves much better generalization performances than the three baselines on datasets of FFHQ, RaFD and LFW.

TunaGAN: Interpretable GAN for Smart Editing

Creative and Diverse Artwork Generation Using Adversarial Networks

Style Transformer for Image Inversion and Editing

Towards Spatially Disentangled Manipulation of Face Images With Pre-Trained StyleGANs

Soft Generative Adversarial Network: Combating Mode Collapse in Generative Adversarial Network Training Via Dynamic Borderline Softening Mechanism

Self-Conditioned Generative Adversarial Networks for Image Editing

Two Birds with One Stone: Iteratively Learn Facial Attributes with GANs.

TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

Self-Conditioned GANs for Image Editing

Two Birds with One Stone: Transforming and Generating Facial Images with Iterative GAN

Style Intervention: How to Achieve Spatial Disentanglement with Style-based Generators?

Interpreting the Latent Space of GANs for Semantic Face Editing

Rewriting Geometric Rules of a GAN

Enjoy Your Editing: Controllable GANs for Image Editing via Latent Space Navigation

Editable Generative Adversarial Networks: Generating and Editing Faces Simultaneously

FEAT: Face Editing with Attention

Semantic Unfolding of StyleGAN Latent Space

Lightweight Facial Attribute Editing with Separable Latent Vector

Video2StyleGAN: Disentangling Local and Global Variations in a Video

EditGAN: High-Precision Semantic Image Editing

Consistency Preservation and Feature Entropy Regularization for GAN based Face Editing