Abstract:Affective computing and cognitive theory are widely used in modern human-computer interaction scenarios. Human faces, as the most prominent and easily accessible features, have attracted great attention from researchers. Since humans have rich emotions and developed musculature, there exist a lot of fine-grained expressions in real-world applications. However, it is extremely time-consuming to collect and annotate a large number of facial images, of which may even require psychologists to correctly categorize them. To the best of our knowledge, the existing expression datasets are only limited to several basic facial expressions, which are not sufficient to support our ambitions in developing successful human-computer interaction systems. To this end, a novel Fine-grained Facial Expression Database - F2ED is contributed in this paper, and it includes more than 200k images with 54 facial expressions from 119 persons. Considering the phenomenon of uneven data distribution and lack of samples is common in real-world scenarios, we further evaluate several tasks of few-shot expression learning by virtue of our F2ED, which are to recognize the facial expressions given only few training instances. These tasks mimic human performance to learn robust and general representation from few examples. To address such few-shot tasks, we propose a unified task-driven framework - Compositional Generative Adversarial Network (Comp-GAN) learning to synthesize facial images and thus augmenting the instances of few-shot expression classes. Extensive experiments are conducted on F2ED and existing facial expression datasets, i.e., JAFFE and FER2013, to validate the efficacy of our F2ED in pre-training facial expression recognition network and the effectiveness of our proposed approach Comp-GAN to improve the performance of few-shot recognition tasks.

EGGAN: Learning Latent Space for Fine-Grained Expression Manipulation.

Expression Conditional Gan for Facial Expression-to-Expression Translation.

FINE-GRAINED EXPRESSION MANIPULATION VIA STRUCTURED LATENT SPACE

Expression-Guided Attention GAN for Fine-Grained Facial Expression Editing

ExprGAN: Facial Expression Editing With Controllable Expression Intensity

GI-AEE - GAN Inversion Based Attentive Expression Embedding Network for Facial Expression Editing.

Toward Fine-grained Facial Expression Manipulation

Semantic prior guided fine-grained facial expression manipulation

Attention Based Facial Expression Manipulation

Two Birds with One Stone: Iteratively Learn Facial Attributes with GANs.

WEM-GAN: Wavelet transform based facial expression manipulation

Two Birds with One Stone: Transforming and Generating Facial Images with Iterative GAN

Comp-GAN

Facial Landmarks and Expression Label Guided Photorealistic Facial Expression Synthesis

EvoGAN: An evolutionary computation assisted GAN

Cascade EF-GAN: Progressive Facial Expression Editing with Local Focuses

Talking Face Generation with Expression-Tailored Generative Adversarial Network

Deep Realistic Facial Editing via Label-restricted Mask Disentanglement

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition

Local and Global Perception Generative Adversarial Network for Facial Expression Synthesis

Towards Localized Fine-Grained Control for Facial Expression Generation