Abstract:As an emerging research topic for proximity service (ProSe), automatic emotion recognition enables the machines to understand the emotional changes of human beings which can not only facilitate natural, effective, seamless, and advanced human–robot interaction or human–computer interface but also promote emotional health. Facial expression recognition (FER) is a vital task for emotion recognition. However, significant gap between human and machine exists in FER task. In this paper, we present a conditional generative adversarial network-based approach to alleviate the intra-class variations by individually controlling the facial expressions and learning the generative and discriminative representations simultaneously. The proposed framework consists of a generator G and three discriminators ( Di , Da , and Dexp ). The generator G transforms any query face image into another prototypic facial expression image with other factors preserved. Armed with action units condition, the generator G pays more attention to information relevant to facial expression. Three loss functions ( $L_{I}$ , La , and Lexp ) corresponding to the three discriminators ( Di , Da , and Dexp ) were designed to learn generative and discriminative representations. Moreover, after rendering the generated expression back to its original facial expression, cycle consistency loss is also applied to guarantee the identity and produce more constrained visual representations. Optimized by combining both synthesis and classification loss functions, the learnt representation is explicitly disentangled from other variations such as identity, head pose, and illumination. Qualitative and quantitative experimental results demonstrate the proposed FER system is effective for expression recognition.

3D-FERNet: A Facial Expression Recognition Network utilizing 3D information

DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition

Cgan Based Facial Expression Recognition for Human-Robot Interaction

SDNET: Lightweight Facial Expression Recognition For Sample Disequilibrium.

Automatic 4D Facial Expression Recognition via Collaborative Cross-domain Dynamic Image Network.

Deep Representation of Facial Geometric and Photometric Attributes for Automatic 3D Facial Expression Recognition

Auto-FERNet: A Facial Expression Recognition Network With Architecture Search

Facial Expression Recognition Based on Zero-Addition Pretext Training and Feature Conjunction-Selection Network in Human–Robot Interaction

Facial expression recognition through multi-level features extraction and fusion

Efficient Facial Expression Recognition with Representation Reinforcement Network and Transfer Self-Training for Human–Machine Interaction

Facial Expression Recognition Using Hybrid Features of Pixel and Geometry

Landmarks-assisted Collaborative Deep Framework for Automatic 4D Facial Expression Recognition.

A Dual-Direction Attention Mixed Feature Network for Facial Expression Recognition

3-D Facial Expression Recognition via Attention-Based Multichannel Data Fusion Network

FERNet: A Deep CNN Architecture for Facial Expression Recognition in the Wild

Multi-Net Fusion: Exploring a Brain-Inspired Neural Network Model for Facial Expression Recognition

MGR3Net: Multigranularity Region Relation Representation Network for Facial Expression Recognition in Affective Robots

2D+3D Facial Expression Recognition via Discriminative Dynamic Range Enhancement and Multi-Scale Learning

Facial Expression Recognition Using Enhanced Deep 3D Convolutional Neural Networks

Two-pathway attention network for real-time facial expression recognition

Joint spatial and scale attention network for multi-view facial expression recognition