Abstract:Facial expression data is characterized by a significant imbalance, with most collected data showing happy or neutral expressions and fewer instances of fear or disgust. This imbalance poses challenges to facial expression recognition (FER) models, hindering their ability to fully understand various human emotional states. Existing FER methods typically report overall accuracy on highly imbalanced test sets but exhibit low performance in terms of the mean accuracy across all expression classes. In this paper, our aim is to address the imbalanced FER problem. Existing methods primarily focus on learning knowledge of minor classes solely from minor-class samples. However, we propose a novel approach to extract extra knowledge related to the minor classes from both major and minor class samples. Our motivation stems from the belief that FER resembles a distribution learning task, wherein a sample may contain information about multiple classes. For instance, a sample from the major class surprise might also contain useful features of the minor class fear. Inspired by that, we propose a novel method that leverages re-balanced attention maps to regularize the model, enabling it to extract transformation invariant information about the minor classes from all training samples. Additionally, we introduce re-balanced smooth labels to regulate the cross-entropy loss, guiding the model to pay more attention to the minor classes by utilizing the extra information regarding the label distribution of the imbalanced training data. Extensive experiments on different datasets and backbones show that the two proposed modules work together to regularize the model and achieve state-of-the-art performance under the imbalanced FER task. Code is available at <a class="link-external link-https" href="https://github.com/zyh-uaiaaaa" rel="external noopener nofollow">this https URL</a>.

A New Joint Training Method for Facial Expression Recognition with Inconsistently Annotated and Imbalanced Data

SDNET: Lightweight Facial Expression Recognition For Sample Disequilibrium.

Facial Expression Recognition with Inconsistently Annotated Datasets

DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition

Joint spatial and scale attention network for multi-view facial expression recognition

Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to Boost Semi-Supervised Facial Expression Recognition

Real Emotion Seeker: Recalibrating Annotation for Facial Expression Recognition

Joint Deep Learning of Facial Expression Synthesis and Recognition

Efficient Facial Expression Recognition with Representation Reinforcement Network and Transfer Self-Training for Human–Machine Interaction

A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition

Facial Expression Recognition Based on Zero-Addition Pretext Training and Feature Conjunction-Selection Network in Human–Robot Interaction

Boosting Facial Expression Recognition by A Semi-Supervised Progressive Teacher

Multi-Head Attention Affinity Diversity Sharing Network for Facial Expression Recognition

Enhancing Facial Expression Recognition Under Data Uncertainty Based on Embedding Proximity

SAANet: Siamese Action-Units Attention Network for Improving Dynamic Facial Expression Recognition

Face2Exp: Combating Data Biases for Facial Expression Recognition

Suppressing Uncertainties for Large-Scale Facial Expression Recognition

Facial Expression Recognition with Contrastive Learning and Uncertainty-Guided Relabeling

Facial Emotion Recognition with Noisy Multi-task Annotations

Leave No Stone Unturned: Mine Extra Knowledge for Imbalanced Facial Expression Recognition

Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling