Abstract: Domain shift poses a significant challenge in Cross-Domain Facial Expression Recognition (CD-FER) because the data distributions of the source and target domains differ. Current algorithms mainly learn domain-invariant features through global feature adaptation and neglect the transferability of local features across domains. These algorithms also lack discriminative supervision when training on target datasets, which degrades the feature representation in the target domain. To address these limitations, we propose an Adaptive Global-Local Representation Learning and Selection (AGLRLS) framework. The framework incorporates global-local adversarial adaptation and semantic-aware pseudo label generation to enhance the learning of domain-invariant and discriminative feature representations during training, and introduces global-local prediction consistency learning to improve classification results during inference. Specifically, the framework consists of separate global and local adversarial learning modules that learn domain-invariant global and local features independently. We also design a semantic-aware pseudo label generation module, which computes semantic labels based on global and local features. Moreover, a novel dynamic threshold strategy learns the optimal thresholds by leveraging the independent predictions of global and local features, filtering out unreliable pseudo labels while retaining reliable ones. These labels are used for model optimization through the adversarial learning process in an end-to-end manner. During inference, a global-local prediction consistency module automatically learns an optimal result from multiple predictions. To validate the effectiveness of our framework, we conduct comprehensive experiments and analysis on a fair evaluation benchmark. The results demonstrate that the proposed framework outperforms current competing methods by a substantial margin.
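
The pseudo label filtering idea described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the thresholds are learned, so everything below is an assumption made for illustration: the function name `select_pseudo_labels`, the `base_threshold` parameter, the agreement check between the two branches, and the rule that scales each class threshold by that class's mean confidence. It is not the AGLRLS implementation.

```python
from collections import defaultdict


def argmax(probs):
    """Index of the largest probability in a list."""
    return max(range(len(probs)), key=probs.__getitem__)


def select_pseudo_labels(global_probs, local_probs, base_threshold=0.8):
    """Return {sample_index: pseudo_label} for target samples judged reliable.

    global_probs / local_probs: per-sample class probabilities predicted
    independently from global and local features (hypothetical inputs).
    """
    # Step 1: keep only samples on which the two branches agree, and use
    # the weaker of the two confidences as the sample's confidence.
    candidates = []
    for i, (g, l) in enumerate(zip(global_probs, local_probs)):
        g_label, l_label = argmax(g), argmax(l)
        if g_label == l_label:
            candidates.append((i, g_label, min(g[g_label], l[l_label])))
    if not candidates:
        return {}

    # Step 2 (assumed rule): one threshold per class, scaled by how
    # confident the model currently is on that class relative to its
    # most confident class, so harder classes get a lower threshold.
    by_class = defaultdict(list)
    for _, label, conf in candidates:
        by_class[label].append(conf)
    mean_conf = {c: sum(v) / len(v) for c, v in by_class.items()}
    top = max(mean_conf.values())
    thresholds = {c: base_threshold * m / top for c, m in mean_conf.items()}

    # Step 3: discard candidates that fall below their class threshold.
    return {i: label for i, label, conf in candidates
            if conf >= thresholds[label]}


if __name__ == "__main__":
    g = [[0.9, 0.1], [0.6, 0.4], [0.2, 0.8], [0.45, 0.55]]
    l = [[0.95, 0.05], [0.3, 0.7], [0.3, 0.7], [0.4, 0.6]]
    print(select_pseudo_labels(g, l))
```

In this toy input, the second sample is dropped because the global and local branches disagree, and the fourth is dropped because its confidence falls below the threshold computed for its class.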

Learning Effective Global Receptive Field for Facial Expression Recognition

DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition

cGAN Based Facial Expression Recognition for Human-Robot Interaction

SDNET: Lightweight Facial Expression Recognition For Sample Disequilibrium

Combining 2D Gabor and Local Binary Pattern for Facial Expression Recognition Using Extreme Learning Machine

Enhanced Discriminative Global-Local Feature Learning with Priority for Facial Expression Recognition

Efficient Facial Expression Recognition with Representation Reinforcement Network and Transfer Self-Training for Human–Machine Interaction

Automatic 4D Facial Expression Recognition via Collaborative Cross-domain Dynamic Image Network

Learning Cognitive Features as Complementary for Facial Expression Recognition

Facial expression recognition through multi-level features extraction and fusion

Learning Informative and Discriminative Features for Facial Expression Recognition in the Wild

Enhanced Dual-Level Representations for Facial Expression Recognition

Dynamic Resolution Guidance for Facial Expression Recognition

Two-pathway attention network for real-time facial expression recognition

Fast and Efficient Facial Expression Recognition Using a Gabor Convolutional Network

Facial Expression Recognition with Contrastive Learning and Uncertainty-Guided Relabeling

Learning Associative Representation for Facial Expression Recognition

Joint spatial and scale attention network for multi-view facial expression recognition

Adaptively Enhancing Facial Expression Crucial Regions via Local Non-Local Joint Network

Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition