Abstract: With the development of deep learning, the performance of facial expression recognition (FER) has improved significantly. The main remaining challenge is confusion between expressions, caused by the highly nonlinear variation of facial expressions. Existing FER methods based on Convolutional Neural Networks (CNN) often ignore the underlying relationship between expressions, which is crucial for improving recognition of confusable expressions. Methods based on Graph Convolutional Networks (GCN) can capture the relationship between vertices, but the subgraphs they generate have a low degree of aggregation: they tend to include low-confidence neighbors, which makes the network harder to train. To solve these problems, this paper proposes a method that recognizes facial expressions on high aggregation subgraphs (HASs) by combining the strength of CNNs in extracting features with the strength of GCNs in modeling complex graph patterns. Specifically, we formulate FER as a vertex prediction problem. Because high-order neighbors are important and efficiency matters, we use vertex confidence to find high-order neighbors. We then construct the HASs from the top embedding features of these high-order neighbors, and use the GCN to perform reasoning and infer the class of the vertices in the HASs without requiring a large number of overlapping subgraphs. Our method captures the underlying relationship between expressions on the HASs and improves both the accuracy and the efficiency of FER. Experimental results on both in-the-lab and in-the-wild datasets show that our method achieves higher recognition accuracy than several state-of-the-art methods. This highlights the benefit of modeling the underlying relationship between expressions for FER.
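
The pipeline outlined in the abstract (confidence-guided neighbor search, subgraph construction from the top embedding features, then graph-based aggregation) can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the cosine-similarity neighbor search, the confidence threshold `min_conf`, and the parameter-free mean aggregation standing in for a trained GCN layer are all assumptions made for illustration.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def knn(features, i, k):
    """Indices of the k vertices most similar to vertex i (excluding i)."""
    others = [j for j in range(len(features)) if j != i]
    others.sort(key=lambda j: cosine(features[i], features[j]), reverse=True)
    return others[:k]

def build_subgraph(features, confidence, pivot, k=2, hops=2, min_conf=0.6, top=3):
    """Collect confident neighbors within `hops` hops of `pivot`,
    then keep the `top` candidates most similar to the pivot."""
    frontier, seen = [pivot], {pivot}
    for _ in range(hops):
        next_frontier = []
        for v in frontier:
            for j in knn(features, v, k):
                # Assumed rule: low-confidence vertices are never added.
                if j not in seen and confidence[j] >= min_conf:
                    seen.add(j)
                    next_frontier.append(j)
        frontier = next_frontier
    candidates = sorted(
        seen - {pivot},
        key=lambda j: cosine(features[pivot], features[j]),
        reverse=True,
    )
    return [pivot] + candidates[:top]

def mean_aggregate(features, members):
    """Average the features of the subgraph members (a stand-in for
    one graph-convolution step without learned weights)."""
    dim = len(features[0])
    return [sum(features[m][d] for m in members) / len(members) for d in range(dim)]

# Toy embeddings (e.g. CNN outputs) and per-vertex confidences; both are made up.
features = [[1.0, 0.0], [0.9, 0.1], [0.8, 0.3], [0.0, 1.0], [0.1, 0.9]]
confidence = [0.9, 0.8, 0.7, 0.9, 0.3]

members = build_subgraph(features, confidence, pivot=3)
pooled = mean_aggregate(features, members)
```

In this toy setup, vertex 4 is the nearest neighbor of vertex 3 but has low confidence, so it is excluded from the subgraph, and the search reaches the more confident vertices through the second hop instead. A real system would replace `mean_aggregate` with trained GCN layers and a classifier over the pivot vertex.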

G-FAN: Graph-Based Feature Aggregation Network for Video Face Recognition

Feature Agglomeration Networks for Single Stage Face Detection

Selective Domain-Invariant Feature Alignment Network for Face Anti-Spoofing

Learning Discriminative Aggregation Network for Video-Based Face Recognition and Person Re-identification

GCF: Graph Convolutional Networks for Facial Expression Recognition

Video-based Facial Expression Recognition Using Graph Convolutional Networks

Facial Expression Recognition on the High Aggregation Subgraphs

Face recognition with the robust feature extracted by the generalized Foley-Sammon transform

Fianet: Video Object Detection Via Joint Feature-Level and Instance-Level Aggregation

Multi-Stream Facial Adaptive Network for Expression Recognition from a Single Image

FA-GAN: Face Augmentation GAN for Deformation-Invariant Face Recognition

3D Dense Face Alignment with Fused Features by Aggregating CNNs and GCNs

Attention-Driven Graph Neural Network for Deep Face Super-Resolution

Multi-scale spatio-temporal feature adaptive aggregation for video-based Person Re-identification

Recurrent Embedding Aggregation Network for Video Face Recognition

Adaptive graph-based feature normalization for facial expression recognition

FSGAN: Subject Agnostic Face Swapping and Reenactment

Frontal-Centers Guided Face: Boosting Face Recognition by Learning Pose-Invariant Features

Heterogeneous Hierarchical Feature Aggregation Network for Personalized Micro-Video Recommendation

Confusable facial expression recognition with geometry-aware conditional network