Abstract: Facial expressions directly reflect a person's psychological state, so extracting effective features for accurate facial expression recognition is an important research problem. However, when facial information is incomplete, existing convolutional neural networks struggle to extract features. To address this issue, this paper introduces a pyramidal convolutional attention residual network (PCARNet) based on ResNet18. PCARNet combines a pyramidal convolution module with an improved convolutional attention mechanism to extract expression features effectively and achieve high-precision facial expression recognition. The model uses pyramidal convolution to extract facial expression features at multiple scales, capturing both global and local information about the face, and employs grouped convolution to reduce computational complexity and the number of parameters. In addition, to avoid the adverse effect of channel dimensionality reduction on the attention mechanism and to strengthen information exchange across channels, the shared MLP module in the convolutional attention mechanism is replaced by a one-dimensional convolution with an adaptive kernel size. The improved attention mechanism weights the extracted multiscale features along both the channel and spatial dimensions, enhancing the representation of crucial facial features. Experiments on the public datasets Fer2013, RAF-DB, and CK+ yield recognition accuracies of 73.725%, 87.516%, and 95.455%, respectively. Compared with other methods, the proposed approach improves accuracy by at least 1.4%, 2.4%, and 0.25% on the respective datasets, confirming its high reliability and performance.
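The channel-attention change described in the abstract (a one-dimensional convolution with adaptive kernel size in place of the shared MLP) can be illustrated with a minimal sketch. The abstract does not give the kernel-size rule, so the code below assumes the ECA-Net-style rule k = |(log2(C) + b) / gamma| rounded to the nearest odd integer at or above, with gamma = 2 and b = 1. The function names, the uniform placeholder kernel, and the nested-list feature map are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
import math


def adaptive_kernel_size(channels, gamma=2, b=1):
    """Odd kernel size that grows with log2 of the channel count.

    Assumption: ECA-Net-style rule; the abstract does not state the formula.
    """
    t = int(abs((math.log2(channels) + b) / gamma))
    return t if t % 2 == 1 else t + 1


def conv1d_same(values, kernel):
    """Zero-padded 1D cross-correlation; output length equals input length."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(values) + [0.0] * pad
    return [
        sum(w * padded[i + j] for j, w in enumerate(kernel))
        for i in range(len(values))
    ]


def channel_attention(feature_map, kernel=None):
    """Return one weight in (0, 1) per channel.

    feature_map: list of C channels, each an H x W nested list of floats.
    kernel: odd-length 1D kernel shared by both pooled descriptors; in a real
    network these weights are learned, here they default to a uniform
    placeholder.
    """
    c = len(feature_map)
    if kernel is None:
        k = adaptive_kernel_size(c)
        kernel = [1.0 / k] * k
    # Global average and max pooling give two per-channel descriptors,
    # as in CBAM-style channel attention.
    avg_desc = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in feature_map]
    max_desc = [max(map(max, ch)) for ch in feature_map]
    # The 1D convolution mixes neighbouring channels without reducing
    # the channel dimension, unlike a bottleneck MLP.
    mixed = [
        a + m
        for a, m in zip(conv1d_same(avg_desc, kernel), conv1d_same(max_desc, kernel))
    ]
    return [1.0 / (1.0 + math.exp(-x)) for x in mixed]


if __name__ == "__main__":
    # Toy input: 8 channels of 2 x 2 activations.
    fmap = [[[0.1 * c, 0.2 * c], [0.3 * c, 0.4 * c]] for c in range(8)]
    print(adaptive_kernel_size(64), adaptive_kernel_size(512))
    print(channel_attention(fmap))
```

Because the kernel slides over the channel axis, each channel weight depends only on its k nearest neighbours, and the parameter count is k rather than the two dense layers of a shared MLP. In a full model the resulting weights would scale the multiscale feature channels before the spatial attention step.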

Facial Expression Recognition with an Attention Network Using a Single Depth Image

Real-Time Facial Landmark Detection by Attention-driven Lightweight Network

Improving RGB-D Face Recognition via Transfer Learning from a Pretrained 2D Network

Attention mechanism-based CNN for facial expression recognition

Multi-Stream Facial Adaptive Network for Expression Recognition from a Single Image

Facial Expression Recognition Method Combined with Attention Mechanism

A facial expression recognition network based on attention double branch enhanced fusion

A Cascade Attention Based Facial Expression Recognition Network by Fusing Multi-Scale Spatio-Temporal Features

A discriminative multiscale feature extraction network for facial expression recognition in the wild

Facial expression recognition based on improved depthwise separable convolutional network

Distract Your Attention: Multi-Head Cross Attention Network for Facial Expression Recognition

Deep-Emotion: Facial Expression Recognition Using Attentional Convolutional Network

Facial Expression Recognition System Based On Deep Residual Fusion Neural Network

Facial Expression Recognition Methods in the Wild Based on Fusion Feature of Attention Mechanism and LBP

An Efficient Channel Attention CNN for Facial Expression Recognition

Attention Based Convolutional Neural Network for Facial Expression Recognition

Enhanced Hybrid Vision Transformer with Multi-Scale Feature Integration and Patch Dropping for Facial Expression Recognition

Multi-Pose Facial Expression Recognition Based on Generative Adversarial Network

Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism

A Novel Attention Residual Network Expression Recognition Method

ExpNet: Landmark-Free, Deep, 3D Facial Expressions