Abstract: Facial expression recognition (FER) in the wild is an active and challenging field of research. A system for automatic FER finds use in a wide range of applications, including advanced human–computer interaction (HCI), human–robot interaction (HRI), human behavioral analysis, and gaming and entertainment. Since their inception, convolutional neural networks (CNNs) have attained state‐of‐the‐art accuracy in facial analysis tasks. However, recognizing facial expressions in the wild with high confidence on a low‐cost embedded device remains challenging. To this end, this study presents an efficient dual‐channel ensembled deep CNN (DCE‐DCNN) for FER in the wild. Initially, two DCNNs, namely $DCNN_{G}$ and $DCNN_{S}$, are trained separately on the grayscale and Scharr‐convolved vertical gradient facial images, respectively. The proposed network then integrates the two pre‐trained DCNNs to obtain the dual‐channel integrated DCNN (DCI‐DCNN). Finally, all three neural networks, namely $DCNN_{G}$, $DCNN_{S}$, and DCI‐DCNN, are jointly fine‐tuned to obtain a single dual‐channel, multi‐output model. The multi‐output model produces three prediction scores for a given input facial image. These prediction scores are then fused using the max‐voting ensemble scheme to give the final classification label of the DCE‐DCNN. On the FER2013, RAF‐DB, NCAER‐S, AffectNet, and CKPlus benchmark FER datasets, the proposed DCE‐DCNN consistently outperforms the two individual DCNNs and numerous state‐of‐the‐art CNNs. Moreover, the network achieves competitive recognition accuracy on all four in‐the‐wild FER datasets with a reduced memory footprint and fewer parameters. With its high throughput on resource‐limited embedded devices, the proposed DCE‐DCNN model is suitable for applications that require real‐time, high‐confidence classification of facial expressions in the wild.
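The two input channels and the final fusion step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `scharr_vertical` and `max_vote` are hypothetical, the edge padding is an arbitrary choice, "vertical gradient" is assumed to mean the Scharr derivative along the row axis, and "max voting" is assumed to mean a majority vote over the three models' argmax labels, falling back to the most confident model when all three disagree.

```python
import numpy as np

# Standard 3x3 Scharr kernel for the derivative along the vertical (row) axis.
SCHARR_Y = np.array([[-3, -10, -3],
                     [ 0,   0,  0],
                     [ 3,  10,  3]], dtype=np.float32)


def scharr_vertical(gray):
    """Correlate a 2-D grayscale image with SCHARR_Y; output has the input's shape.

    Borders are handled by edge replication (an assumption of this sketch).
    """
    gray = np.asarray(gray, dtype=np.float32)
    h, w = gray.shape
    padded = np.pad(gray, 1, mode="edge")
    out = np.zeros((h, w), dtype=np.float32)
    for dy in range(3):
        for dx in range(3):
            out += SCHARR_Y[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def max_vote(scores):
    """Fuse class scores of shape (n_models, n_classes) into one label.

    Assumed rule: majority vote over each model's argmax label. If no label
    gets more than one vote, return the label of the most confident model.
    Written with the three-model case from the abstract in mind.
    """
    scores = np.asarray(scores, dtype=np.float32)
    labels = scores.argmax(axis=1)
    counts = np.bincount(labels, minlength=scores.shape[1])
    if counts.max() > 1:
        return int(counts.argmax())
    most_confident_model = scores.max(axis=1).argmax()
    return int(labels[most_confident_model])
```

In a full pipeline, the grayscale image and the output of `scharr_vertical` would feed the two channels, and `max_vote` would be applied to the three score vectors produced by the multi-output model.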

Facial Expression Recognition in Video Using 3D-CNN Deep Features Discrimination

Facial Expression Recognition Using Enhanced Deep 3D Convolutional Neural Networks

Facial expression recognition in videos using hybrid CNN & ConvLSTM

Face Recognition Using $Sf_{3}CNN$ With Higher Feature Discrimination

Automatic 4D Facial Expression Recognition via Collaborative Cross-domain Dynamic Image Network

A Video Sequence Face Expression Recognition Method Based on Squeeze-and-Excitation and 3DPCA Network

Visual Scene-aware Hybrid Neural Network Architecture for Video-based Facial Expression Recognition

A dual‐channel ensembled deep convolutional neural network for facial expression recognition in the wild

Spatio-Temporal Facial Expression Recognition Using Convolutional Neural Networks and Conditional Random Fields

A novel modular deep fully convolutional network for efficient low resolution facial expression recognition

Multi-feature based automatic facial expression recognition using deep convolutional neural network

Micro-Facial Expression Recognition in Video Based on Optimal Convolutional Neural Network (MFEOCNN) Algorithm

Deep and Shallow Covariance Feature Quantization for 3D Facial Expression Recognition

Learning Face Expression Features from Video Using Spatio-Temporal Feature Extractor and CNN-LSTM

LBVCNN: Local Binary Volume Convolutional Neural Network for Facial Expression Recognition from Image Sequences

Facial expression recognition with convolutional neural networks via a new face cropping and rotation strategy

Automatic Facial Expression Recognition Based on a Deep Convolutional-Neural-network Structure

Facial Expression Recognition System Based On Deep Residual Fusion Neural Network

Feature Level Analysis for 3D Facial Expression Recognition

A deep-learning-based facial expression recognition method using textural features

Multi-channel Deep 3D Face Recognition