Abstract:Facial expression recognition (FER) in the wild is an active and challenging field of research. A system for automatic FER finds use in a wide range of applications related to advanced human–computer interaction (HCI), human–robot interaction (HRI), human behavioral analysis, gaming and entertainment, etc. Since their inception, convolutional neural networks (CNNs) have attained state‐of‐the‐art accuracy in the facial analysis task. However, recognizing facial expressions in the wild with high confidence running on a low‐cost embedded device remains challenging. To this end, this study presents an efficient dual‐channel ensembled deep CNN (DCE‐DCNN) for FER in the wild. Initially, two DCNNs, namely the DCNNG and DCNNS , are trained separately on the grayscale and Scharr‐convolved vertical gradient facial images, respectively. The proposed network later integrates the two pre‐trained DCNNs to obtain the dual‐channel integrated DCNN (DCI‐DCNN). Finally, all three neural networks, namely the DCNNG , DCNNS , and DCI‐DCNN, are jointly fine‐tuned to get a single dual‐channel‐multi‐output model. The multi‐output model produces three prediction scores for the given input facial image. The prediction scores are thus fused using the max‐voting ensemble scheme to obtain the DCE‐DCNN with the final classification label. On the FER2013, RAF‐DB, NCAER‐S, AffectNet, and CKPlus benchmark FER datasets, the proposed DCE‐DCNN consistently outperforms the two individual DCNNs and numerous state‐of‐the‐art CNNs. Moreover, the network achieves competitive recognition accuracy on all four FER in the wild datasets with reduced memory storage size and parameters. The proposed DCE‐DCNN model with high throughput on resource‐limited embedded devices is suitable for applications that seek real‐time classification of facial expressions in the wild with high confidence.

Training deep networks for facial expression recognition with crowd-sourced label distribution

Cgan Based Facial Expression Recognition for Human-Robot Interaction

SDNET: Lightweight Facial Expression Recognition For Sample Disequilibrium.

Automatic 4D Facial Expression Recognition via Collaborative Cross-domain Dynamic Image Network.

Facial Emotion Recognition with Noisy Multi-task Annotations

Joint Deep Learning of Facial Expression Synthesis and Recognition

Blended Emotion in-the-Wild: Multi-label Facial Expression Recognition Using Crowdsourced Annotations and Deep Locality Feature Learning

Deep Learning from Crowds

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

Facial expression recognition of masked faces using deep learning

Adaptively Learning Facial Expression Representation via C-F Labels and Distillation

Robust Lightweight Facial Expression Recognition Network with Label Distribution Training

A dual‐channel ensembled deep convolutional neural network for facial expression recognition in the wild

Cross-Domain Facial Expression Recognition via Contrastive Warm up and Complexity-Aware Self-Training

Noisy label facial expression recognition via face-specific label distribution learning

Facial Expression Recognition with Contrastive Learning and Uncertainty-Guided Relabeling

GCANet: Geometry Cues-aware Facial Expression Recognition based on Graph Convolutional Networks

Cross-Domain Facial Expression Recognition by Combining Transfer Learning and Face-Cycle Generative Adversarial Network

Omni-supervised Facial Expression Recognition via Distilled Data

Curriculum Learning for Speech Emotion Recognition from Crowdsourced Labels