Abstract:Current works on facial action unit (AU) recognition typically require fully AU-labeled training samples. To reduce the reliance on time-consuming manual AU annotations, we propose a novel semi-supervised AU recognition method leveraging two kinds of readily available auxiliary information. The method leverages the dependencies between AUs and expressions as well as the dependencies among AUs, which are caused by facial anatomy and therefore embedded in all facial images, independent on their AU annotation status. The other auxiliary information is facial image synthesis given AUs, the dual task of AU recognition from facial images, and therefore has intrinsic probabilistic connections with AU recognition, regardless of AU annotations. Specifically, we propose a dual semi-supervised generative adversarial network for AU recognition from partially AU-labeled and fully expression-labeled facial images. The proposed network consists of an AU classifier C, an image generator G , and a discriminator D. In addition to minimize the supervised losses of the AU classifier and the face generator for labeled training data, we explore the probabilistic duality between the tasks using adversary learning to force the convergence of the face-AU-expression tuples generated from the AU classifier and the face generator, and the ground-truth distribution in labeled data for all training data. This joint distribution also includes the inherent AU dependencies. Furthermore, we reconstruct the facial image using the output of the AU classifier as the input of the face generator, and create AU labels by feeding the output of the face generator to the AU classifier. We minimize reconstruction losses for all training data, thus exploiting the informative feedback provided by the dual tasks. Within-database and cross-database experiments on three benchmark databases demonstrate the superiority of our method in both AU recognition and face synthesis compared to state-of-the-art works.

Dual Learning for Joint Facial Landmark Detection and Action Unit Recognition

Boosting Facial Action Unit Detection Through Jointly Learning Facial Landmark Detection and Domain Separation and Reconstruction

Dual Learning for Facial Action Unit Detection under Nonfull Annotation.

Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment

Weakly Supervised Dual Learning for Facial Action Unit Recognition

Dual Semi-Supervised Learning for Facial Action Unit Recognition.

Feature and Label Relation Modeling for Multiple-Facial Action Unit Classification and Intensity Estimation

Joint Patch And Multi-Label Learning For Facial Action Unit Detection

Facial Action Unit Detection Using Attention and Relation Learning

Facial Action Units Detection Aided by Global-Local Expression Embedding

Multiple Facial Action Unit Recognition by Learning Joint Features and Label Relations.

Deep Facial Action Unit Recognition from Partially Labeled Data.

Multiple-Facial Action Unit Recognition by Shared Feature Learning and Semantic Relation Modeling

Exploring Adversarial Learning for Deep Semi-Supervised Facial Action Unit Recognition

Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition

Knowledge-Driven Self-Supervised Representation Learning for Facial Action Unit Recognition

A Novel Dual-channel Graph Convolutional Neural Network for Facial Action Unit Recognition

Learning Deep Representation for Face Alignment with Auxiliary Attributes

Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

Meta Auxiliary Learning for Facial Action Unit Detection

Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition