Abstract: The inherent dependencies among facial action units (AUs), which arise from the underlying anatomical mechanisms, are essential for recognizing AUs and estimating their intensity levels, but they have not been exploited to their full potential. We propose novel methods that recognize AUs and estimate AU intensity via hybrid Bayesian networks (BNs). The upper two layers are latent regression BNs (LRBNs), and the lower layers are BNs. The visible nodes of the LRBN layers represent ground-truth AU occurrences or AU intensities. Through the directed connections from the latent layer to the visible layer, an LRBN can represent relationships among multiple AUs or AU intensities. The lower layers consist of two-node BNs for AU recognition and three-node BNs for AU intensity estimation. These bottom layers combine measurements from facial images with AU dependencies for AU recognition and intensity estimation. We propose efficient learning algorithms for the hybrid BNs for both tasks. Furthermore, because AU relationships are closely related to facial expressions, we extend the proposed hybrid BN models to facial expression-assisted AU recognition and intensity estimation. We evaluate our methods on three benchmark databases for AU recognition and two benchmark databases for intensity estimation. The results demonstrate that the proposed approaches faithfully model the complex, global inherent AU dependencies, and that expression labels available only during training can boost the estimation of AU dependencies for both AU recognition and intensity estimation.
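As a rough illustration of how a two-node BN in the lower layer could combine an image measurement with a prior belief about an AU, the sketch below applies Bayes' rule to a single binary AU node with one binary measurement child. It is not the authors' implementation: the function name `au_posterior`, the binary measurement model, and all probability values are hypothetical, and in the proposed method the prior would come from the LRBN layers rather than a fixed number.

```python
def au_posterior(prior_on, p_meas_on_given_on, p_meas_on_given_off, measured_on):
    """Posterior P(AU = 1 | measurement) in a two-node BN: AU -> measurement.

    prior_on            -- assumed prior P(AU = 1), e.g. supplied by a label-dependency model
    p_meas_on_given_on  -- assumed P(measurement = 1 | AU = 1)
    p_meas_on_given_off -- assumed P(measurement = 1 | AU = 0)
    measured_on         -- observed binary measurement from the image
    """
    # Likelihood of the observed measurement under each AU state.
    like_on = p_meas_on_given_on if measured_on else 1.0 - p_meas_on_given_on
    like_off = p_meas_on_given_off if measured_on else 1.0 - p_meas_on_given_off

    # Bayes' rule: normalize the joint over the two AU states.
    joint_on = like_on * prior_on
    joint_off = like_off * (1.0 - prior_on)
    return joint_on / (joint_on + joint_off)


# Hypothetical numbers: a fairly reliable detector and two different priors.
neutral = au_posterior(0.5, 0.8, 0.2, measured_on=True)
skeptical = au_posterior(0.2, 0.8, 0.2, measured_on=True)
print(neutral, skeptical)
```

With the same positive measurement, the lower prior yields a lower posterior, which is the sense in which dependency information from the upper layers can correct noisy image measurements.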

Multiple Facial Action Unit Recognition by Learning Joint Features and Label Relations

Capturing Feature and Label Relations Simultaneously for Multiple Facial Action Unit Recognition

Multiple Facial Action Unit Recognition Enhanced by Facial Expressions

Feature and Label Relation Modeling for Multiple-Facial Action Unit Classification and Intensity Estimation

Facial Action Unit Recognition and Intensity Estimation Enhanced Through Label Dependencies

Deep Facial Action Unit Recognition from Partially Labeled Data

Multiple-Facial Action Unit Recognition by Shared Feature Learning and Semantic Relation Modeling

Capturing Global Semantic Relationships for Facial Action Unit Recognition

Joint Patch And Multi-Label Learning For Facial Action Unit Detection

Dual Learning for Joint Facial Landmark Detection and Action Unit Recognition

Capture expression-dependent AU relations for expression recognition

Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

Facial Action Unit Detection Using Attention and Relation Learning

Exploring Domain Knowledge for Facial Expression-Assisted Action Unit Activation Recognition

Facial Action Unit recognition by relation modeling from both qualitative knowledge and quantitative data

Facial Action Units Detection Aided by Global-Local Expression Embedding

Facial Action Unit Classification with Hidden Knowledge under Incomplete Annotation

Expression-assisted Facial Action Unit Recognition under Incomplete AU Annotation

Knowledge-Driven Self-Supervised Representation Learning for Facial Action Unit Recognition

Deep Facial Action Unit Recognition and Intensity Estimation from Partially Labelled Data

Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment