3D-FERNet: A Facial Expression Recognition Network utilizing 3D information

Jiaxiang Shang, Yajing Chen
DOI: https://doi.org/10.1109/ICPR56361.2022.9956497
2022-08-21
Abstract: In this paper, we propose a 3D information-based facial expression recognition network (3D-FERNet), which effectively combines identity, 3D, and 2D expression information for facial expression recognition (FER). The 3D-FERNet model consists of three feature encoders that extract identity, 2D expression, and 3D expression features, and one semantic-based feature fusion module that combines these features for classification. First, the identity encoder is built from a face identity recognition network, which is used to perform identity-conditional FER and remove identity bias from the FER predictions. Second, we build the 3D expression encoder from a 3D face reconstruction network, which is trained in a semi-supervised manner on a variety of unlabeled expression data. The generated 3D features can capture the subtle differences between similar expressions. Third, for the 2D expression encoder, most existing FER networks can be used as the backbone. Finally, after obtaining these facial features, we propose a semantic-based feature fusion module, which uses an attention mechanism to combine the features. Experiments show that the proposed 3D-FERNet achieves state-of-the-art classification accuracy on multiple benchmark datasets. Moreover, the identity feature encoder and the 3D feature encoder can serve as two general modules that plug into most existing FER models.
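
The abstract describes fusing identity, 2D expression, and 3D expression features with an attention mechanism but gives no details. The following is a minimal sketch of one way attention-based fusion of three feature vectors could look, not the authors' module. It assumes scaled dot-product attention against a single query vector; the names attention_fuse and query, the feature dimension, and the random placeholder features are all hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(features, query):
    """Fuse equal-length feature vectors with scaled dot-product attention.

    features: list of vectors, e.g. [identity, 2D expression, 3D expression]
    query:    hypothetical query vector (learned in a real model)
    Returns (fused_vector, attention_weights).
    """
    dim = len(query)
    # One relevance score per feature vector.
    scores = [
        sum(f_k * q_k for f_k, q_k in zip(feat, query)) / math.sqrt(dim)
        for feat in features
    ]
    weights = softmax(scores)
    # Weighted sum of the feature vectors, computed per dimension.
    fused = [
        sum(w * feat[k] for w, feat in zip(weights, features))
        for k in range(dim)
    ]
    return fused, weights


# Placeholder features standing in for the three encoder outputs (assumption).
rng = random.Random(0)
dim = 8
identity_feat = [rng.gauss(0, 1) for _ in range(dim)]
expr2d_feat = [rng.gauss(0, 1) for _ in range(dim)]
expr3d_feat = [rng.gauss(0, 1) for _ in range(dim)]
query = [rng.gauss(0, 1) for _ in range(dim)]

fused, weights = attention_fuse([identity_feat, expr2d_feat, expr3d_feat], query)
```

In this sketch the fused vector would then be passed to a classifier head; the weights sum to one, so they can be read as the relative contribution of each encoder to the fused representation.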
Computer Science