Adaptive Metric Learning with Deep Neural Networks for Video-Based Facial Expression Recognition

Xiaofeng Liu,Yubin Ge,Chao Yang,Ping Jia
DOI: https://doi.org/10.1117/1.jei.27.1.013022
IF: 0.829
2018-01-01
Journal of Electronic Imaging
Abstract:Video-based facial expression recognition has become increasingly important for plenty of applications in the real world. Despite that numerous efforts have been made for the single sequence, how to balance the complex distribution of intra- and interclass variations well between sequences has remained a great difficulty in this area. We propose the adaptive (N + M)-tuplet clusters loss function and optimize it with the softmax loss simultaneously in the training phrase. The variations introduced by personal attributes are alleviated using the similarity measurements of multiple samples in the feature space with many fewer comparison times as conventional deep metric learning approaches, which enables the metric calculations for large data applications (e.g., videos). Both the spatial and temporal relations are well explored by a unified framework that consists of an Inception-ResNet network with long short term memory and the two fully connected layer branches structure. Our proposed method has been evaluated with three well-known databases, and the experimental results show that our method outperforms many state-of-the-art approaches. (c) 2018 SPIE and IS&T
What problem does this paper attempt to address?