Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition

Ren Zhang,Ning He,Shengjie Liu,Ying Wu,Kang Yan,Yuzhe He,Ke Lu
DOI: https://doi.org/10.1007/s13735-022-00250-9
2022-10-10
International Journal of Multimedia Information Retrieval
Abstract:Micro-expressions can convey feelings that people are trying to hide. At present, some studies on micro-expression, most of which only use the temporal or spatial information in the image to recognize micro-expressions, neglect the intrinsic features of the image. To solve this problem, we detect the subject's heart rate in the long micro-expression videos; we extract the image's spatio-temporal feature through a spatio-temporal network and then extract the heart rate feature using a heart rate network. A multimodal learning method that combines the heart rate and spatio-temporal features is used to recognize micro-expressions. The experimental results on CASMEII , SAMM , and SMIC show that the proposed methods' unweighted F1-score and unweighted average recall are 0.8867 and 0.8962, respectively. The spatio-temporal fusion network combined with heart rate information provides an essential reference for multimodal approaches to micro-expression recognition.
computer science, artificial intelligence, software engineering
What problem does this paper attempt to address?