Abstract: Creating a large and natural facial expression database is a prerequisite for facial expression analysis and classification. It is, however, not only time consuming but also difficult to capture an adequately large number of spontaneous facial expression images and their meanings because no standard, uniform, and exact measurements are available for database collection and annotation. Thus, comprehensive first-hand data analyses of a spontaneous expression database may provide insight for future research on database construction, expression recognition, and emotion inference. This paper presents our analyses of a multimodal spontaneous facial expression database of natural visible and infrared facial expressions (NVIE). First, the effectiveness of emotion-eliciting videos in the database collection is analyzed with the mean and variance of the subjects' self-reported data. Second, an interrater reliability analysis of raters' subjective evaluations for apex expression images and sequences is conducted using Kappa and Kendall's coefficients. Third, we propose a matching rate matrix to explore the agreements between displayed spontaneous expressions and felt affective states. Lastly, the thermal differences between the posed and spontaneous facial expressions are analyzed using a paired-samples t-test. The results of these analyses demonstrate the effectiveness of our emotion-inducing experimental design, the gender difference in emotional responses, and the coexistence of multiple emotions/expressions. Facial image sequences are more informative than apex images for both expression and emotion recognition. Labeling an expression image or sequence with multiple categories together with their intensities could be a better approach than labeling the expression image or sequence with one dominant category.
The results also demonstrate both the importance of facial expressions as a means of communication to convey affective states and the diversity of the displayed manifestations of felt emotions. For most expressions, the temperature difference data of the posed and spontaneous versions differ significantly, and many of these differences are found in the forehead and cheek regions.
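Two of the statistics named in the abstract, the kappa coefficient for interrater agreement and the paired-samples t-test, can be illustrated with a minimal sketch. The abstract does not say which kappa variant was used, so the two-rater Cohen form is assumed here; the function names and all labels and temperature values are hypothetical toy data, not taken from NVIE.

```python
from collections import Counter
from math import sqrt
from statistics import mean, stdev


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labeling the same items (assumed variant)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty item set")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1 - expected)


def paired_t_statistic(x, y):
    """Paired-samples t statistic and degrees of freedom for matched samples."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two matched pairs")
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    # t = mean difference divided by its standard error.
    return mean(diffs) / (stdev(diffs) / sqrt(n)), n - 1


# Hypothetical expression labels from two raters for four apex images.
labels_a = ["happy", "happy", "fear", "disgust"]
labels_b = ["happy", "fear", "fear", "disgust"]
kappa = cohen_kappa(labels_a, labels_b)

# Hypothetical forehead temperature changes (deg C), same subjects, posed vs spontaneous.
posed = [0.5, 0.7, 0.2, 0.9]
spontaneous = [0.3, 0.4, 0.1, 0.5]
t_value, dof = paired_t_statistic(posed, spontaneous)

print(f"kappa = {kappa:.3f}, t = {t_value:.3f}, df = {dof}")
```

The t statistic would still have to be compared against a t distribution with the reported degrees of freedom to obtain a p-value, a step this sketch leaves out.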

Building a Chinese Natural Emotional Audio-Visual Database

CHEAVD: a Chinese natural emotional audio–visual database

Construction and Evaluation of Mandarin Multimodal Emotional Speech Database

Exploring Spatio-Temporal Representations by Integrating Attention-based Bidirectional-LSTM-RNNs and FCNs for Speech Emotion Recognition

The Mandarin Chinese auditory emotions stimulus database: A validated set of Chinese pseudo-sentences

HEU Emotion: A Large-scale Database for Multi-modal Emotion Recognition in the Wild

CNAMD Corpus: A Chinese Natural Audiovisual Multimodal Database of Conversations for Social Interactive Agents

MASC: A Speech Corpus in Mandarin for Emotion Analysis and Affective Speaker Recognition

CHAD: a chinese affective database

EmoSpeech: A Corpus of Emotionally Rich and Contextually Detailed Speech Annotations

MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation

An EEG-Based Multi-Modal Emotion Database with Both Posed and Authentic Facial Actions for Emotion Analysis

A standardized database of Chinese emotional short videos based on age and gender differences

MPED: A Multi-Modal Physiological Emotion Database for Discrete Emotion Recognition

Design, construction and evaluation of emotional multimodal pathological speech database

MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild

Werewolf-XL: A Database for Identifying Spontaneous Affect in Large Competitive Group Interactions

Analyses of a Multimodal Spontaneous Facial Expression Database

M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database

Exploiting EEG signals and audiovisual feature fusion for video emotion recognition

MES-P: an Emotional Tonal Speech Dataset in Mandarin Chinese with Distal and Proximal Labels