ERIT Lightweight Multimodal Dataset for Elderly Emotion Recognition and Multimodal Fusion Evaluation

Rita Frieske,Bertrand E. Shi
2024-07-25
Abstract:ERIT is a novel multimodal dataset designed to facilitate research in a lightweight multimodal fusion. It contains text and image data collected from videos of elderly individuals reacting to various situations, as well as seven emotion labels for each data sample. Because of the use of labeled images of elderly users reacting emotionally, it is also facilitating research on emotion recognition in an underrepresented age group in machine learning visual emotion recognition. The dataset is validated through comprehensive experiments indicating its importance in neural multimodal fusion research.
Computer Vision and Pattern Recognition,Computation and Language
What problem does this paper attempt to address?
The main goal of this paper is to create a multimodal dataset named ERIT to facilitate emotion recognition research for the elderly population and support lightweight multimodal fusion evaluation. Specifically: 1. **Addressing the need for emotion recognition in the elderly**: With the growing elderly population, it becomes particularly important to develop efficient emotion recognition systems tailored to the characteristics of the elderly. The ERIT dataset aims to provide a rich data source for training and evaluating emotion recognition models for the elderly. 2. **Lightweight multimodal fusion evaluation**: The ERIT dataset includes text and image data extracted from videos, aiming to simplify the evaluation process of multimodal fusion tasks. By using only 7 basic emotion labels (anger, disgust, fear, happiness, sadness, surprise, and neutral), this dataset enables easy research on emotion recognition across different modalities. 3. **Filling the research gap in emotion recognition for the elderly**: Existing emotion recognition datasets often overlook this specific age group. The ERIT dataset attempts to fill this gap by publicly releasing a facial expression emotion recognition dataset. Unlike some previous elderly facial emotion recognition datasets, the emotional expressions in ERIT are natural responses of the elderly to presented materials, rather than acted emotions. Through these efforts, the ERIT dataset not only helps improve the accuracy of emotion recognition in elderly care and health-related applications but also provides a solid foundation for developing more precise and robust emotion recognition systems for the elderly.