Cross-Modal Diversity-Based Active Learning for Multi-Modal Emotion Estimation.

Yifan Xu,Lubin Meng,Ruimin Peng,Yingjie Yin,Jingting Ding,Liang Li,Dongrui Wu
DOI: https://doi.org/10.1109/ijcnn54540.2023.10191581
2023-01-01
Abstract:Emotion recognition is an important part of affective computing. Utilizing information from multiple modalities would facilitate more accurate emotion recognition. The performance of data-driven machine learning models usually relies on a large amount of labeled training data. However, labeling emotional data is expensive, because each sample usually requires multiple evaluators to annotate. To alleviate the annotation cost, this paper proposes a cross-modal diversity measure that considers the correlation between different modalities and integrates it with the representativeness for sample selection in unsupervised active learning (AL) for regression. To our knowledge, this challenging multi-modal unsupervised AL scenario has not been explored before: previous research only considered either unsupervised uni-modal AL or supervised multi-modal AL. Experiments on RECOLA and IEMOCAP datasets demonstrated the effectiveness of our proposed AL approach.
What problem does this paper attempt to address?