Active Learning For Dimensional Speech Emotion Recognition

Wenjing Han,Haifeng Li,Huabin Ruan,Lin Ma,Jiayin Sun,Bjoern Schuller
DOI: https://doi.org/10.21437/interspeech.2013-247
2013-01-01
Abstract:State-of-the-art dimensional speech emotion recognition systems are trained using continuously labelled instances. The data labelling process is labour intensive and time-consuming. In this paper, we propose to apply active learning to reduce according efforts: The unlabelled instances are evaluated automatically, and only the most informative ones are intelligently picked by an informativeness measure function for a human to label. Specifically, we estimate the informativeness of each unlabelled instance based on a binary-classification confidence score for an emotion being predicted to be negative or positive on a given emotional dimension. For verification, we consider a pool-based and a stream-based scenario run on part of the continuous AVEC 2012 task to demonstrate the feasibility of the proposed approach in practice. In the result, our approach requires significantly less human labelled data instances to reach a given performance than passive learning does in both scenarios. Index Terms: Active Learning, Speech Emotion Recognition, Affective Computing, Continuous Emotion Representation
What problem does this paper attempt to address?