Transferable Positive/Negative Speech Emotion Recognition via Class-wise Adversarial Domain Adaptation

Hao Zhou, Ke Chen
DOI: https://doi.org/10.1109/icassp.2019.8683299
2019-01-01
Abstract: Speech emotion recognition plays an important role in building more intelligent and human-like agents. Because emotional speech data is difficult to collect, an increasingly popular solution is to leverage a related, data-rich source corpus to help with recognition on the target corpus. However, domain shift between the corpora poses a serious challenge, making domain adaptation difficult even for the recognition of positive/negative emotions. In this work, we propose class-wise adversarial domain adaptation to address this challenge by reducing the shift for all classes between different corpora. Experiments on the well-known corpora EMODB and Aibo demonstrate that our method is effective even when only a very limited number of labeled target examples is provided.