Cross-corpus speech emotion recognition using transfer semi-supervised discriminant analysis

Peng Song,Xinran Zhang,Shifeng Ou,Jingjing Liu,Yanwei Yu,Wenming Zheng
DOI: https://doi.org/10.1109/ISCSLP.2016.7918395
2016-01-01
Abstract:Many speech emotion recognition approaches have been presented in recent years, and most of them assume that emotional speech utterances in training and testing corpora are collected under the same conditions. However, in many real applications, this assumption does not hold as the training data and testing data are often obtained from different scenarios, e.g., ages, noises, languages. To address this problem, in this paper, a novel transfer learning approach, called transfer semi-supervised linear discriminant analysis (TSDA), is presented for cross-corpus speech emotion recognition. On one hand, the distribution similarity between source and target databases is considered. On the other hand, the semi-supervised linear discriminant analysis (SDA) algorithm is adopted for feature dimension reduction. Finally, the transfer SDA method, which jointly optimizes the SDA and distribution similarity measurement together, is proposed. Experiments are carried out on public emotional datasets, and results demonstrate the effectiveness of our proposed approach.
What problem does this paper attempt to address?