Learning Corpus-Invariant Discriminant Feature Representations for Speech Emotion Recognition.

Peng Song, Shifeng Ou, Zhenbin Du, Yanyan Guo, Wenming Ma, Jinglei Liu, Wenming Zheng
DOI: https://doi.org/10.1587/transinf.2016edl8222
2017-01-01
IEICE Transactions on Information and Systems
Abstract: Speech emotion recognition is an active topic in speech signal processing, and methods for it have developed rapidly in recent years, with some satisfactory results. However, most of these methods are trained and evaluated on the same corpus. In practice, the training and testing data are often collected from different corpora, whose feature distributions differ. These discrepancies greatly degrade recognition performance. To tackle this problem, a novel corpus-invariant discriminant feature representation algorithm, called transfer discriminant analysis (TDA), is presented for speech emotion recognition. The basic idea of TDA is to integrate the kernel linear discriminant analysis (LDA) algorithm and a similarity measure between distributions into one objective function. Experimental results under cross-corpus conditions show that the proposed method significantly improves recognition rates.
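The idea of putting a discriminant criterion and a distribution-similarity term into a single objective can be illustrated with a small sketch. The abstract does not state the TDA objective itself, so everything below is an assumption made for illustration: a plain linear projection stands in for kernel LDA, the squared difference of projected corpus means (a linear-kernel maximum mean discrepancy) stands in for the similarity measure, and the names `tda_like_objective` and `mu` are hypothetical.

```python
from statistics import fmean

def project(X, w):
    """Project each feature vector in X onto direction w."""
    return [sum(xi * wi for xi, wi in zip(x, w)) for x in X]

def fisher_terms(z, y):
    """Return (between-class, within-class) scatter of 1-D projections z."""
    overall = fmean(z)
    between = within = 0.0
    for c in set(y):
        zc = [zi for zi, yi in zip(z, y) if yi == c]
        mc = fmean(zc)
        between += len(zc) * (mc - overall) ** 2
        within += sum((zi - mc) ** 2 for zi in zc)
    return between, within

def mean_discrepancy(z_src, z_tgt):
    """Squared gap between projected corpus means (assumed similarity measure)."""
    return (fmean(z_src) - fmean(z_tgt)) ** 2

def tda_like_objective(w, Xs, ys, Xt, mu=1.0):
    """Fisher-style ratio whose denominator also penalizes corpus mismatch.

    Hypothetical objective, not the one from the paper: mu trades off
    class separation on the labeled source corpus against the distance
    between the source and target corpora after projection.
    """
    zs, zt = project(Xs, w), project(Xt, w)
    between, within = fisher_terms(zs, ys)
    return between / (within + mu * mean_discrepancy(zs, zt))

# Toy labeled source corpus: two emotion classes in a 2-D feature space.
Xs = [(0.0, 0.0), (0.5, 0.2), (-0.5, -0.2), (2.0, 3.0), (2.5, 3.2), (1.5, 2.8)]
ys = [0, 0, 0, 1, 1, 1]
# Toy unlabeled target corpus: same points shifted by +4 in the second feature.
Xt = [(x0, x1 + 4.0) for x0, x1 in Xs]

for mu in (0.0, 1.0):
    scores = {w: tda_like_objective(w, Xs, ys, Xt, mu) for w in [(1.0, 0.0), (0.0, 1.0)]}
    print(f"mu={mu}: preferred direction {max(scores, key=scores.get)}")
```

In this toy setup the second feature separates the classes best on the source corpus but carries the corpus shift, so a non-zero `mu` moves the preference to the first feature, which is less discriminative but consistent across corpora. A fuller treatment would search over all directions (for example via a generalized eigenproblem) and work in a kernel-induced feature space.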