Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition.

Xiuzhen Chen, Xiaoyan Zhou, Cheng Lu, Yuan Zong, Wenming Zheng, Chuangao Tang
DOI: https://doi.org/10.1587/transinf.2019edl8038
2019-01-01
IEICE Transactions on Information and Systems
Abstract: For cross-corpus speech emotion recognition (SER), a crucial issue is how to obtain an effective feature representation that eliminates the discrepancy between the feature distributions of the source and target domains. In this paper, we propose a Target-adapted Subspace Learning (TaSL) method for cross-corpus SER. TaSL tries to find a projection subspace in which the features regress the labels more accurately and the gap between the feature distributions of the target and source domains is bridged effectively. Then, to obtain a better projection matrix, ℓ1-norm and ℓ2,1-norm penalty terms are added to different regularization terms, respectively. Finally, we conduct extensive experiments on three public corpora: EmoDB, eNTERFACE, and AFEW 4.0. The experimental results show that the proposed method achieves better performance than state-of-the-art methods on cross-corpus SER tasks.
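The two penalties named in the abstract have standard definitions: the ℓ1 norm sums the absolute values of all entries, and the ℓ2,1 norm sums the Euclidean norms of a matrix's rows. The NumPy sketch below computes both. The abstract does not state the TaSL objective, so `sketch_objective` is purely an illustrative assumption: its form (a least-squares regression term, an ℓ1 penalty on the projected gap between domain means, and an ℓ2,1 penalty on the projection matrix), its argument names, and the weights `lam` and `mu` are hypothetical and should not be read as the authors' formulation.

```python
import numpy as np


def l1_norm(M):
    """Sum of absolute values of all entries."""
    return float(np.abs(M).sum())


def l21_norm(M):
    """Sum of the Euclidean norms of the rows of a 2-D matrix."""
    return float(np.linalg.norm(M, axis=1).sum())


def sketch_objective(P, Xs, Ys, Xt, lam=1.0, mu=1.0):
    """Hypothetical objective, NOT taken from the paper.

    Assumed shapes: P is (d, c), Xs is (d, n_s), Ys is (c, n_s), Xt is (d, n_t).
    """
    # Least-squares fit of source labels from projected source features.
    regression = np.linalg.norm(Ys - P.T @ Xs, "fro") ** 2
    # Gap between source and target feature means after projection.
    mean_gap = P.T @ (Xs.mean(axis=1) - Xt.mean(axis=1))
    # Assumed assignment of the penalties: l1 on the gap, l2,1 on P.
    return float(regression + lam * l1_norm(mean_gap) + mu * l21_norm(P))
```

An ℓ2,1 penalty on a projection matrix is commonly used to drive entire rows to zero, which acts as feature selection, whereas an ℓ1 penalty encourages sparsity entry by entry.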