Speech Emotion Recognition Based on Robust Discriminative Sparse Regression

Peng Song,Wenming Zheng,Yanwei Yu,Shifeng Ou
DOI: https://doi.org/10.1109/tcds.2020.2990928
IF: 4.546
2021-01-01
IEEE Transactions on Cognitive and Developmental Systems
Abstract:Speech emotion recognition has recently attracted much interest due to the widespread of multimedia data. It generally involves two basic problems: 1) feature extraction and 2) emotion classification. Most previous algorithms just focus on solving one of these two problems. In this article, we aim to deal with these two problems in a joint learning framework, and present a novel regression algorithm, namely, robust discriminative sparse regression (RDSR). In RDSR, we propose a sparse regression algorithm to make our model robust to outliers and noises, and introduce a feature selection regularization constraint simultaneously to select the most discriminative and relevant features. In addition, to well predict the labels, we exploit the local and global consistency over labels, and incorporate it into the proposed framework. To solve the objective function of RDSR, we design an efficient alternative optimization algorithm. Finally, experimental results on several public emotion data sets verify the effectiveness and the superiority of our proposed method.
What problem does this paper attempt to address?