Emotion recognition using support vector machine and deep neural network

R Chen,Y Zhou,Y Qian
DOI: https://doi.org/10.1007/978-981-10-8111-8_12
2017-01-01
Abstract:Emotion recognition from voice has recently attracted considerable interest in the fields of human-machine communication. In this paper, we propose an emotion recognition system which is a combination of three subsystems. The first and second subsystems utilize support vector machines (SVM) and deep neural networks (DNN) respectively to classify the features directly. In the third subsystem, we utilize DNN to extract segment-level features from raw data and show that they are effective for speech emotion recognition. The extracted segment-level features are emotion state probability distribution. Then we construct utterance-level features from segment-level probability distributions. Finally, utterance-level features are fed into a SVM to identify the emotions for each utterance. The experimental results show that all the subsystems outperform the hidden markov model (HMM) baseline, and the combined system get the best performance on F-score.
What problem does this paper attempt to address?