Speech Interactive Emotion Recognition System Based on Random Forest

Susu Yan,Liang Ye,Shuai Han,Tian Han,Yue Li,Esko Alasaarela
DOI: https://doi.org/10.1109/iwcmc48107.2020.9148117
2020-01-01
Abstract:In daily life, speech is the main medium of human communication, and interpersonal communication is emotional. People hope that the computer can give a response based on the emotions contained in the voice. In this paper, we build a Wechat program of speech emotion recognition system, which is based on a random forest classifier. Firstly, the system preprocesses the collected speech signals in order to reduce noise. Secondly, 16 acoustic features are extracted from the pre-processed speech signals. The system obtains the emotional features of speech by applying 12 statistical functions to the original acoustic features. The emotional classification of Berlin Speech Emotion Database uses two classifiers: the Random Forest Classifier and the Support Vector Machine. The recognition accuracy of the SVM classifier is 83%. The accuracy of the random forest classifier is 89%. Finally, the random forest classifier is used to build the speech emotion recognition system.
What problem does this paper attempt to address?