A Comparative Analysis of Different Approach for Basic Emotions Recognition from Speech

Shammy Shikder Shanta,Sham-E-Ansari,Atiqul Islam Chowdhury,Mohammad Munem Shahriar,Khairul Hasan,Md. Sham-E-Ansari,Md. Khairul Hasan
DOI: https://doi.org/10.1109/icecit54077.2021.9641208
2021-09-14
Abstract:Human-Computer Interaction is one of the most progressive technology of recent times, and speech emotion recognition is one of the factors working behind this technology. The communication between two humans via verbal expressions is a tough process when this process is about to be implemented on robots or any system. If the robot or system cannot recognize the emotion that it receives from the speech, it will be a disastrous situation. This article shows some Machine Learning and Deep Learning approaches for recognizing emotions from speech. In this paper, many algorithms were compared to bring out a better model to perform the task more precisely. Algorithms like Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), Deep Neural Network (DNN), and Convolutional Neural Network (CNN) were used here. Among these algorithms, the CNN model has performed well by giving an accuracy of more than 85%. Moreover, this customized CNN model has also outperformed all the models that were applied to the Emo-DB dataset.
What problem does this paper attempt to address?