Speech Expression Multimodal Emotion Recognition Based on Deep Belief Network

Dong Liu, Longxi Chen, Zhiyong Wang, Guangqiang Diao
DOI: https://doi.org/10.1007/s10723-021-09564-0
2021-05-18
Journal of Grid Computing
Abstract: To address the insufficient information and poor recognition rate of single-modality emotion recognition, a multimodal emotion recognition method based on a deep belief network is proposed. First, speech and expression signals are preprocessed and features are extracted to obtain high-level features for each modality. Then, a bimodal deep belief network (BDBN) fuses the high-level speech and expression features, yielding multimodal fusion features for classification while removing redundant information between the modalities. Finally, the multimodal fusion features are classified with LIBSVM to produce the final emotion recognition result. The proposed model is evaluated experimentally on the Friends data set. The results show that the multimodal fusion features achieve the best recognition accuracy, 90.89%, and that the unweighted recognition accuracy of the proposed model is 86.17%, which is better than the other methods compared, suggesting that the approach has research value and practical applicability.
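The abstract reports an "unweighted recognition accuracy" without defining it. A minimal sketch follows, assuming the convention common in emotion recognition work: weighted accuracy is the overall fraction of correct predictions, and unweighted accuracy is the mean of the per-class recalls. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def weighted_accuracy(y_true, y_pred):
    """Overall fraction of samples classified correctly."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def unweighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls, so every class counts equally."""
    total = defaultdict(int)  # samples per true class
    hit = defaultdict(int)    # correctly classified samples per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            hit[t] += 1
    recalls = [hit[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical, imbalanced toy labels (not data from the paper).
y_true = ["neutral"] * 8 + ["anger"] * 2
y_pred = ["neutral"] * 8 + ["neutral", "anger"]

wa = weighted_accuracy(y_true, y_pred)    # 9 of 10 samples are correct
ua = unweighted_accuracy(y_true, y_pred)  # recalls are 1.0 and 0.5
print(f"weighted={wa:.2f} unweighted={ua:.2f}")
```

Under this assumption, a classifier that favors the majority class scores well on weighted accuracy but lower on unweighted accuracy, which is one reason the two figures in a paper can differ.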
computer science, information systems, theory & methods