Recognition of Emotions Using Multimodal Physiological Signals and an Ensemble Deep Learning Model

Zhong Yin, Mengyuan Zhao, Yongxiong Wang, Jingdong Yang, Jianhua Zhang
DOI: https://doi.org/10.1016/j.cmpb.2016.12.005
IF: 6.1
2017-01-01
Computer Methods and Programs in Biomedicine
Abstract:
BACKGROUND AND OBJECTIVE: Using deep-learning methodologies to analyze multimodal physiological signals has become increasingly attractive for recognizing human emotions. However, conventional deep emotion classifiers may suffer from two drawbacks: a lack of expertise for determining the model structure, and oversimplified combination of multimodal feature abstractions.
METHODS: In this study, a multiple-fusion-layer based ensemble classifier of stacked autoencoders (MESAE) is proposed for recognizing emotions, in which the deep structure is identified with a physiological-data-driven approach. Each stacked autoencoder (SAE) consists of three hidden layers that filter unwanted noise in the physiological features and derive stable feature representations. An additional deep model is used to build the SAE ensemble. The physiological features are split into several subsets according to the feature extraction approach, and each subset is encoded separately by an SAE. The derived SAE abstractions are combined according to physiological modality to create six sets of encodings, which are then fed to a three-layer, adjacent-graph-based network for feature fusion. The fused features are used to recognize binary arousal or valence states.
RESULTS: The DEAP multimodal database was employed to validate the performance of the MESAE. Compared with the best existing emotion classifier, the mean of the classification rate and F-score improved by 5.26%.
CONCLUSIONS: The superiority of the MESAE over state-of-the-art shallow and deep emotion classifiers has been demonstrated under different sizes of the available physiological instances.
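The data flow described in METHODS can be sketched as a forward pass. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the group names, feature widths, hidden-layer sizes, and code dimension are hypothetical, the weights are random rather than pretrained, "three-layer" is read here as three dense weight layers, and plain dense layers stand in for the adjacent-graph-based fusion network, whose construction the abstract does not specify.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def init_layers(sizes):
    """Random (untrained) weights and biases for a chain of dense layers."""
    return [(rng.normal(0.0, 0.1, size=(m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def forward(x, layers):
    """Pass a batch through each dense layer with a sigmoid activation."""
    for W, b in layers:
        x = sigmoid(x @ W + b)
    return x

# Hypothetical layout: six modality groups, each holding one or more
# feature subsets (one per feature extraction approach). Widths are made up.
GROUPS = {
    "group_1": [20, 12],
    "group_2": [16],
    "group_3": [10, 10],
    "group_4": [8],
    "group_5": [14],
    "group_6": [6, 6],
}
CODE_DIM = 4  # assumed width of each SAE's final abstraction

# One encoder with three hidden layers per feature subset.
# Only the encoding half of each SAE is modelled; pretraining is omitted.
encoders = {
    name: [init_layers([d, 16, 8, CODE_DIM]) for d in dims]
    for name, dims in GROUPS.items()
}

# Stand-in fusion network: three dense layers ending in one output unit.
n_subsets = sum(len(dims) for dims in GROUPS.values())
fusion = init_layers([n_subsets * CODE_DIM, 16, 8, 1])

def predict(features):
    """features maps group name -> list of (batch, width) arrays."""
    group_codes = []
    for name, subsets in features.items():
        # Encode every subset separately, then combine per modality group.
        codes = [forward(x, enc) for x, enc in zip(subsets, encoders[name])]
        group_codes.append(np.concatenate(codes, axis=1))
    # Fuse the six group encodings and score the binary state.
    fused = np.concatenate(group_codes, axis=1)
    return forward(fused, fusion)[:, 0]

batch = 5
features = {
    name: [rng.normal(size=(batch, d)) for d in dims]
    for name, dims in GROUPS.items()
}
probs = predict(features)            # score in (0, 1) per instance
labels = (probs >= 0.5).astype(int)  # 1 = high, 0 = low arousal or valence
```

Keeping one encoder per feature subset, rather than one network over the concatenated features, mirrors the ensemble idea in the abstract: each subset is abstracted on its own before any cross-modality fusion takes place.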