Multimodal Information-Based Broad and Deep Learning Model for Emotion Understanding

Min Li,Luefeng Chen,Min Wu,Witold Pedrycz,Kaoru Hirota
DOI: https://doi.org/10.23919/ccc52363.2021.9549897
2021-01-01
Abstract:Multimodal information-based broad and deep learning model (MIBDL) for emotion understanding is proposed, in which facial expression and body gesture are used to achieve emotional states recognition for emotion understanding. It aims to understand coexistence multimodal information in human-robot interaction by using different processing methods of deep network and broad network, which obtains the features of depth and width dimensions. Moreover, random mapping in the initial broad learning network could cause information loss and its shallow layer network is difficult to cope with complex tasks. To address this problem, we use principal component analysis to generate the nodes of the broad learning, and the stacked broad learning network is adapted to make it easier for the existing broad learning networks to cope with complex tasks by creating deep variations of the existing network. To verify the effectiveness of the proposal, experiments completed on benchmark database of spontaneous emotion expressions are developed, and experimental results show that the proposal outperforms the state-of-the-art methods. According to the simulation experiments on the FABO database, by using the proposed method, the multimodal recognition rate is 17,54%, 1.24%, and 0.23% higher than those of the temporal normalized motion and appearance features(TN), the multi-channel CNN (MCCNN), and the hierarchical classification fusion strategy (HCFS), respectively.
What problem does this paper attempt to address?