Semantic fusion of facial expressions and textual opinions from different datasets for learning-centered emotion recognition

Héctor Manuel Cárdenas-López,Ramón Zatarain-Cabada,María Lucía Barrón-Estrada,Hugo Mitre-Hernández
DOI: https://doi.org/10.1007/s00500-023-08076-1
IF: 3.732
2023-04-21
Soft Computing
Abstract:Learning-centered emotions have a significant role in the cognitive process in learning. For this reason, it is relevant that virtual learning environments consider the cognitive and affective aspects of the student. Methods of artificial intelligence such as the recognition of facial expressions, and sentimental analysis have proven to be an excellent alternative in the automatic recognition of emotions. However, learning-centered emotions and opinion-based sentiment dataset commonly contain single modalities. At the same time, single modalities cannot effectively represent complex emotions in real life. This work presents three different fusion methods applied to three image-based and text-based dataset for learning-centered emotion recognition. Using some conventional deep learning architectures, the three new multimodal datasets showed promising results when compared with similar architectures trained in unimodal information. The improvement of one of the methods (embedding-based representation) was 4% compared to single-modality hyperparameter optimization. The main objective of this study is to benchmark the viability of semantic fusion of multimodal learning-centered emotional data from the different datasets for intelligent tutoring system applications.
computer science, artificial intelligence, interdisciplinary applications
What problem does this paper attempt to address?