Adaptive Modality Distillation for Separable Multimodal Sentiment Analysis

Wei Peng, Xiaopeng Hong, Guoying Zhao, Erik Cambria
DOI: https://doi.org/10.1109/mis.2021.3057757
IF: 6.744
2021-05-01
IEEE Intelligent Systems
Abstract: Multimodal sentiment analysis has attracted increasing attention because, with the arrival of complementary data streams, it has great potential to improve on and go beyond unimodal sentiment analysis. In this article, we present an efficient separable multimodal learning method for tasks in which some modalities are missing. In this method, the multimodal tensor is used to guide the evolution of each separated modality representation. To reduce computational expense, Tucker decomposition is introduced, which leads to a general extension of the low-rank tensor fusion method with more modality interactions. The method, in turn, enhances our modality distillation process. Comprehensive experiments on three popular multimodal sentiment analysis datasets, CMU-MOSI, POM, and IEMOCAP, show superior performance, especially when only partial modalities are available.
computer science, artificial intelligence; engineering, electrical & electronic
What problem does this paper attempt to address?
This paper addresses a common problem in multimodal sentiment analysis: how to perform sentiment analysis effectively when one or more modalities are missing. The authors propose an efficient separable multimodal learning method based on tensor fusion networks, in which the fused multimodal tensor supplies complementary information that guides the evolution of each separated modality representation. To reduce computational cost, Tucker decomposition is introduced, which extends the low-rank tensor fusion method to more modality interactions and in turn enhances the modality distillation process. Experiments on three popular datasets, CMU-MOSI, POM, and IEMOCAP, show superior performance, especially when only partial modalities are available. The authors also validate the proposed adaptive temperature mechanism through ablation studies. Overall, the method is reported to remain robust and accurate when modalities are missing.
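The distillation idea above, in which a fused multimodal representation guides each single-modality branch, can be illustrated with a minimal sketch. It uses the standard temperature-scaled knowledge-distillation loss, which is an assumption here: this summary does not give the paper's exact loss, nor the rule by which its adaptive temperature is chosen, so the temperature below is a fixed, hypothetical value. The function names and the logits are also illustrative only.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Hypothetical logits: the teacher stands in for the fused multimodal branch,
# the student for a single-modality (for example, text-only) branch.
fused_logits = [2.0, 0.5, -1.0]
text_only_logits = [1.2, 0.8, -0.5]

# Fixed temperature for illustration; the paper adapts it, by a rule not shown here.
loss = distillation_loss(fused_logits, text_only_logits, temperature=2.0)
print(f"distillation loss: {loss:.4f}")
```

Minimizing such a loss pulls the single-modality prediction toward the fused one, which is one plausible reading of why a model trained this way can still perform well when only some modalities are available at test time.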