QuMIN: quantum multi-modal data fusion for humor detection

Arpan Phukan,Anas Anwarul Haq Khan,Asif Ekbal
DOI: https://doi.org/10.1007/s11042-024-19790-9
IF: 2.577
2024-07-13
Multimedia Tools and Applications
Abstract:Humour detection has attracted considerable attention due to its significance in interpreting dialogues across text, visual, and acoustic modalities. However, effective methods to map correlations among different modalities remain an active area of research. In this study, we go beyond traditional machine learning techniques by introducing a Variational Quantum Circuit (VQC) that capitalizes on the inherent quantum properties of superposition, entanglement, and interference. Our proposed model, Qu antum M ulti-Modal Data Fus i o n (QuMIN), is designed to better capture and reproduce the interaction across modalities, as well as the internal correlations within each modality. Our introduction of the novel VQC, which augments the DialogueRNN baseline with only an additional 4,809 parameters, signifies a substantial advancement in multi-modal humor detection with improvements of 12.34% in precision, 8.84% in recall and 10.57% in F1 score compared to the state-of-the-art methods.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?