MFAE: Multimodal Fusion and Alignment for Entity-level Disinformation Detection

Zhenxiang Pan, Yingchi Mao, Li Xiong, Tianfu Pang, Ping Ping
DOI: https://doi.org/10.1016/j.patrec.2024.06.008
IF: 4.757
2024-06-14
Pattern Recognition Letters
Abstract: Disinformation on social media has evolved from purely textual content to multimodal content that combines text and images, which amplifies its misleading and deceptive nature. Detecting disinformation accurately despite this misleading and confusing noise is a significant challenge. To address it, we propose Multimodal Fusion and Alignment for Entity-level Disinformation Detection (MFAE). MFAE first uses an improved dynamic routing algorithm to extract more comprehensive semantic visual entity features. A graph matching network then captures the correspondences between entities within modalities. Experiments show that MFAE captures textual and visual semantic information more comprehensively. On the TWITTER and WEIBO datasets, MFAE improves accuracy by approximately 2.0% and 7.5%, respectively, over state-of-the-art methods, reaching accuracies of 89.5% and 96.7%.
computer science, artificial intelligence
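The abstract names an improved dynamic routing algorithm but does not describe it. For orientation only, the following is a minimal sketch of the standard routing-by-agreement procedure from capsule networks, which is presumably the baseline being improved; it is not the MFAE variant. The names `u_hat`, `squash`, `softmax`, and `dynamic_routing` are illustrative assumptions, not taken from the paper.

```python
import math

def squash(s, eps=1e-9):
    """Rescale vector s so its length lies in [0, 1) while keeping its direction."""
    norm_sq = sum(x * x for x in s)
    scale = norm_sq / (1.0 + norm_sq) / math.sqrt(norm_sq + eps)
    return [scale * x for x in s]

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def dynamic_routing(u_hat, num_iters=3):
    """Standard routing-by-agreement (assumed baseline, not the paper's variant).

    u_hat[i][j] is the prediction vector that input capsule i makes
    for output capsule j. Returns the output vectors v and the
    coupling coefficients c used in the last iteration.
    """
    n_in, n_out, dim = len(u_hat), len(u_hat[0]), len(u_hat[0][0])
    b = [[0.0] * n_out for _ in range(n_in)]  # routing logits, start uniform
    for _ in range(num_iters):
        # Each input capsule distributes its vote across output capsules.
        c = [softmax(row) for row in b]
        v = []
        for j in range(n_out):
            # Weighted sum of predictions for output capsule j, then squash.
            s_j = [sum(c[i][j] * u_hat[i][j][k] for i in range(n_in))
                   for k in range(dim)]
            v.append(squash(s_j))
        # Raise the logit where a prediction agrees with the output vector.
        for i in range(n_in):
            for j in range(n_out):
                b[i][j] += sum(a * w for a, w in zip(u_hat[i][j], v[j]))
    return v, c
```

In this sketch, input capsules whose predictions agree with an output capsule receive larger coupling coefficients for it on the next iteration, so consistent evidence is reinforced and conflicting evidence is down-weighted.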