MR2: A Benchmark for Multimodal Retrieval-Augmented Rumor Detection in Social Media.

Xuming Hu,Zhijiang Guo,Junzhe Chen,Lijie Wen,Philip S. Yu
DOI: https://doi.org/10.1145/3539618.3591896
2023-01-01
Abstract:As social media platforms are evolving from text-based forums into multi-modal environments, the nature of misinformation in social media is also transforming accordingly. Misinformation spreaders have recently targeted contextual connections between the modalities e.g., text and image. However, existing datasets for rumor detection mainly focus on a single modality i.e., text. To bridge this gap, we construct MR2, a multimodal multilingual retrieval-augmented dataset for rumor detection. The dataset covers rumors with images and texts, and provides evidence from both modalities that are retrieved from the Internet. Further, we develop established baselines and conduct a detailed analysis of the systems evaluated on the dataset. Extensive experiments show that MR2 will provide a challenging testbed for developing rumor detection systems designed to retrieve and reason over social media posts. Source code and data are available at: https://github.com/THU-BPM/MR2.
What problem does this paper attempt to address?