Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation

Bing He, Mustaque Ahamad, Srijan Kumar
DOI: https://doi.org/10.48550/arXiv.2303.06433
2023-03-11
Abstract: The spread of online misinformation threatens public health, democracy, and the broader society. While professional fact-checkers form the first line of defense by fact-checking popular false claims, they do not engage directly in conversations with misinformation spreaders. On the other hand, non-expert ordinary users act as eyes-on-the-ground who proactively counter misinformation: recent research has shown that 96% of counter-misinformation responses are made by ordinary users. However, research has also found that two-thirds of the time, these responses are rude and lack evidence. This work seeks to create a counter-misinformation response generation model to empower users to effectively correct misinformation. This objective is challenging due to the absence of datasets containing ground truth for ideal counter-misinformation responses, and the lack of models that can generate responses backed by communication theories. In this work, we create two novel datasets of misinformation and counter-misinformation response pairs, one from in-the-wild social media and one crowdsourced from college-educated students. We annotate the collected data to distinguish poor responses from ideal responses that are factual, polite, and refute misinformation. We propose MisinfoCorrect, a reinforcement learning-based framework that learns to generate counter-misinformation responses for an input misinformation post. The model rewards the generator for increasing politeness, factuality, and refutation attitude while retaining text fluency and relevancy. Quantitative and qualitative evaluation shows that our model outperforms several baselines by generating high-quality counter-responses. This work illustrates the promise of generative text models for social good, here to help create a safe and reliable information ecosystem. The code and data are accessible at https://github.com/claws-lab/MisinfoCorrect.
Social and Information Networks, Machine Learning
What problem does this paper attempt to address?
This paper addresses how to counter the spread of misinformation on social media effectively. Specifically, it focuses on generating high-quality counter-misinformation responses that are objective, supported by evidence, polite, and able to refute the false claim. The authors point out that although ordinary, non-expert users frequently counter misinformation on social media, their responses are often rude and lack a factual basis, which may escalate conflict and reduce trust. The paper therefore aims to help ordinary users correct misinformation by developing a reinforcement learning-based counter-misinformation response generation model, so as to curb the spread of misinformation more effectively.
The main contributions of the paper are:
1. Two new datasets of misinformation posts paired with counter-misinformation responses: one collected from real social media and one collected through crowdsourcing.
2. MisinfoCorrect, a reinforcement learning-based framework that generates counter-misinformation responses with the characteristics listed above.
3. Quantitative and qualitative evaluations showing that the proposed model outperforms existing baseline models at generating high-quality counter-misinformation responses.
In terms of technical details, the paper uses GPT-2 as the base language model and designs reward functions that guide it toward responses that are more polite, better supported by evidence, and more effective at refuting misinformation. These reward functions cover politeness, evidence support, refutation, text fluency, and relevance. In this way, the model accounts for the quality of the content while also keeping its responses relevant and natural.
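To make the idea of a multi-part reward concrete, here is a minimal Python sketch of how per-attribute scores could be folded into one scalar reward for a generated response. This summary does not say how the paper combines its rewards, so the weighted average, the equal weights, and the names `RewardScores`, `DEFAULT_WEIGHTS`, and `combined_reward` are all illustrative assumptions, not the authors' implementation. The sketch also assumes each component score has already been produced by some scorer and scaled to the range [0, 1].

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardScores:
    """Per-attribute scores for one generated response (assumed to lie in [0, 1])."""
    politeness: float
    factuality: float
    refutation: float
    fluency: float
    relevance: float


# Hypothetical equal weights; the summary does not give the actual weighting.
DEFAULT_WEIGHTS = {
    "politeness": 1.0,
    "factuality": 1.0,
    "refutation": 1.0,
    "fluency": 1.0,
    "relevance": 1.0,
}


def combined_reward(scores: RewardScores, weights: dict = DEFAULT_WEIGHTS) -> float:
    """Return the weighted average of the component scores."""
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    total = 0.0
    for name, weight in weights.items():
        value = getattr(scores, name)
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} score {value} is outside [0, 1]")
        total += weight * value
    return total / total_weight


# Made-up scores for two candidate responses to the same misinformation post.
polite_factual = RewardScores(politeness=0.9, factuality=0.8, refutation=0.9,
                              fluency=0.8, relevance=0.9)
rude_unsupported = RewardScores(politeness=0.1, factuality=0.2, refutation=0.9,
                                fluency=0.8, relevance=0.9)

# The polite, evidence-backed response earns the larger reward.
print(combined_reward(polite_factual) > combined_reward(rude_unsupported))
```

In a reinforcement learning setup, a scalar like this would serve as the training signal for the generator. Raising the weight on one attribute, for example politeness, would push generation toward that attribute at the expense of the others.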