Abstract:The spread of online misinformation threatens public health, democracy, and the broader society. While professional fact-checkers form the first line of defense by fact-checking popular false claims, they do not engage directly in conversations with misinformation spreaders. On the other hand, non-expert ordinary users act as eyes-on-the-ground who proactively counter misinformation -- recent research has shown that 96% counter-misinformation responses are made by ordinary users. However, research also found that 2/3 times, these responses are rude and lack evidence. This work seeks to create a counter-misinformation response generation model to empower users to effectively correct misinformation. This objective is challenging due to the absence of datasets containing ground-truth of ideal counter-misinformation responses, and the lack of models that can generate responses backed by communication theories. In this work, we create two novel datasets of misinformation and counter-misinformation response pairs from in-the-wild social media and crowdsourcing from college-educated students. We annotate the collected data to distinguish poor from ideal responses that are factual, polite, and refute misinformation. We propose MisinfoCorrect, a reinforcement learning-based framework that learns to generate counter-misinformation responses for an input misinformation post. The model rewards the generator to increase the politeness, factuality, and refutation attitude while retaining text fluency and relevancy. Quantitative and qualitative evaluation shows that our model outperforms several baselines by generating high-quality counter-responses. This work illustrates the promise of generative text models for social good -- here, to help create a safe and reliable information ecosystem. The code and data is accessible on <a class="link-external link-https" href="https://github.com/claws-lab/MisinfoCorrect" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the effective methods to combat the spread of false information on social media. Specifically, the paper focuses on how to generate high - quality anti - false - information responses, which need to have the following characteristics: objective, evidence - supported, polite and able to effectively refute false information. The author points out that although it is very common for non - professional ordinary users to actively combat false information on social media, in most cases, their responses are often rude and lack factual basis, which may instead intensify contradictions and reduce trust. Therefore, this paper aims to improve the ability of ordinary users to combat false information by developing an anti - false - information response generation model based on reinforcement learning, so as to more effectively curb the spread of false information. To achieve this goal, the main contributions of the paper include: 1. Creating two new datasets, including false information and its corresponding anti - false - information responses. One dataset is from real social media, and the other is collected through crowdsourcing. 2. Proposing a reinforcement - learning - based framework MisinfoCorrect, which can generate anti - false - information responses with the above - mentioned required characteristics. 3. Proving through quantitative and qualitative evaluations that the proposed model is superior to the existing baseline models in generating high - quality anti - false - information responses. The specific technical details mentioned in the paper include using GPT - 2 as the basic language model and guiding the model to generate more polite, evidence - supported and effectively refutable false - information responses by designing specific reward functions. These reward functions include politeness rewards, evidence - support rewards, refutation rewards, and text fluency and relevance rewards. In this way, the model can not only consider the quality of the content when generating responses, but also ensure the relevance and naturalness of the responses.

Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation

Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation

Corrective or Backfire: Characterizing and Predicting User Response to Social Correction

Countering Misinformation via Emotional Response Generation

Correcting misinformation on social media with a large language model

MisinfoEval: Generative AI in the Era of "Alternative Facts"

AMIR: Automated MisInformation Rebuttal -- A COVID-19 Vaccination Datasets based Recommendation System

Misinformation Concierge: A Proof-of-Concept with Curated Twitter Dataset on COVID-19 Vaccination

Exploring the impact of automated correction of misinformation in social media

Fighting Fire with Fire: Adversarial Prompting to Generate a Misinformation Detection Dataset

Advanced Misinformation Detection: A Bi-LSTM Model Optimized by Genetic Algorithms

A Survey on the Role of Crowds in Combating Online Misinformation: Annotators, Evaluators, and Creators

Synthetic Lies: Understanding AI-Generated Misinformation and Evaluating Algorithmic and Human Solutions

Machine Learning-based Automatic Annotation and Detection of COVID-19 Fake News

A Comparative Study of Hybrid Models in Health Misinformation Text Classification

Misinformation with Legal Consequences (MisLC): A New Task Towards Harnessing Societal Harm of Misinformation

Addressing contingency in algorithmic (mis)information classification: Toward a responsible machine learning agenda

Generative Debunking of Climate Misinformation

Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

Deep Breath: A Machine Learning Browser Extension to Tackle Online Misinformation

Crowd Intelligence for Early Misinformation Prediction on Social Media