Abstract: Real-world misinformation, often multimodal, can be partially or fully factual yet still mislead through diverse tactics such as conflating correlation with causation. Such misinformation is severely understudied, challenging to address, and harms various social domains, particularly on social media, where it can spread rapidly. High-quality and timely correction of misinformation that identifies and explains its (in)accuracies effectively reduces false beliefs. Although manual correction is widely accepted, it is difficult to make timely and scalable. While LLMs have versatile capabilities that could accelerate misinformation correction, they struggle due to a lack of recent information, a tendency to produce false content, and limitations in addressing multimodal information. We propose MUSE, an LLM augmented with access to and credibility evaluation of up-to-date information. By retrieving evidence as refutations or supporting context, MUSE identifies and explains content (in)accuracies with references. It conducts multimodal retrieval and interprets visual content to verify and correct multimodal content. Given the absence of a comprehensive evaluation approach, we propose 13 dimensions of misinformation correction quality. Fact-checking experts then evaluate responses to social media content that is not presupposed to be misinformation but broadly includes (partially) incorrect and correct posts that may (not) be misleading. Results demonstrate MUSE's ability to write high-quality responses to potential misinformation (across modalities, tactics, domains, and political leanings, and for information that has not previously been fact-checked online) within minutes of its appearance on social media. Overall, MUSE outperforms GPT-4 by 37% and even high-quality responses from laypeople by 29%. Our work provides a general methodological and evaluative framework to correct misinformation at scale.

Preventing and Detecting Misinformation Generated by Large Language Models

On the Risk of Misinformation Pollution with Large Language Models

Can LLM-Generated Misinformation Be Detected?

Combating Misinformation in the Age of LLMs: Opportunities and Challenges

Explaining Misinformation Detection Using Large Language Models

Securing Large Language Models: Addressing Bias, Misinformation, and Prompt Attacks

Misinforming LLMs: vulnerabilities, challenges and opportunities

Disinformation Capabilities of Large Language Models

The Science of Detecting LLM-Generated Texts

Can Large Language Models Detect Misinformation in Scientific News Reporting?

From Deception to Detection: The Dual Roles of Large Language Models in Fake News

Synthetic Lies: Understanding AI-Generated Misinformation and Evaluating Algorithmic and Human Solutions

MisinfoEval: Generative AI in the Era of "Alternative Facts"

Catching Chameleons: Detecting Evolving Disinformation Generated using Large Language Models

The Dark Side of Language Models: Exploring the Potential of LLMs in Multimedia Disinformation Generation and Dissemination

Correcting misinformation on social media with a large language model

Fighting Fire with Fire: Adversarial Prompting to Generate a Misinformation Detection Dataset

Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities

DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content