FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs

Eun Cheol Choi, Emilio Ferrara
2024-02-09
Abstract: Our society is facing rampant misinformation harming public health and trust. To address the societal challenge, we introduce FACT-GPT, a system leveraging Large Language Models (LLMs) to automate the claim matching stage of fact-checking. FACT-GPT, trained on a synthetic dataset, identifies social media content that aligns with, contradicts, or is irrelevant to previously debunked claims. Our evaluation shows that our specialized LLMs can match the accuracy of larger models in identifying related claims, closely mirroring human judgment. This research provides an automated solution for efficient claim matching, demonstrates the potential of LLMs in supporting fact-checkers, and offers valuable resources for further research in the field.
Human-Computer Interaction, Social and Information Networks, Computers and Society
What problem does this paper attempt to address?
This paper addresses the problem of quickly identifying social media content that repeats claims already verified as false, in order to support fact-checking work. Specifically, it introduces FACT-GPT, a system that uses large language models (LLMs) to automate the claim-matching stage of the fact-checking process. The models are trained to recognize social media content that is consistent with, contradictory to, or irrelevant to previously debunked claims. FACT-GPT thereby aims to improve the efficiency of fact-checking, reduce repetitive verification work, and support content moderation on online platforms. The core objective is to demonstrate how LLMs can assist fact-checkers, especially in detecting rumors and reducing the spread of false information. The research also explores how to generate synthetic training datasets to improve model performance, and it compares how different models perform on this task. The paper thus offers both a technical solution and resources for future research.
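As a rough illustration of the claim-matching step described above, the sketch below frames it as a three-way classification of a (post, debunked claim) pair. It is not the authors' implementation: the label strings, the prompt wording, the function names, and the `generate` callable (a stand-in for whatever LLM is used) are all assumptions made for this example.

```python
from typing import Callable

# Hypothetical label names mirroring the three categories in the abstract.
LABELS = ("aligns", "contradicts", "irrelevant")


def build_prompt(post: str, debunked_claim: str) -> str:
    """Build an illustrative prompt asking for a one-word relation label."""
    return (
        "Debunked claim: " + debunked_claim + "\n"
        "Social media post: " + post + "\n"
        "Does the post align with, contradict, or have no relation to the claim? "
        "Answer with one word: aligns, contradicts, or irrelevant."
    )


def parse_label(model_output: str) -> str:
    """Map free-form model output to one of LABELS.

    Unrecognized output falls back to "irrelevant" (an assumed,
    conservative default so that unclear answers are not flagged).
    """
    text = model_output.strip().lower()
    for label in LABELS:
        if text.startswith(label):
            return label
    return "irrelevant"


def match_claim(post: str, debunked_claim: str,
                generate: Callable[[str], str]) -> str:
    """Classify one post against one debunked claim.

    `generate` is any function that takes a prompt and returns model text;
    it is a placeholder for a real LLM call.
    """
    return parse_label(generate(build_prompt(post, debunked_claim)))
```

In use, `generate` would wrap a call to a fine-tuned or hosted model. Keeping it as a parameter makes it possible to compare different models on the same pairs, in the spirit of the evaluation the summary mentions.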