Abstract:Recently, misinformation incorporating both texts and images has been disseminated more effectively than those containing text alone on social media, raising significant concerns for multi-modal fact-checking. Existing research makes contributions to multi-modal feature extraction and interaction, but fails to fully enhance the valuable semantic representations or excavate the intricate entity information. Besides, existing multi-modal fact-checking datasets are primarily focused on English and merely concentrate on a single type of misinformation, thereby neglecting a comprehensive summary and coverage of various types of misinformation. Taking these factors into account, we construct the first large-scale Chinese Multi-modal Fact-Checking (CMFC) dataset which encompasses 46,000 claims. The CMFC covers all types of misinformation for fact-checking and is divided into two sub-datasets, Collected Chinese Multi-modal Fact-Checking (CCMF) and Synthetic Chinese Multi-modal Fact-Checking (SCMF). To establish baseline performance, we propose a novel Entity-enhanced and Stance Checking Network (ESCNet), which includes Multi-modal Feature Extraction Module, Stance Transformer, and Entity-enhanced Encoder. The ESCNet jointly models stance semantic reasoning features and knowledge-enhanced entity pair features, in order to simultaneously learn effective semantic-level and knowledge-level claim representations. Our work offers the first step and establishes a benchmark for evidence-based, multi-type, multi-modal fact-checking.

CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking

Do We Need Language-Specific Fact-Checking Models? The Case of Chinese

Signature Detection, Restoration, and Verification: A Novel Chinese Document Signature Forgery Detection Benchmark

Check-COVID: Fact-Checking COVID-19 News Claims with Scientific Evidence

Give Me More Details: Improving Fact-Checking with Latent Retrieval

MCFEND: A Multi-source Benchmark Dataset for Chinese Fake News Detection

Fact Checking Beyond Training Set

CFEVER: A Chinese Fact Extraction and VERification Dataset

Augmenting the Veracity and Explanations of Complex Fact Checking via Iterative Self-Revision with LLMs

HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking

FactCheck Editor: Multilingual Text Editor with End-to-End fact-checking

Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers

ESCNet: Entity-enhanced and Stance Checking Network for Multi-modal Fact-Checking

RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict

AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web

Explainable Automated Fact-Checking for Public Health Claims

FacTeR-Check: Semi-automated fact-checking through Semantic Similarity and Natural Language Inference

Cross-lingual COVID-19 Fake News Detection

Some Observations on Fact-Checking Work with Implications for Computational Support

CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval

An AI-based System to Assist Human Fact-Checkers for Labeling Cantonese Fake News on Social Media