Abstract: We present Claim-Dissector, a novel latent variable model for fact-checking and analysis which, given a claim and a set of retrieved evidence, jointly learns to identify (i) the evidence relevant to the given claim and (ii) the veracity of the claim. We propose to disentangle the per-evidence relevance probability and its contribution to the final veracity probability in an interpretable way: the final veracity probability is proportional to a linear ensemble of per-evidence relevance probabilities. In this way, the individual contribution of each piece of evidence to the final predicted probability can be identified. Within the per-evidence relevance probability, our model can further distinguish whether each relevant piece of evidence supports (S) or refutes (R) the claim. This makes it possible to quantify how much the S/R probability contributes to the final verdict and to detect disagreeing evidence. Despite its interpretable nature, our system achieves results on the FEVER dataset that are competitive with the state of the art, as compared to typical two-stage system pipelines, while using significantly fewer parameters. It also sets a new state of the art on the FAVIQ and RealFC datasets. Furthermore, our analysis shows that our model can learn fine-grained relevance cues while using only coarse-grained supervision, and we demonstrate this in two ways. (i) We show that our model can achieve competitive sentence recall while using only paragraph-level relevance supervision. (ii) Moving to the finest granularity of relevance, we show that our model is capable of identifying relevance at the token level. To do this, we present a new benchmark, TLR-FEVER, focusing on token-level interpretability: humans annotate the tokens in relevant evidence that they considered essential when making their judgment. We then measure how similar these annotations are to the tokens our model focuses on.
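The "linear ensemble" idea in the abstract can be illustrated with a small sketch. This is not the authors' formulation: the function name `verdict_from_evidence`, the per-evidence weights, and the restriction to the two labels S and R are all assumptions made for illustration. It only shows how a verdict that is proportional to a weighted sum of per-evidence probabilities lets each piece of evidence be assigned an exact share of the final probability.

```python
from typing import Dict, List, Tuple

LABELS = ("S", "R")  # supporting, refuting (a simplification for this sketch)


def verdict_from_evidence(
    relevance: List[Dict[str, float]],
    weights: List[float],
) -> Tuple[Dict[str, float], List[Dict[str, float]]]:
    """Combine per-evidence S/R probabilities into a claim-level verdict.

    relevance[i][label] is a hypothetical probability that evidence i is
    relevant with that label; weights[i] is a hypothetical non-negative
    importance weight. Both are illustrative inputs, not model outputs.
    """
    if len(relevance) != len(weights):
        raise ValueError("relevance and weights must have the same length")

    # Linear term for each evidence i and label c: w_i * p_i(c).
    contributions = [
        {c: w * p[c] for c in LABELS} for p, w in zip(relevance, weights)
    ]

    # Unnormalized verdict score per label is the sum of the linear terms.
    totals = {c: sum(contrib[c] for contrib in contributions) for c in LABELS}
    norm = sum(totals.values())
    if norm <= 0:
        raise ValueError("at least one evidence must have positive mass")

    # Normalizing makes the verdict a probability distribution over labels.
    verdict = {c: totals[c] / norm for c in LABELS}

    # Because the score is linear, each evidence owns an exact share of it.
    shares = [{c: contrib[c] / norm for c in LABELS} for contrib in contributions]
    return verdict, shares


if __name__ == "__main__":
    evidence = [{"S": 0.8, "R": 0.1}, {"S": 0.1, "R": 0.6}]
    verdict, shares = verdict_from_evidence(evidence, [1.0, 1.0])
    print(verdict)
    print(shares)
```

In this toy setting, summing the `shares` for one label over all evidence recovers that label's verdict probability, which is the property that makes per-evidence contributions readable. Evidence whose largest share falls on the label opposite to the verdict would be a candidate for "disagreeing evidence" in the sense described above.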

Contrastive Learning to Improve Retrieval for Real-world Fact Checking

Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments

Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction

Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM

Fact Checking Beyond Training Set

AIC CTU system at AVeriTeC: Re-framing automated fact-checking as a simple RAG task

Give Me More Details: Improving Fact-Checking with Latent Retrieval

From Relevance to Utility: Evidence Retrieval with Feedback for Fact Verification

Augmenting the Veracity and Explanations of Complex Fact Checking via Iterative Self-Revision with LLMs

CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval

AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web

Robust Information Retrieval for False Claims with Distracting Entities In Fact Extraction and Verification

Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers

Complex Claim Verification with Evidence Retrieved in the Wild

A Multi-Level Attention Model for Evidence-Based Fact Checking

RAC: Efficient LLM Factuality Correction with Retrieval Augmentation

Bridging Textual and Tabular Worlds for Fact Verification: A Lightweight, Attention-Based Model

Natural Logic-guided Autoregressive Multi-hop Document Retrieval for Fact Verification

FactFinders at CheckThat! 2024: Refining Check-worthy Statement Detection with LLMs through Data Pruning

How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models

Evidence-backed Fact Checking using RAG and Few-Shot In-Context Learning with LLMs