Abstract:In an era increasingly dominated by digital platforms, the spread of misinformation poses a significant challenge, highlighting the need for solutions capable of assessing information veracity. Our research contributes to the field of Explainable Artificial Antelligence (XAI) by developing transformer-based fact-checking models that contextualise and justify their decisions by generating human-accessible explanations. Importantly, we also develop models for automatic evaluation of explanations for fact-checking verdicts across different dimensions such as \texttt{(self)-contradiction}, \texttt{hallucination}, \texttt{convincingness} and \texttt{overall quality}. By introducing human-centred evaluation methods and developing specialised datasets, we emphasise the need for aligning Artificial Intelligence (AI)-generated explanations with human judgements. This approach not only advances theoretical knowledge in XAI but also holds practical implications by enhancing the transparency, reliability and users' trust in AI-driven fact-checking systems. Furthermore, the development of our metric learning models is a first step towards potentially increasing efficiency and reducing reliance on extensive manual assessment. Based on experimental results, our best performing generative model \textsc{ROUGE-1} score of 47.77, demonstrating superior performance in generating fact-checking explanations, particularly when provided with high-quality evidence. Additionally, the best performing metric learning model showed a moderately strong correlation with human judgements on objective dimensions such as \texttt{(self)-contradiction and \texttt{hallucination}, achieving a Matthews Correlation Coefficient (MCC) of around 0.7.}

Surprising Efficacy of Fine-Tuned Transformers for Fact-Checking over Larger Language Models

Learning to Generate and Evaluate Fact-checking Explanations with Transformers

The Perils & Promises of Fact-checking with Large Language Models

The perils and promises of fact-checking with large language models

Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation

Are Large Language Models Good Fact Checkers: A Preliminary Study

Tell Me Why: Explainable Public Health Fact-Checking with Large Language Models

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Long-form factuality in large language models

Generative Large Language Models in Automated Fact-Checking: A Survey

Language Models Hallucinate, but May Excel at Fact Verification

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification

FactFinders at CheckThat! 2024: Refining Check-worthy Statement Detection with LLMs through Data Pruning

FactCheck Editor: Multilingual Text Editor with End-to-End fact-checking

Multimodal Large Language Models to Support Real-World Fact-Checking

Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models

Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers

Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models

A Comparative Study of Translation Bias and Accuracy in Multilingual Large Language Models for Cross-Language Claim Verification

QuestGen: Effectiveness of Question Generation Methods for Fact-Checking Applications