The Automated Verification of Textual Claims (AVeriTeC) Shared Task

Michael Schlichtkrull,Yulong Chen,Chenxi Whitehouse,Zhenyun Deng,Mubashara Akhtar,Rami Aly,Zhijiang Guo,Christos Christodoulopoulos,Oana Cocarascu,Arpit Mittal,James Thorne,Andreas Vlachos
2024-10-31
Abstract:The Automated Verification of Textual Claims (AVeriTeC) shared task asks participants to retrieve evidence and predict veracity for real-world claims checked by fact-checkers. Evidence can be found either via a search engine, or via a knowledge store provided by the organisers. Submissions are evaluated using AVeriTeC score, which considers a claim to be accurately verified if and only if both the verdict is correct and retrieved evidence is considered to meet a certain quality threshold. The shared task received 21 submissions, 18 of which surpassed our baseline. The winning team was TUDA_MAI with an AVeriTeC score of 63%. In this paper we describe the shared task, present the full results, and highlight key takeaways from the shared task.
Computation and Language
What problem does this paper attempt to address?