The TRIPOD-LLM Statement: A Targeted Guideline For Reporting Large Language Models Use

Jack Gallifant,Majid Afshar,Saleem Ameen,Yindalon Aphinyanaphongs,Shan Chen,Giovanni Cacciamani,Dina Demner-Fushman,Dmitriy Dligach,Roxana Daneshjou,Chrystinne Fernandes,Lasse Hyldig Hansen,Adam Landman,Liam G. McCoy,Timothy Miller,Amy Moreno,Nikolaj Munch,David Restrepo,Guergana Savova,Renato Umeton,Judy Wawira Gichoya,Gary S. Collins,Karel G. M. Moons,Leo A. Celi,Danielle S. Bitterman
DOI: https://doi.org/10.1101/2024.07.24.24310930
2024-07-25
Abstract:Large Language Models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. We present TRIPOD-LLM, an extension of the TRIPOD+AI statement, addressing the unique challenges of LLMs in biomedical applications. TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion. The guidelines introduce a modular format accommodating various LLM research designs and tasks, with 14 main items and 32 subitems applicable across all categories. Developed through an expedited Delphi process and expert consensus, TRIPOD-LLM emphasizes transparency, human oversight, and task-specific performance reporting. We also introduce an interactive website (https://tripod-llm.vercel.app/) facilitating easy guideline completion and PDF generation for submission. As a living document, TRIPOD-LLM will evolve with the field, aiming to enhance the quality, reproducibility, and clinical applicability of LLM research in healthcare through comprehensive reporting.
Health Informatics
What problem does this paper attempt to address?