The STOIC2021 COVID-19 AI challenge: Applying reusable training methodologies to private data

Luuk H. Boulogne,Julian Lorenz,Daniel Kienzle,Robin Schön,Katja Ludwig,Rainer Lienhart,Simon Jégou,Guang Li,Cong Chen,Qi Wang,Derik Shi,Mayug Maniparambil,Dominik Müller,Silvan Mertes,Niklas Schröter,Fabio Hellmann,Miriam Elia,Ine Dirks,Matías Nicolás Bossa,Abel Díaz Berenguer,Tanmoy Mukherjee,Jef Vandemeulebroucke,Hichem Sahli,Nikos Deligiannis,Panagiotis Gonidakis,Ngoc Dung Huynh,Imran Razzak,Reda Bouadjenek,Mario Verdicchio,Pasquale Borrelli,Marco Aiello,James A. Meakin,Alexander Lemm,Christoph Russ,Razvan Ionasec,Nikos Paragios,Bram van Ginneken,Marie-Pierre Revel-Dubois
DOI: https://doi.org/10.1016/j.media.2024.103230
IF: 10.9
2024-06-08
Medical Image Analysis
Abstract:Challenges drive the state-of-the-art of automated medical image analysis. The quantity of public training data that they provide can limit the performance of their solutions. Public access to the training methodology for these solutions remains absent. This study implements the Type Three (T3) challenge format, which allows for training solutions on private data and guarantees reusable training methodologies. With T3, challenge organizers train a codebase provided by the participants on sequestered training data. T3 was implemented in the STOIC2021 challenge, with the goal of predicting from a computed tomography (CT) scan whether subjects had a severe COVID-19 infection, defined as intubation or death within one month. STOIC2021 consisted of a Qualification phase, where participants developed challenge solutions using 2 000 publicly available CT scans, and a Final phase, where participants submitted their training methodologies with which solutions were trained on CT scans of 9 724 subjects. The organizers successfully trained six of the eight Final phase submissions. The submitted codebases for training and running inference were released publicly. The winning solution obtained an area under the receiver operating characteristic curve for discerning between severe and non-severe COVID-19 of 0.815. The Final phase solutions of all finalists improved upon their Qualification phase solutions.
engineering, biomedical,computer science, interdisciplinary applications, artificial intelligence,radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?