Data infrastructures for AI in medical imaging: a report on the experiences of five EU projects
Haridimos Kondylakis,Varvara Kalokyri,Stelios Sfakianakis,Kostas Marias,Manolis Tsiknakis,Ana Jimenez-Pastor,Eduardo Camacho-Ramos,Ignacio Blanquer,J. Damian Segrelles,Sergio López-Huguet,Caroline Barelle,Magdalena Kogut-Czarkowska,Gianna Tsakou,Nikolaos Siopis,Zisis Sakellariou,Paschalis Bizopoulos,Vicky Drossou,Antonios Lalas,Konstantinos Votis,Pedro Mallol,Luis Marti-Bonmati,Leonor Cerdá Alberich,Karine Seymour,Samuel Boucher,Esther Ciarrocchi,Lauren Fromont,Jordi Rambla,Alexander Harms,Andrea Gutierrez,Martijn P. A. Starmans,Fred Prior,Josep Ll. Gelpi,Karim Lekadir
DOI: https://doi.org/10.1186/s41747-023-00336-x
2023-05-08
European Radiology Experimental
Abstract:Abstract Artificial intelligence (AI) is transforming the field of medical imaging and has the potential to bring medicine from the era of ‘sick-care’ to the era of healthcare and prevention. The development of AI requires access to large, complete, and harmonized real-world datasets, representative of the population, and disease diversity. However, to date, efforts are fragmented, based on single–institution, size-limited, and annotation-limited datasets. Available public datasets ( e.g. , The Cancer Imaging Archive, TCIA, USA) are limited in scope, making model generalizability really difficult. In this direction, five European Union projects are currently working on the development of big data infrastructures that will enable European, ethically and General Data Protection Regulation-compliant, quality-controlled, cancer-related, medical imaging platforms, in which both large-scale data and AI algorithms will coexist. The vision is to create sustainable AI cloud-based platforms for the development, implementation, verification, and validation of trustable, usable, and reliable AI models for addressing specific unmet needs regarding cancer care provision. In this paper, we present an overview of the development efforts highlighting challenges and approaches selected providing valuable feedback to future attempts in the area. Key points • Artificial intelligence models for health imaging require access to large amounts of harmonized imaging data and metadata. • Main infrastructures adopted either collect centrally anonymized data or enable access to pseudonymized distributed data. • Developing a common data model for storing all relevant information is a challenge. • Trust of data providers in data sharing initiatives is essential. • An online European Union meta-tool-repository is a necessity minimizing effort duplication for the various projects in the area.