Anindo Saha, Joeran S Bosma, Jasper J Twilt, Bram van Ginneken, Anders Bjartell, Anwar R Padhani, David Bonekamp, Geert Villeirs, Georg Salomon, Gianluca Giannarini, Jayashree Kalpathy-Cramer, Jelle Barentsz, Klaus H Maier-Hein, Mirabela Rusu, Olivier Rouvière, Roderick van den Bergh, Valeria Panebianco, Veeru Kasivisvanathan, Nancy A Obuchowski, Derya Yakar, Mattijs Elschot, Jeroen Veltman, Jurgen J Fütterer, Maarten de Rooij, Henkjan Huisman, PI-CAI consortium, Constant R Noordman, Ivan Slootweg, Christian Roest, Stefan J Fransen, Mohammed R S Sunoqrot, Tone F Bathen, Dennis Rouw, Jos Immerzeel, Jeroen Geerdink, Chris van Run, Miriam Groeneveld, James Meakin, Ahmet Karagöz, Alexandre Bône, Alexandre Routier, Arnaud Marcoux, Clément Abi-Nader, Cynthia Xinran Li, Dagan Feng, Deniz Alis, Ercan Karaarslan, Euijoon Ahn, François Nicolas, Geoffrey A Sonn, Indrani Bhattacharya, Jinman Kim, Jun Shi, Hassan Jahanandish, Hong An, Hongyu Kan, Ilkay Oksuz, Liang Qiao, Marc-Michel Rohé, Mert Yergin, Mohamed Khadra, Mustafa E Şeker, Mustafa S Kartal, Noëlie Debs, Richard E Fan, Sara Saunders, Simon J C Soerensen, Stefania Moroianu, Sulaiman Vesal, Yuan Yuan, Afsoun Malakoti-Fard, Agnė Mačiūnien, Akira Kawashima, Ana M M de M G de Sousa Machadov, Ana Sofia L Moreira, Andrea Ponsiglione, Annelies Rappaport, Arnaldo Stanzione, Arturas Ciuvasovas, Baris Turkbey, Bart de Keyzer, Bodil G Pedersen, Bram Eijlers, Christine Chen, Ciabattoni Riccardo, Ewout F W Courrech Staal, Fredrik Jäderling, Fredrik Langkilde, Giacomo Aringhieri, Giorgio Brembilla, Hannah Son, Hans Vanderlelij, Henricus P J Raat, Ingrida Pikūnienė, Iva Macova, Ivo Schoots, Iztok Caglic, Jeries P Zawaideh, Jonas Wallström, Leonardo K Bittencourt, Misbah Khurram, Moon H Choi, Naoki Takahashi, Nelly Tan, Paolo N Franco, Patricia A Gutierrez, Per Erik Thimansson, Pieter Hanus, Philippe Puech, Philipp R Rau, Pieter de Visschere, Ramette Guillaume, Renato Cuocolo, Ricardo O Falcão, Rogier S A van Stiphout, Rossano Girometti, Ruta Briediene, Rūta Grigienė, Samuel Gitau, Samuel Withey, Sangeet Ghai, Tobias Penzkofer, Tristan Barrett, Varaha S Tammisetti, Vibeke B Løgager, Vladimír Černý, Wulphert Venderink, Yan M Law, Young J Lee

Abstract:
Background: Artificial intelligence (AI) systems can potentially aid the diagnostic pathway of prostate cancer by alleviating the increasing workload, preventing overdiagnosis, and reducing the dependence on experienced radiologists. We aimed to investigate the performance of AI systems at detecting clinically significant prostate cancer on MRI in comparison with radiologists using the Prostate Imaging-Reporting and Data System version 2.1 (PI-RADS 2.1) and the standard of care in multidisciplinary routine practice at scale.
Methods: In this international, paired, non-inferiority, confirmatory study, we trained and externally validated an AI system (developed within an international consortium) for detecting Gleason grade group 2 or greater cancers using a retrospective cohort of 10 207 MRI examinations from 9129 patients. Of these examinations, 9207 cases from three centres (11 sites) based in the Netherlands were used for training and tuning, and 1000 cases from four centres (12 sites) based in the Netherlands and Norway were used for testing. In parallel, we facilitated a multireader, multicase observer study with 62 radiologists (45 centres in 20 countries; median 7 [IQR 5-10] years of experience in reading prostate MRI) using PI-RADS (2.1) on 400 paired MRI examinations from the testing cohort. Primary endpoints were the sensitivity, specificity, and the area under the receiver operating characteristic curve (AUROC) of the AI system in comparison with that of all readers using PI-RADS (2.1) and in comparison with that of the historical radiology readings made during multidisciplinary routine practice (ie, the standard of care with the aid of patient history and peer consultation). Histopathology and at least 3 years (median 5 [IQR 4-6] years) of follow-up were used to establish the reference standard. The statistical analysis plan was prespecified with a primary hypothesis of non-inferiority (considering a margin of 0·05) and a secondary hypothesis of superiority towards the AI system, if non-inferiority was confirmed. This study was registered at ClinicalTrials.gov, NCT05489341.
Findings: Of the 10 207 examinations included from Jan 1, 2012, through Dec 31, 2021, 2440 cases had histologically confirmed Gleason grade group 2 or greater prostate cancer. In the subset of 400 testing cases in which the AI system was compared with the radiologists participating in the reader study, the AI system showed a statistically superior and non-inferior AUROC of 0·91 (95% CI 0·87-0·94; p<0·0001), in comparison to the pool of 62 radiologists with an AUROC of 0·86 (0·83-0·89), with a lower boundary of the two-sided 95% Wald CI for the difference in AUROC of 0·02. At the mean PI-RADS 3 or greater operating point of all readers, the AI system detected 6·8% more cases with Gleason grade group 2 or greater cancers at the same specificity (57·7%, 95% CI 51·6-63·3), or 50·4% fewer false-positive results and 20·0% fewer cases with Gleason grade group 1 cancers at the same sensitivity (89·4%, 95% CI 85·3-92·9). In all 1000 testing cases where the AI system was compared with the radiology readings made during multidisciplinary practice, non-inferiority was not confirmed, as the AI system showed lower specificity (68·9% [95% CI 65·3-72·4] vs 69·0% [65·5-72·5]) at the same sensitivity (96·1%, 94·0-98·2) as the PI-RADS 3 or greater operating point. The lower boundary of the two-sided 95% Wald CI for the difference in specificity (-0·04) was greater than the non-inferiority margin (-0·05) and a p value below the significance threshold was reached (p<0·001).
Interpretation: An AI system was superior to radiologists using PI-RADS (2.1), on average, at detecting clinically significant prostate cancer and comparable to the standard of care. Such a system shows the potential to be a supportive tool within a primary diagnostic setting, with several associated benefits for patients and radiologists. Prospective validation is needed to test clinical applicability of this system.
Funding: Health~Holland and EU Horizon 2020.
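The hierarchical test described in the abstract (non-inferiority first, then superiority) can be illustrated with a minimal sketch. It is not the study's analysis code: the function names `wald_ci` and `classify` are hypothetical, and the standard error used below is an illustrative assumption, since the abstract reports only the lower CI boundary (0·02) and the margin (0·05).

```python
from statistics import NormalDist


def wald_ci(diff, se, level=0.95):
    """Two-sided Wald confidence interval for a difference (AI minus readers)."""
    z = NormalDist().inv_cdf(0.5 + level / 2)  # about 1.96 for a 95% interval
    return diff - z * se, diff + z * se


def classify(lower, margin=0.05):
    """Decision rule based on the lower boundary of the CI for the difference.

    Superiority is only claimed when the lower boundary exceeds 0, which
    already implies that it exceeds the non-inferiority bound of -margin.
    """
    if lower > 0:
        return "superior"
    if lower > -margin:
        return "non-inferior"
    return "non-inferiority not shown"


# Lower boundary for the AUROC difference as reported in the abstract.
print(classify(0.02))

# Illustrative only: an assumed difference and standard error, not study data.
low, high = wald_ci(diff=0.05, se=0.015)
print(round(low, 3), round(high, 3), classify(low))
```

The rule takes the lower boundary as input so that it can be applied directly to a reported interval without access to the underlying reader data.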

Performance of Artificial Intelligence-Aided Diagnosis System for Clinically Significant Prostate Cancer with MRI: A Diagnostic Comparison Study

Assessing the Performance of Artificial Intelligence Assistance for Prostate MRI: A Two‐Center Study Involving Radiologists With Different Experience Levels

Artificial intelligence as diagnostic aiding tool in cases of Prostate Imaging Reporting and Data System category 3: the results of retrospective multi-center cohort study

Using an Artificial Intelligence Model to Detect and Localize Visible Clinically Significant Prostate Cancer in Prostate Magnetic Resonance Imaging: a Multicenter External Validation Study

Assessment of a fully-automated diagnostic AI software in prostate MRI: Clinical evaluation and histopathological correlation

Artificial Intelligence Compared to Radiologists for the Initial Diagnosis of Prostate Cancer on Magnetic Resonance Imaging: A Systematic Review and Recommendations for Future Studies

Comparative performance of fully-automated and semi-automated artificial intelligence methods for the detection of clinically significant prostate cancer on MRI: a systematic review

The Added Value of AI-based Computer-Aided Diagnosis in Classification of Cancer at Prostate MRI

External Validation of a Previously Developed Deep Learning-based Prostate Lesion Detection Algorithm on Paired External and In-House Biparametric MRI Scans

A multicenter study of artificial intelligence-aided software for detecting visible clinically significant prostate cancer on mpMRI

Artificial intelligence for the diagnosis of clinically significant prostate cancer based on multimodal data: a multicenter study

Artificial intelligence and radiologists in prostate cancer detection on MRI (PI-CAI): an international, paired, non-inferiority, confirmatory study

PI-RADSAI: Introducing a New Human-in-the-loop AI Model for Prostate Cancer Diagnosis Based on MRI

Deep‐Learning‐Based Artificial Intelligence for PI‐RADS Classification to Assist Multiparametric Prostate MRI Interpretation: A Development Study

AI-predicted mpMRI Image Features for the Prediction of Clinically Significant Prostate Cancer

Artificial intelligence is a promising prospect for the detection of prostate cancer extracapsular extension with mpMRI: a two-center comparative study

A systematic review on artificial intelligence evaluating PSMA PET scan for intraprostatic cancer

Systematic Review of AI-Assisted MRI in Prostate Cancer Diagnosis: Enhancing Accuracy Through Second Opinion Tools

AI-based automated evaluation of image quality and protocol tailoring in patients undergoing MRI for suspected prostate cancer

Challenges in the Use of Artificial Intelligence for Prostate Cancer Diagnosis from Multiparametric Imaging Data

What benefit can be obtained from magnetic resonance imaging diagnosis with artificial intelligence in prostate cancer compared with clinical assessments?