Deep‐Learning‐Based Artificial Intelligence for PI‐RADS Classification to Assist Multiparametric Prostate MRI Interpretation: A Development Study
Thomas Sanford,Stephanie A. Harmon,Evrim B. Turkbey,Deepak Kesani,Sena Tuncer,Manuel Madariaga,Chris Yang,Jonathan Sackett,Sherif Mehralivand,Pingkun Yan,Sheng Xu,Bradford J. Wood,Maria J. Merino,Peter A. Pinto,Peter L. Choyke,Baris Turkbey
DOI: https://doi.org/10.1002/jmri.27204
IF: 4.4
2020-06-01
Journal of Magnetic Resonance Imaging
Abstract:BackgroundThe Prostate Imaging Reporting and Data System (PI‐RADS) provides guidelines for risk stratification of lesions detected on multiparametric MRI (mpMRI) of the prostate but suffers from high intra/interreader variability. PurposeTo develop an artificial intelligence (AI) solution for PI‐RADS classification and compare its performance with an expert radiologist using targeted biopsy results. Study TypeRetrospective study including data from our institution and the publicly available ProstateX dataset. PopulationIn all, 687 patients who underwent mpMRI of the prostate and had one or more detectable lesions (PI‐RADS score >1) according to PI‐RADSv2. Field Strength/SequenceT2‐weighted, diffusion‐weighted imaging (DWI; five evenly spaced b values between b = 0–750 s/mm2) for apparent diffusion coefficient (ADC) mapping, high b ‐value DWI (b = 1500 or 2000 s/mm2), and dynamic contrast‐enhanced T1‐weighted series were obtained at 3.0T. AssessmentPI‐RADS lesions were segmented by a radiologist. Bounding boxes around the T2/ADC/high‐b value segmentations were stacked and saved as JPEGs. These images were used to train a convolutional neural network (CNN). The PI‐RADS scores obtained by the CNN were compared with radiologist scores. The cancer detection rate was measured from a subset of patients who underwent biopsy. Statistical TestsAgreement between the AI and the radiologist‐driven PI‐RADS scores was assessed using a kappa score, and differences between categorical variables were assessed with a Wald test. ResultsFor the 1034 detection lesions, the kappa score for the AI system vs. the expert radiologist was moderate, at 0.40. However, there was no significant difference in the rates of detection of clinically significant cancer for any PI‐RADS score in 86 patients undergoing targeted biopsy (P = 0.4–0.6). Data ConclusionWe developed an AI system for assignment of a PI‐RADS score on segmented lesions on mpMRI with moderate agreement with an expert radiologist and a similar ability to detect clinically significant cancer. Level of Evidence4 Technical Efficacy Stage2
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?