Differentiating Benign from Malignant Renal Tumors Using T2 ‐ and Diffusion‐Weighted Images: A Comparison of Deep Learning and Radiomics Models Versus Assessment from Radiologists

Qing Xu,QingQiang Zhu,Hao Liu,LuFan Chang,ShaoFeng Duan,WeiQiang Dou,SaiYang Li,Jing Ye
DOI: https://doi.org/10.1002/jmri.27900
IF: 4.4
2021-08-30
Journal of Magnetic Resonance Imaging
Abstract:BackgroundDifferentiating benign from malignant renal tumors is important for selection of the most effective treatment. PurposeTo develop magnetic resonance imaging (MRI)-based deep learning (DL) models for differentiation of benign and malignant renal tumors and to compare their discrimination performance with the performance of radiomics models and assessment by radiologists. Study TypeRetrospective. PopulationA total of 217 patients were randomly assigned to a training cohort (N = 173) or a testing cohort (N = 44). Field Strength/SequenceDiffusion-weighted imaging (DWI) and fast spin-echo sequence T2-weighted imaging (T2WI) at 3.0T. AssessmentA radiologist manually labeled the region of interest (ROI) on each image. Three DL models using ResNet-18 architecture and three radiomics models using random forest were developed using T2WI alone, DWI alone, and a combination of the two image sets to discriminate between benign and malignant renal tumors. The diagnostic performance of two radiologists was assessed based on professional experience. We also compared the performance of each model and the radiologists. Statistical TestsThe area under the receiver operating characteristic (ROC) curve (AUC) was used to assess the performance of each model and the radiologists. P < 0.05 indicated statistical significance. ResultsThe AUC of the DL models based on T2WI, DWI, and the combination was 0.906, 0.846, and 0.925 in the testing cohorts, respectively. The AUC of the combination DL model was significantly better than that of the models based on individual sequences (0.925 > 0.906, 0.925 > 0.846). The AUC of the radiomics models based on T2WI, DWI, and the combination was 0.824, 0.742, and 0.826 in the testing cohorts, respectively. The AUC of two radiologists was 0.724 and 0.667 in the testing cohorts. ConclusionThus, the MRI-based DL model is useful for differentiating benign from malignant renal tumors in clinic, and the DL model based on T2WI + DWI had the best performance. Level of Evidence3 Technical Efficacy Stage2
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?