Predicting Muscle Invasion in Bladder Cancer Based on MRI: A Comparison of Radiomics, and Single-task and Multi-task Deep Learning

Jianpeng Li,Zhengxuan Qiu,Kangyang Cao,Lei Deng,Weijing Zhang,Chuanmiao Xie,Shuiqing Yang,Peiyan Yue,Jian Zhong,Jiegeng Lyu,Xiang Huang,Kunlin Zhang,Yujian Zou,Bingsheng Huang
DOI: https://doi.org/10.1016/j.cmpb.2023.107466
IF: 6.1
2023-03-07
Computer Methods and Programs in Biomedicine
Abstract:Background and Objectives Radiomics and deep learning are two popular technologies used to develop computer-aided detection and diagnosis schemes for analyzing medical images. This study aimed to compare the effectiveness of radiomics, single-task deep learning (DL) and multi-task DL methods in predicting muscle-invasive bladder cancer (MIBC) status based on T2-weighted imaging (T2WI). Methods A total of 121 tumours (93 for training, from Centre 1; 28 for testing, from Centre 2) were included. MIBC was confirmed with pathological examination. A radiomics model, a single-task model, and a multi-task model based on T2WI were constructed in the training cohort with five-fold cross-validation, and validation was conducted in the external test cohort. Receiver operating characteristic (ROC) curve analysis was performed to evaluate the diagnostic performance of each model. DeLong's test and a permutation test were used to compare the performance of the models. Results The area under the ROC curve (AUC) values of the radiomics, single-task and multi-task models in the training cohort were: 0.920, 0.933 and 0.932, respectively; and were 0.844, 0.884 and 0.932, respectively, in the test cohort. The multi-task model achieved better performance in the test cohort than did the other models. No statistically significant differences in AUC values and Kappa coefficients were observed between pairwise models, in either the training or test cohorts. According to the Grad-CAM feature visualization results, the multi-task model focused more on the diseased tissue area in some samples of the test cohort compared with the single-task model. Conclusions The T2WI-based radiomics, single-task, and multi-task models all exhibited good diagnostic performance in preoperatively predicting MIBC, in which the multi-task model had the best diagnostic performance. Compared with the radiomics method, our multi-task DL method had the advantage of saving time and effort. Compared with the single-task DL method, our multi-task DL method had the advantage of being more lesion-focused and more reliable for clinical reference.
engineering, biomedical,computer science, interdisciplinary applications,medical informatics, theory & methods
What problem does this paper attempt to address?