A deep learning–machine learning fusion approach for the classification of benign, malignant, and intermediate bone tumors

Renyi Liu,Derun Pan,Yuan Xu,Hui Zeng,Zilong He,Jiongbin Lin,Weixiong Zeng,Zeqi Wu,Zhendong Luo,Genggeng Qin,Weiguo Chen
DOI: https://doi.org/10.1007/s00330-021-08195-z
IF: 7.034
2021-08-25
European Radiology
Abstract:ObjectivesTo build and validate deep learning and machine learning fusion models to classify benign, malignant, and intermediate bone tumors based on patient clinical characteristics and conventional radiographs of the lesion.MethodsIn this retrospective study, data were collected with pathologically confirmed diagnoses of bone tumors between 2012 and 2019. Deep learning and machine learning fusion models were built to classify tumors as benign, malignant, or intermediate using conventional radiographs of the lesion and potentially relevant clinical data. Five radiologists compared diagnostic performance with and without the model. Diagnostic performance was evaluated using the area under the curve (AUC).ResultsA total of 643 patients’ (median age, 21 years; interquartile range, 12–38 years; 244 women) 982 radiographs were included. In the test set, the binary category classification task, the radiological model of classification for benign/not benign, malignant/nonmalignant, and intermediate/not intermediate had AUCs of 0.846, 0.827, and 0.820, respectively; the fusion models had an AUC of 0.898, 0.894, and 0.865, respectively. In the three-category classification task, the radiological model achieved a macro average AUC of 0.813, and the fusion model had a macro average AUC of 0.872. In the observation test, the mean macro average AUC of all radiologists was 0.819. With the three-category classification fusion model support, the macro AUC improved by 0.026.ConclusionWe built, validated, and tested deep learning and machine learning models that classified bone tumors at a level comparable with that of senior radiologists. Model assistance may somewhat help radiologists’ differential diagnoses of bone tumors.Key Points• The deep learning model can be used to classify benign, malignant, and intermediate bone tumors.• The machine learning model fusing information from radiographs and clinical characteristics can improve the classification capacity for bone tumors.• The diagnostic performance of the fusion model is comparable with that of senior radiologists and is potentially useful as a complement to radiologists in a bone tumor differential diagnosis.
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?
This paper mainly explores how to use the fusion of deep learning and machine learning methods to classify benign, malignant, and intermediate bone tumors. The study is based on data from bone tumor patients with pathological diagnoses from 2012 to 2019, and a classification model is constructed using conventional radiographic images of lesion sites and potentially relevant clinical data. In the test set, the model performs well in binary classification tasks (such as benign/malignant, malignant/non-malignant, intermediate/non-intermediate) with the area under the curve (AUC), and the fusion model outperforms the individual radiological model. In the three-classification task, the fusion model also outperforms the radiological model in terms of macro-average AUC. By comparing with the diagnostic performance of five radiologists, the study found that the diagnostic performance of the fusion model is comparable to that of experienced radiologists, and it may serve as a powerful auxiliary tool for the differential diagnosis of bone tumors.