Machine learning–based CT texture analysis to predict HPV status in oropharyngeal squamous cell carcinoma: comparison of 2D and 3D segmentation

Jiliang Ren,Ying Yuan,Meng Qi,Xiaofeng Tao
DOI: https://doi.org/10.1007/s00330-020-07011-4
IF: 7.034
2020-06-26
European Radiology
Abstract:ObjectiveTo compare the CT texture feature reproducibility of 2D and 3D segmentations and their machine learning (ML)–based classifications for predicting human papilloma virus (HPV) status in oropharyngeal squamous cell carcinoma (OPSCC).Materials and methodsData about 47 patients with pathological OPSCC (15 HPV positive and 32 HPV negative) were collected from a public database. Using 2D and 3D manual segmentations, 1032 texture features were extracted from contrast-enhanced CT images. Intraclass correlation coefficients (ICCs) were calculated to evaluate intraobserver and interobserver reproducibility. Collinearity analysis and a wrapper-based subset search algorithm were used for feature selection. Models were created using k-nearest neighbors (k-NN), logistic regression (LR), and random forest (RF) alone and with a synthetic minority oversampling technique (SMOTE). Classifier performance was assessed using 10-fold cross-validation.ResultsCompared with 2D segmentation (468 of 1032, 45.3%), 3D segmentation (576 of 1032, 55.8%) yielded more texture features with reliable reproducibility (good to excellent in both intraobserver and interobserver analyses) (p < 0.001). RF and k-NN classifiers failed to achieve better classification performance using 3D features than using 2D features either alone or with SMOTE. The best models for 2D and 3D segmentations were both created using RF, which alone achieved areas under the curve (AUCs) of 0.880 and 0.847, respectively, and with SMOTE, AUCs of 0.953 and 0.920, respectively, were achieved.ConclusionsThree-dimensional segmentation had better CT texture feature reproducibility, but 2D segmentation showed better performance. Considering the cost, 2D segmentation is more recommended for ML-based classification of HPV status of OPSCC.Key Points• Three-dimensional segmentation had better CT texture feature reproducibility than 2D segmentation.• Despite yielding more features with reliable reproducibility, 3D segmentation failed to provide better classification performance as compared to 2D for predicting HPV status of oropharyngeal squamous cell carcinoma.• The best models for 2D and 3D segmentations were both created using random forest classifier.
radiology, nuclear medicine & medical imaging
What problem does this paper attempt to address?