Robustness of radiomics among photon-counting detector CT and dual-energy CT systems: a texture phantom study

Lan Zhu,Haipeng Dong,Jing Sun,Lingyun Wang,Yue Xing,Yangfan Hu,Junjie Lu,Jiarui Yang,Jingshen Chu,Chao Yan,Fei Yuan,Jingyu Zhong
DOI: https://doi.org/10.1007/s00330-024-10976-1
2024-07-24
Abstract:Objectives: To evaluate the robustness of radiomics features among photon-counting detector CT (PCD-CT) and dual-energy CT (DECT) systems. Methods: A texture phantom consisting of twenty-eight materials was scanned with one PCD-CT and four DECT systems (dual-source, rapid kV-switching, dual-layer, and sequential scanning) at three dose levels twice. Thirty sets of virtual monochromatic images at 70 keV were reconstructed. Regions of interest were delineated for each material with a rigid registration. Ninety-three radiomics were extracted per PyRadiomics. The test-retest repeatability between repeated scans was assessed by Bland-Altman analysis. The intra-system reproducibility between dose levels, and inter-system reproducibility within the same dose level, were evaluated by intraclass correlation coefficient (ICC) and concordance correlation coefficient (CCC). Inter-system variability among five scanners was assessed by coefficient of variation (CV) and quartile coefficient of dispersion (QCD). Results: The test-retest repeatability analysis presented that 97.1% of features were repeatable between scan-rescans. The mean ± standard deviation ICC and CCC were 0.945 ± 0.079 and 0.945 ± 0.079 for intra-system reproducibility, respectively, and 86.0% and 85.7% of features were with ICC > 0.90 and CCC > 0.90, respectively, between different dose levels. The mean ± standard deviation ICC and CCC were 0.157 ± 0.174 and 0.157 ± 0.174 for inter-system reproducibility, respectively, and none of the features were with ICC > 0.90 or CCC > 0.90 within the same dose level. The inter-system variability suggested that 6.5% and 12.8% of features were with CV < 10% and QCD < 10%, respectively, among five CT systems. Conclusion: The radiomics features were non-reproducible with significant variability in values among different CT techniques. Clinical relevance statement: Radiomics features are non-reproducible with significant variability in values among photon-counting detector CT and dual-energy CT systems, necessitating careful attention to improve the cross-system generalizability of radiomic features before implementation of radiomics analysis in clinical routine. Key points: CT radiomics stability should be guaranteed before the implementation in the clinical routine. Radiomics robustness was on a low level among photon-counting detectors and dual-energy CT techniques. Limited inter-system robustness of radiomic features may impact the generalizability of models.
What problem does this paper attempt to address?