Automatic hip osteoarthritis grading with uncertainty estimation from computed tomography using digitally-reconstructed radiographs

Masachika Masuda,Mazen Soufi,Yoshito Otake,Keisuke Uemura,Sotaro Kono,Kazuma Takashima,Hidetoshi Hamada,Yi Gu,Masaki Takao,Seiji Okada,Nobuhiko Sugano,Yoshinobu Sato
2023-12-30
Abstract:Progression of hip osteoarthritis (hip OA) leads to pain and disability, likely leading to surgical treatment such as hip arthroplasty at the terminal stage. The severity of hip OA is often classified using the Crowe and Kellgren-Lawrence (KL) classifications. However, as the classification is subjective, we aimed to develop an automated approach to classify the disease severity based on the two grades using digitally-reconstructed radiographs (DRRs) from CT images. Automatic grading of the hip OA severity was performed using deep learning-based models. The models were trained to predict the disease grade using two grading schemes, i.e., predicting the Crowe and KL grades separately, and predicting a new ordinal label combining both grades and representing the disease progression of hip OA. The models were trained in classification and regression settings. In addition, the model uncertainty was estimated and validated as a predictor of classification accuracy. The models were trained and validated on a database of 197 hip OA patients, and externally validated on 52 patients. The model accuracy was evaluated using exact class accuracy (ECA), one-neighbor class accuracy (ONCA), and balanced accuracy.The deep learning models produced a comparable accuracy of approximately 0.65 (ECA) and 0.95 (ONCA) in the classification and regression settings. The model uncertainty was significantly larger in cases with large classification errors (P<6e-3). In this study, an automatic approach for grading hip OA severity from CT images was developed. The models have shown comparable performance with high ONCA, which facilitates automated grading in large-scale CT databases and indicates the potential for further disease progression analysis. Classification accuracy was correlated with the model uncertainty, which would allow for the prediction of classification errors.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the automatic grading of hip osteoarthritis (hip OA). Specifically, the authors aim to develop a method for automatically evaluating the severity of hip osteoarthritis using digitally reconstructed radiographs (DRRs) based on CT images, using the Crowe and Kellgren - Lawrence (KL) classification systems. Traditional methods for grading hip osteoarthritis rely on the subjective judgment of radiologists or orthopedic surgeons, and there are large inter - observer and intra - observer differences. Therefore, the goal of this study is to automate this process through a deep - learning model to improve the accuracy and repeatability of diagnosis while reducing the dependence on professionals. ### Main problems: 1. **Automated grading**: Develop a method that can automatically generate DRRs from CT images and automatically grade the severity of hip osteoarthritis based on these DRRs. 2. **Multi - level classification**: Not only perform binary classification (normal vs. diseased), but also perform multi - level classification to reflect the progression stage of the disease. 3. **Model uncertainty estimation**: Evaluate the uncertainty of the model to predict the possibility of classification errors, thereby improving the reliability of the model. ### Solutions: 1. **Data processing**: - Use a pre - trained model with 3D CNN and U - Net architectures to detect the femoral head centers (FHCs) in CT images. - Extract a 150 mm³ cubic area containing the hip joint area from the FHCs. - Project the extracted area in the anteroposterior direction to generate DRRs images. 2. **Model architecture**: - Use three deep - learning models: VGG16, DenseNet161, and VisionTransformer (ViT). - The models are trained in classification and regression settings to predict Crowe and KL grades respectively. - For joint classification, the dimension of the final layer is changed from 1000 to 7; for separate classification, two fully - connected layers are added, and the dimension of the final layer is set to 4. 3. **Uncertainty estimation**: - Use the Monte - Carlo Dropout (MCdropout) method to estimate the uncertainty of the model. - Calculate the variance of the output through multiple dropout samplings as a measure of uncertainty. 4. **Evaluation metrics**: - Use exact - class accuracy (ECA) and one - neighbor - class accuracy (ONCA) to evaluate grading accuracy. - Use standard error (SE) to evaluate regression performance. - Report the balanced accuracy to take into account the effect of class imbalance. ### Experimental results: - **Internal dataset**: In the internal dataset, ViT and DenseNet have the highest ECA in the regression setting, which are 0.660 ± 0.010 and 0.663 ± 0.016 respectively. The ONCA of all models exceeds 0.90. - **External dataset**: In the external dataset, the ECA of ViT and DenseNet in the regression setting are 0.567 ± 0.038 and 0.606 ± 0.058 respectively, and the ONCA are 0.913 ± 0.012 and 0.942 ± 0.010 respectively. - **Uncertainty analysis**: The regression error of the ViT model is the smallest, which is 0.383 (IQR: 0.670), significantly lower than other models. ### Conclusion: This study has successfully developed an automatic grading method for hip osteoarthritis based on CT images, which can achieve high accuracy in multi - level classification. The uncertainty estimation of the model helps to identify potential classification errors and improves the reliability and practicality of the model. This method is expected to perform disease - progression analysis in large - scale databases and provide strong support for clinical diagnosis.