Looking into Concept Explanation Methods for Diabetic Retinopathy Classification

Andrea M. Storås,Josefine V. Sundgaard
DOI: https://doi.org/10.59275/j.melba.2024-e7fd
2024-10-04
Abstract:Diabetic retinopathy is a common complication of diabetes, and monitoring the progression of retinal abnormalities using fundus imaging is crucial. Because the images must be interpreted by a medical expert, it is infeasible to screen all individuals with diabetes for diabetic retinopathy. Deep learning has shown impressive results for automatic analysis and grading of fundus images. One drawback is, however, the lack of interpretability, which hampers the implementation of such systems in the clinic. Explainable artificial intelligence methods can be applied to explain the deep neural networks. Explanations based on concepts have shown to be intuitive for humans to understand, but have not yet been explored in detail for diabetic retinopathy grading. This work investigates and compares two concept-based explanation techniques for explaining deep neural networks developed for automatic diagnosis of diabetic retinopathy: Quantitative Testing with Concept Activation Vectors and Concept Bottleneck Models. We found that both methods have strengths and weaknesses, and choice of method should take the available data and the end user's preferences into account.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The problem this paper attempts to address is the interpretability of deep neural networks in the automatic diagnosis of Diabetic Retinopathy (DR). Specifically, the authors focus on the following points: 1. **The contradiction between automatic diagnosis and interpretability**: Although deep learning has achieved significant results in the automatic analysis and grading of fundus images, these models are often difficult to interpret, which hinders their application in clinical practice. Doctors may refuse to use these systems if they do not understand why the model makes specific predictions. 2. **Application of concept-based explanation methods**: To improve the interpretability of the models, the authors explore two concept-based explanation methods—Concept Activation Vectors (TCAV) and Concept Bottleneck Models (CBMs). These methods aim to explain the model's decision-making process by measuring the model's dependence on high-level clinical findings (such as hemorrhages, microaneurysms, etc.). 3. **Comparison of different explanation methods**: The authors compare the advantages and disadvantages of these two methods in explaining the grading of diabetic retinopathy and evaluate their effectiveness in practical applications. The goal of the study is to provide clinicians with more intuitive and understandable explanations, thereby enhancing their trust and acceptance of deep learning models. In summary, the main purpose of this paper is to improve the interpretability and clinical applicability of automatic diabetic retinopathy diagnosis systems by introducing and evaluating concept-based explanation methods.