An Inherently Interpretable AI model improves Screening Speed and Accuracy for Early Diabetic Retinopathy

Kerol R. Djoumessi Donteu,Ziwei Huang,Laura Kuehlewein,Annekatrin Rickmann,Natalia Simon,Lisa M. Koch,Philipp Berens
DOI: https://doi.org/10.1101/2024.06.27.24309574
2024-06-27
Abstract:Background: Diabetic retinopathy (DR) is a frequent concomitant disease of diabetes, affecting millions worldwide. Screening for this disease based on fundus images has been one of the first successful use cases for modern artificial intelligence in medicine. Current state-of-the-art systems typically use black-box models to make referral decisions, requiring post-hoc methods for AI-human interaction. Methods: In this retrospective reader study, we evaluated an inherently interpretable deep learning model, which explicitly models the local evidence of DR as part of its network architecture, for early DR screening. We trained the network on 34,350 high-quality fundus images from a publicly available dataset and validated its state-of-the-art performance on a large range of ten external datasets. We obtained detailed lesion annotations from ophthalmologists on 65 images to study if the class evidence maps highlight clinically relevant information. Finally, we tested the clinical usefulness of our model in a reader study, where we compared screening for DR without AI support to screening with AI support with and without AI explanations. Results: The inherently interpretable deep learning model obtained an accuracy of .906 [.900-.913] (95%-confidence interval) and an AUC of .904 [.894-.913] on the internal test set and similar performance on external datasets. High evidence regions directly extracted from the model contained clinically relevant lesions such as microaneurysms or haemorrhages with a high precision of .960 [.941 - .976]. Decision support by the model highlighting high-evidence regions in the image improved screening accuracy for difficult decisions and improved screening speed. Interpretation: Inherently interpretable deep learning models can reach state-of-the-art performance and support screening for early DR by improving human-AI collaboration.
Ophthalmology
What problem does this paper attempt to address?
The paper aims to address the issue of early screening for Diabetic Retinopathy (DR) and proposes an inherently interpretable artificial intelligence model to improve the speed and accuracy of screening. Specifically, the research objectives include: 1. **Developing an inherently interpretable deep learning model**: The paper introduces a deep learning model based on the sparse BagNet architecture, which can directly extract local evidence maps from its network structure, thereby providing transparency in the decision-making process. 2. **Improving screening performance**: By using this inherently interpretable model, the researchers aim to enhance the accuracy and speed of early-stage screening for Diabetic Retinopathy. 3. **Supporting clinical decision-making**: The model not only makes predictions but also provides information on which image regions led to specific prediction results. This is a very useful tool for clinicians, as it can increase their trust in the AI system's recommended decisions. 4. **Evaluating the model's practicality in real-world settings**: The researchers validated the value of the explanations provided by the model for clinical practice through a retrospective reader study. The results showed that in difficult cases, the model's support significantly improved the accuracy and speed of screening. In summary, the core objective of this paper is to demonstrate the potential application of a novel, inherently interpretable deep learning model in the early screening task for Diabetic Retinopathy, and how it can improve human-machine collaboration by providing trustworthy explanations, thereby enhancing the quality of clinical decision-making.