Abstract: The interpretability of deep learning is crucial for evaluating the reliability of medical imaging models and reducing the risks of inaccurate patient recommendations. This study addresses the "human out of the loop" and "trustworthiness" issues in medical image analysis by integrating medical professionals into the interpretability process. We propose a disease-weighted attention map refinement network (DWARF) that leverages expert feedback to enhance model relevance and accuracy. Our method employs cyclic training to iteratively improve diagnostic performance, generating precise and interpretable feature maps. Experimental results demonstrate significant improvements in interpretability and diagnostic accuracy across multiple medical imaging datasets. This approach fosters effective collaboration between AI systems and healthcare professionals, ultimately aiming to improve patient outcomes.

What problem does this paper attempt to address?

The paper aims to improve the interpretability and diagnostic accuracy of deep-learning models in medical image analysis by integrating the knowledge and feedback of medical professionals into model training, so that the model's outputs are more clinically relevant and reliable. Specifically, it proposes the Disease-Weighted Attention map Refinement Network (DWARF) to address the following problems:

1. **The "human-out-of-the-loop" problem**: Traditional methods often leave medical professionals out of training and application, which limits model interpretability and makes it difficult to gain the trust of clinicians.
2. **The trustworthiness problem**: Existing medical AI systems suffer from issues such as shortcut learning and misattribution, which undermine their reliability and credibility.
3. **The alignment problem between attention maps and medical knowledge**: The attention maps generated by many medical AI systems do not fully match the regions of actual clinical concern, which reduces the accuracy and practical usefulness of the model.

To address these problems, the DWARF framework improves on existing methods in the following ways:

- **Introducing expert feedback**: Annotations and feedback from medical professionals are used to optimize attention maps, so that the model's explanations better match actual clinical needs.
- **Cyclic training mechanism**: Iterative training continuously improves the model's ability to recognize and segment specific diseases, thereby improving diagnostic performance.
- **Disease-specific head module**: A dedicated head module for each disease maps the original attention map to a more accurate segmentation map, increasing the model's sensitivity to that disease.

Through these improvements, DWARF improves both classification performance and the quality of attention maps, and it strengthens clinicians' confidence in AI-assisted diagnostic tools. Experimental results show that DWARF significantly outperforms baseline models on multiple medical image datasets.
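The idea of a disease-specific head that is refined against expert annotations over several training cycles can be illustrated with a toy sketch. This is not the paper's implementation: the names `DiseaseHead`, `soft_dice`, and `refine` are hypothetical, the head is assumed to be a simple pixel-wise affine map followed by a sigmoid, the loss is assumed to be binary cross-entropy, and expert feedback is simulated by a fixed binary mask over a flattened six-pixel attention map.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def soft_dice(pred, mask, eps=1e-6):
    """Soft Dice overlap between a predicted map and a binary expert mask."""
    inter = sum(p * m for p, m in zip(pred, mask))
    return (2.0 * inter + eps) / (sum(pred) + sum(mask) + eps)


class DiseaseHead:
    """Hypothetical per-disease head: pixel-wise affine map plus sigmoid.

    A stand-in for illustration only; the paper's head architecture
    is not specified here.
    """

    def __init__(self, scale=1.0, bias=0.0):
        self.scale = scale
        self.bias = bias

    def forward(self, attention):
        # Map raw attention values to per-pixel disease probabilities.
        return [sigmoid(self.scale * a + self.bias) for a in attention]

    def step(self, attention, mask, lr=0.5):
        # One gradient-descent step on mean binary cross-entropy.
        # For a sigmoid output, d(BCE)/d(logit) = pred - mask.
        pred = self.forward(attention)
        n = len(attention)
        g_scale = sum((p - m) * a for p, m, a in zip(pred, mask, attention)) / n
        g_bias = sum(p - m for p, m in zip(pred, mask)) / n
        self.scale -= lr * g_scale
        self.bias -= lr * g_bias


def refine(head, attention, mask, cycles=5, steps_per_cycle=50):
    """Run several training cycles and record the Dice overlap after each."""
    history = []
    for _ in range(cycles):
        for _ in range(steps_per_cycle):
            head.step(attention, mask)
        history.append(soft_dice(head.forward(attention), mask))
    return history


# Toy data (assumed): a flattened attention map and an expert-drawn mask.
attention = [0.9, 0.8, 0.2, 0.1, 0.7, 0.05]
expert_mask = [1, 1, 0, 0, 1, 0]

head = DiseaseHead()
baseline = soft_dice(head.forward(attention), expert_mask)
history = refine(head, attention, expert_mask)

print("Dice before refinement:", round(baseline, 3))
print("Dice after each cycle:", [round(d, 3) for d in history])
```

In a real system the expert mask would be updated between cycles as clinicians review the refined maps, and the head would operate on 2D or 3D feature maps from a trained backbone rather than on a short list of numbers.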

DWARF: Disease-weighted network for attention map refinement
