DWARF: Disease-weighted network for attention map refinement

Haozhe Luo,Aurélie Pahud de Mortanges,Oana Inel,Abraham Bernstein,Mauricio Reyes
2024-06-28
Abstract:The interpretability of deep learning is crucial for evaluating the reliability of medical imaging models and reducing the risks of inaccurate patient recommendations. This study addresses the "human out of the loop" and "trustworthiness" issues in medical image analysis by integrating medical professionals into the interpretability process. We propose a disease-weighted attention map refinement network (DWARF) that leverages expert feedback to enhance model relevance and accuracy. Our method employs cyclic training to iteratively improve diagnostic performance, generating precise and interpretable feature maps. Experimental results demonstrate significant improvements in interpretability and diagnostic accuracy across multiple medical imaging datasets. This approach fosters effective collaboration between AI systems and healthcare professionals, ultimately aiming to improve patient outcomes
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to improve the interpretability and diagnostic accuracy of deep - learning models in medical image analysis, especially by integrating the knowledge and feedback of medical professionals into the model training process to enhance the relevance and reliability of the model. Specifically, the paper proposes a Disease - Weighted Attention map Refinement Network (DWARF), aiming to solve the following problems: 1. **The "human - out - of - the - loop" problem**: Traditional methods often ignore the participation of medical professionals during training and application, resulting in insufficient model interpretability and making it difficult to gain the trust of clinicians. 2. **The trustworthiness problem**: Existing medical AI systems have problems such as short - cut learning and misattribution, which affect the reliability and credibility of the system. 3. **The alignment problem between attention maps and medical knowledge**: The attention maps generated by many medical AI systems are not completely consistent with the areas of actual clinical concern, affecting the accuracy and practicality of the model. To solve these problems, the paper proposes the DWARF framework, which improves existing methods in the following ways: - **Introducing expert feedback**: Using the annotations and feedback of medical professionals to optimize attention maps, making the model's explanations more in line with actual clinical needs. - **Recurrent training mechanism**: Continuously improving the model's ability to recognize and segment specific diseases through iterative training, thereby improving diagnostic performance. - **Disease - specific head module**: Introducing a dedicated head module for each disease, mapping the original attention map to a more accurate segmentation map, enhancing the model's sensitivity to specific diseases. Through these improvements, DWARF not only improves classification performance and the quality of attention maps but also enhances clinicians' confidence in AI - assisted diagnostic tools. Experimental results show that DWARF significantly outperforms other baseline models on multiple medical image datasets.