Advancing Dermatological Diagnostics: Interpretable AI for Enhanced Skin Lesion Classification

Carlo Metta,Andrea Beretta,Riccardo Guidotti,Yuan Yin,Patrick Gallinari,Salvatore Rinzivillo,Fosca Giannotti
DOI: https://doi.org/10.3390/diagnostics14070753
IF: 3.6
2024-04-03
Diagnostics
Abstract:A crucial challenge in critical settings like medical diagnosis is making deep learning models used in decision-making systems interpretable. Efforts in Explainable Artificial Intelligence (XAI) are underway to address this challenge. Yet, many XAI methods are evaluated on broad classifiers and fail to address complex, real-world issues, such as medical diagnosis. In our study, we focus on enhancing user trust and confidence in automated AI decision-making systems, particularly for diagnosing skin lesions, by tailoring an XAI method to explain an AI model's ability to identify various skin lesion types. We generate explanations using synthetic images of skin lesions as examples and counterexamples, offering a method for practitioners to pinpoint the critical features influencing the classification outcome. A validation survey involving domain experts, novices, and laypersons has demonstrated that explanations increase trust and confidence in the automated decision system. Furthermore, our exploration of the model's latent space reveals clear separations among the most common skin lesion classes, a distinction that likely arises from the unique characteristics of each class and could assist in correcting frequent misdiagnoses by human professionals.
medicine, general & internal
What problem does this paper attempt to address?
The paper attempts to address the issue of improving the interpretability of artificial intelligence (AI) decision systems in dermatological lesion diagnosis, thereby enhancing the trust and confidence of medical professionals, novices, and general users in automated decision systems. Specifically: 1. **Improving Interpretability**: In critical scenarios such as medical diagnosis, it is necessary for deep learning models to have interpretability so that users can understand how AI makes decisions. 2. **Enhancing Trust**: By generating example and counterexample images to explain the AI model's ability to recognize various types of skin lesions and providing methods for key feature impact on classification results. 3. **Practical Application Verification**: Through a validation survey involving domain experts, novices, and general users, demonstrating that these explanations can indeed increase trust and confidence in automated decision systems. 4. **Model Latent Space Analysis**: Exploring the model's latent space to reveal clear separations of common skin lesion categories, which helps correct frequent misdiagnoses by human doctors. In summary, this study aims to improve the transparency and reliability of dermatological lesion diagnosis by enhancing existing classification models with interpretability features.