The Impact of AI Explanations on Clinicians Trust and Diagnostic Accuracy in Breast Cancer

Olya Rezaeian,Onur Asan,Alparslan Emrah Bayrak
2024-12-16
Abstract:Advances in machine learning have created new opportunities to develop artificial intelligence (AI)-based clinical decision support systems using past clinical data and improve diagnosis decisions in life-threatening illnesses such breast cancer. Providing explanations for AI recommendations is a possible way to address trust and usability issues in black-box AI systems. This paper presents the results of an experiment to assess the impact of varying levels of AI explanations on clinicians' trust and diagnosis accuracy in a breast cancer application and the impact of demographics on the findings. The study includes 28 clinicians with varying medical roles related to breast cancer diagnosis. The results show that increasing levels of explanations do not always improve trust or diagnosis performance. The results also show that while some of the self-reported measures such as AI familiarity depend on gender, age and experience, the behavioral assessments of trust and performance are independent of those variables.
Human-Computer Interaction
What problem does this paper attempt to address?
The core problem that this paper attempts to solve is: **Can providing explanations improve clinicians' trust and diagnostic accuracy in artificial intelligence (AI) - based clinical decision support systems (CDSS) in breast cancer diagnosis?** Specifically, the study aims to evaluate the impact of different levels of AI interpretability on clinicians' trust and diagnostic performance, and explore the role of demographic characteristics (such as age, gender, and AI familiarity) in this regard. ### Research Questions 1. **RQ1**: Can providing explanations improve human decision - making performance in breast cancer detection and trust in AI - based clinical decision support systems? 2. **RQ2**: How do clinicians' demographic characteristics (such as age, gender, and AI familiarity) affect their trust and performance in AI - based clinical decision support systems? ### Methods To answer these questions, the researchers designed an experiment to evaluate the impact of AI interpretability on clinicians through different intervention conditions. The experiment includes five conditions: - **Baseline**: Clinicians make diagnoses solely based on their own judgment. - **Intervention I (Classification)**: AI provides classification suggestions (healthy, benign tumor, malignant tumor) without providing explanations. - **Intervention II (Probability Distribution)**: Based on the classification suggestions, provide probability estimates for each category. - **Intervention III (Tumor Localization)**: Based on the probability estimates, provide an estimate of the tumor location. - **Intervention IV (Enhanced Tumor Localization with Confidence Levels)**: Based on the tumor location estimate, provide high - confidence and low - confidence estimates. ### Results The experimental results show that increasing the level of explanation does not always improve clinicians' trust or diagnostic performance. Moreover, although some self - reported measures (such as AI familiarity) depend on gender, age, and experience, behaviorally - assessed trust and performance are independent of these variables. ### Significance This study provides valuable insights into how to integrate interpretability in medical AI systems, which is helpful for more informed data - driven clinical decision - making and improving patient prognosis.