Explainable AI Enhances Glaucoma Referrals, Yet the Human-AI Team Still Falls Short of the AI Alone

Catalina Gomez,Ruolin Wang,Katharina Breininger,Corinne Casey,Chris Bradley,Mitchell Pavlak,Alex Pham,Jithin Yohannan,Mathias Unberath
2024-05-24
Abstract:Primary care providers are vital for initial triage and referrals to specialty care. In glaucoma, asymptomatic and fast progression can lead to vision loss, necessitating timely referrals to specialists. However, primary eye care providers may not identify urgent cases, potentially delaying care. Artificial Intelligence (AI) offering explanations could enhance their referral decisions. We investigate how various AI explanations help providers distinguish between patients needing immediate or non-urgent specialist referrals. We built explainable AI algorithms to predict glaucoma surgery needs from routine eyecare data as a proxy for identifying high-risk patients. We incorporated intrinsic and post-hoc explainability and conducted an online study with optometrists to assess human-AI team performance, measuring referral accuracy and analyzing interactions with AI, including agreement rates, task time, and user experience perceptions. AI support enhanced referral accuracy among 87 participants (59.9%/50.8% with/without AI), though Human-AI teams underperformed compared to AI alone. Participants believed they included AI advice more when using the intrinsic model, and perceived it more useful and promising. Without explanations, deviations from AI recommendations increased. AI support did not increase workload, confidence, and trust, but reduced challenges. On a separate test set, our black-box and intrinsic models achieved an accuracy of 77% and 71%, respectively, in predicting surgical outcomes. We identify opportunities of human-AI teaming for glaucoma management in primary eye care, noting that while AI enhances referral accuracy, it also shows a performance gap compared to AI alone, even with explanations. Human involvement remains essential in medical decision making, underscoring the need for future research to optimize collaboration, ensuring positive experiences and safe AI use.
Human-Computer Interaction,Artificial Intelligence
What problem does this paper attempt to address?
This paper discusses how to enhance the accuracy of glaucoma referrals using interpretable artificial intelligence (AI). In the study, the authors built an interpretable AI algorithm to predict whether patients need glaucoma surgery based on routine ophthalmic data, acting as a proxy for identifying high-risk patients. They conducted an online study involving optometrists to evaluate the performance of humans and the AI team in terms of referral accuracy, interaction protocol with AI, task time, and user experience perception. The results showed that AI support improved referral accuracy for 87 participants (59.9% compared to 50.8%), but the performance of humans working with the AI team was still inferior to AI decisions alone. When using an embedded model, participants reported considering AI recommendations more and considered them to be more useful and promising. In cases without explanations, participants deviated from AI recommendations more frequently. AI support did not increase workload, confidence, and trust but reduced challenges. In an independent test set, the accuracy of the black-box model and the embedded model in predicting surgical outcomes was 77% and 71%, respectively. The study emphasized that although AI can improve referral accuracy, there is still a gap between its performance with explanations and AI decisions alone. Human involvement in medical decision-making remains essential, and future research should optimize the collaboration between humans and AI to ensure a positive experience and safe use of AI.