Machine learning is more accurate and biased than risk scoring tools in the prediction of postoperative atrial fibrillation after cardiac surgery

Joyce C Ho,Shalmali Joshi,Eduardo Valverde,Kathryn Wood,Kendra J Grubb,Miguel A Leal,Vicki Stover Hertzberg
DOI: https://doi.org/10.1101/2024.07.05.24310013
2024-07-07
Abstract:Incidence of postoperative atrial fibrillation (POAF) after cardiac surgery remains high and is associated with adverse patient outcomes. Risk scoring tools have been developed to predict POAF, yet discrimination performance remains moderate. Machine learning (ML) models can achieve better performance but may exhibit performance heterogeneity across race and sex subpopulations. We evaluate 8 risk scoring tools and 6 ML models on a heterogeneous cohort derived from electronic health records. Our results suggest that ML models achieve higher discrimination yet are less fair, especially with respect to race. Our findings highlight the need for building accurate and fair ML models to facilitate consistent and equitable assessment of POAF risk.
Health Informatics
What problem does this paper attempt to address?