Machine learning analysis of the UNOS database fails to predict lung transplant outcomes
Lucy Luo,Marcin Możejko,Nikolay Markov,Alec Peltekian,Suror Mohsin,Mary Carns,Phillip Cooper,Jeffrey Lysne,Anthony Joudi,Alan Betensley,Bradford C Bemiss,Catherine Myers,Ankit Bharat,Rade Tomic,Ambalavanan Arunachalam,Ewa Szczurek,GR Scott Budinger,Alexander V Misharin,Mrinalini Venkata Subramani
DOI: https://doi.org/10.1101/2024.10.19.24315817
2024-10-21
Abstract:Background: Lung transplantation is the only life-saving therapy for end-stage lung disease. However, lung transplantation has the worst survival among all solid organ transplants.1 We applied machine learning to a large standardized electronic health record (EHR) dataset from the United Network for Organ Sharing (UNOS) to test whether pre-transplant and peri-transplant donor and recipient features can predict one-, three- and five-year survival, or favorable long-term outcomes in lung transplant. Methods: We used data from 43,869 first time lung transplant recipients >18 years old from 1987 to November 2022 for whom one-, three-, and five-year survival outcomes were available. We applied XGBoost or a tabular BERT model called EHRFormer to the UNOS EHR dataset. Results: Using pre-transplant features XGBoost predicted one year mortality with a test AUC = 0.6 [0.57, 0.64] 95% CI. Addition of peri-transplant features only modestly improved AUC for one-year mortality prediction (test AUC = 0.63 [0.60, 0.67] 95% CI and 0.64 [0.63, 0.66] 95% CI for XGBoost and EHRFormer, respectively). Top predictive features of one year mortality using peri-transplant features from each model were length of index stay, transplant type, recipient age, ventilation status during the index stay, and creatinine at the time of transplant. Both XGBoost and EHRFormer performed better when predicting lung function at one-year post-transplant (XGBoost test AUC = 0.74; EHRFormer test AUC = 0.76). Both models identified and used features previously associated with transplant outcomes to inform predictions. Conclusions: Despite machine learning approaches identifying known risk factors for transplant outcomes, EHR data collected by UNOS poorly predict one-, three-, and five-year mortality outcomes of lung transplantation. These results suggest caution when using pre-transplant EHR features to predict lung transplant outcomes.