Federated Learning of Electronic Health Records Improves Mortality Prediction in Patients Hospitalized with COVID-19.
Akhil Vaid,Suraj K Jaladanki,Jie Xu,Shelly Teng,Arvind Kumar,Samuel Lee,Sulaiman Somani,Ishan Paranjpe,Jessica K De Freitas,Tingyi Wanyan,Kipp W Johnson,Mesude Bicak,Eyal Klang,Young Joon Kwon,Anthony Costa,Shan Zhao,Riccardo Miotto,Alexander W Charney,Erwin Böttinger,Zahi A Fayad,Girish N Nadkarni,Fei Wang,Benjamin S Glicksberg
DOI: https://doi.org/10.1101/2020.08.11.20172809
2020-01-01
Abstract:Machine learning (ML) models require large datasets which may be siloed across different healthcare institutions. Using federated learning, a ML technique that avoids locally aggregating raw clinical data across multiple institutions, we predict mortality within seven days in hospitalized COVID-19 patients. Patient data was collected from Electronic Health Records (EHRs) from five hospitals within the Mount Sinai Health System (MSHS). Logistic Regression with L1 regularization (LASSO) and Multilayer Perceptron (MLP) models were trained using local data at each site, a pooled model with combined data from all five sites, and a federated model that only shared parameters with a central aggregator. Both the federated LASSO and federated MLP models performed better than their local model counterparts at four hospitals. The federated MLP model also outperformed the federated LASSO model at all hospitals. Federated learning shows promise in COVID-19 EHR data to develop robust predictive models without compromising patient privacy.