On classifying sepsis heterogeneity in the ICU: insight using machine learning

Zina M Ibrahim,Honghan Wu,Ahmed Hamoud,Lukas Stappen,Richard J B Dobson,Andrea Agarossi
DOI: https://doi.org/10.1093/jamia/ocz211
2020-01-17
Journal of the American Medical Informatics Association
Abstract:Abstract Objectives Current machine learning models aiming to predict sepsis from electronic health records (EHR) do not account 20 for the heterogeneity of the condition despite its emerging importance in prognosis and treatment. This work demonstrates the added value of stratifying the types of organ dysfunction observed in patients who develop sepsis in the intensive care unit (ICU) in improving the ability to recognize patients at risk of sepsis from their EHR data. Materials and Methods Using an ICU dataset of 13 728 records, we identify clinically significant sepsis subpopulations with distinct organ dysfunction patterns. We perform classification experiments with random forest, gradient boost trees, and support vector machines, using the identified subpopulations to distinguish patients who develop sepsis in the ICU from those who do not. Results The classification results show that features selected using sepsis subpopulations as background knowledge yield a superior performance in distinguishing septic from non-septic patients regardless of the classification model used. The improved performance is especially pronounced in specificity, which is a current bottleneck in sepsis prediction machine learning models. Conclusion Our findings can steer machine learning efforts toward more personalized models for complex conditions including sepsis.
information science & library science,computer science, information systems, interdisciplinary applications,health care sciences & services,medical informatics
What problem does this paper attempt to address?