Artificial Intelligence (AI) Based Prediction of Mortality, for COVID-19 Patients

Mahbubunnabi Tamal, Mohammad Marufur Rahman, Maryam Alhasim, Mobarak Al Mulhim, Mohamed Deriche
2024-03-28
Abstract: For severely affected COVID-19 patients, it is crucial to identify high-risk patients and predict survival and the need for intensive care (ICU). Most of the proposed models are not well reported, making them less reproducible and prone to a high risk of bias, particularly in the presence of imbalanced data or classes. In this study, the performance of nine machine and deep learning algorithms, in combination with two widely used feature selection methods, was investigated to predict last status (representing mortality), ICU requirement, and ventilation days. Fivefold cross-validation was used for training and validation. To minimize bias, the training and testing sets were split so that they maintained similar distributions. Only 10 out of 122 features were found to be useful in prediction modelling, with acute kidney injury during hospitalization being the most important one. The algorithms' performance depends on the number of features and on the data pre-processing techniques. LSTM performs the best in predicting last status and ICU requirement, with 90%, 92%, 86% and 95% accuracy, sensitivity, specificity, and AUC respectively. DNN performs the best in predicting ventilation days, with 88% accuracy. Considering all the factors and limitations, including the absence of the exact time point of clinical onset, LSTM with carefully selected features can accurately predict last status and ICU requirement. DNN performs the best in predicting ventilation days. An appropriate machine learning algorithm with carefully selected features and balanced data can accurately predict mortality, ICU requirement and ventilation support. Such a model can be very useful in emergency and pandemic where prompt and precise
Machine Learning
What problem does this paper attempt to address?
This paper aims to identify high-risk patients among COVID-19 patients and to predict whether they will need intensive care unit (ICU) admission and mechanical ventilation support. Specifically, the research objectives include:

1. **Predicting mortality**: Using patients' clinical data, predict whether a patient will die from COVID-19.
2. **Predicting ICU needs**: Determine which patients are likely to need admission to the intensive care unit for a higher level of care.
3. **Predicting the number of ventilation days**: Estimate how long a patient will need mechanical ventilation support.

To achieve these goals, the researchers used 122 parameters and evaluated nine machine-learning and deep-learning algorithms in combination with two commonly used feature-selection methods (random forest and extra-trees classifier). Five-fold cross-validation was used to train and validate the models. The study also paid special attention to the data imbalance problem and balanced the data distribution through over-sampling and under-sampling techniques.

The study found that LSTM performed best in predicting the "last status" (i.e., survival or death) and "ICU requirement", with accuracy, sensitivity, and specificity of 90%, 92%, and 86% respectively, and an AUC of 95%. DNN performed best in predicting the "number of ventilation days", with an accuracy of 88%. The performance of these models depends not only on the number of selected features but also on the data pre-processing techniques.

In conclusion, the study indicates that an appropriate machine-learning or deep-learning algorithm, combined with carefully selected features and balanced data, can accurately predict mortality, ICU requirement, and ventilation support in COVID-19 patients, which is useful for emergency decision-making and resource allocation during an epidemic.
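The evaluation workflow summarized above (class balancing, tree-based feature selection, stratified five-fold cross-validation, and reporting accuracy, sensitivity, specificity, and AUC) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and not the authors' code: the data are synthetic (generated with scikit-learn's `make_classification`), the balancing step is plain random over-sampling of the minority class, and a random forest stands in for the LSTM and DNN models reported in the paper. The helper names `oversample_minority` and `evaluate` are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier
from sklearn.metrics import confusion_matrix, roc_auc_score
from sklearn.model_selection import StratifiedKFold


def oversample_minority(X, y, rng):
    """Randomly duplicate rows of smaller classes until all classes match the largest."""
    classes, counts = np.unique(y, return_counts=True)
    target = counts.max()
    parts = []
    for cls, count in zip(classes, counts):
        cls_idx = np.flatnonzero(y == cls)
        n_extra = target - count
        extra = rng.choice(cls_idx, size=n_extra, replace=True) if n_extra > 0 else cls_idx[:0]
        parts.append(np.concatenate([cls_idx, extra]))
    idx = np.concatenate(parts)
    return X[idx], y[idx]


def evaluate(X, y, n_features=10, n_splits=5, seed=0):
    """Stratified k-fold evaluation; balancing and feature selection use the training fold only."""
    rng = np.random.default_rng(seed)
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    folds = []
    for train_idx, test_idx in skf.split(X, y):
        # Balance only the training data so the test fold keeps the original class ratio.
        X_tr, y_tr = oversample_minority(X[train_idx], y[train_idx], rng)

        # Rank features by extra-trees importance and keep the top n_features.
        ranker = ExtraTreesClassifier(n_estimators=100, random_state=seed).fit(X_tr, y_tr)
        top = np.argsort(ranker.feature_importances_)[::-1][:n_features]

        # Stand-in classifier (assumption): the paper's best models were LSTM and DNN.
        clf = RandomForestClassifier(n_estimators=100, random_state=seed)
        clf.fit(X_tr[:, top], y_tr)

        prob = clf.predict_proba(X[test_idx][:, top])[:, 1]
        pred = (prob >= 0.5).astype(int)
        tn, fp, fn, tp = confusion_matrix(y[test_idx], pred, labels=[0, 1]).ravel()
        folds.append({
            "accuracy": (tp + tn) / (tp + tn + fp + fn),
            "sensitivity": tp / (tp + fn),   # true positive rate
            "specificity": tn / (tn + fp),   # true negative rate
            "auc": roc_auc_score(y[test_idx], prob),
        })
    # Average each metric over the folds.
    return {name: float(np.mean([f[name] for f in folds])) for name in folds[0]}


# Synthetic, imbalanced stand-in for a 122-feature clinical table (not real patient data).
X, y = make_classification(
    n_samples=600, n_features=122, n_informative=10,
    weights=[0.8, 0.2], random_state=0,
)
metrics = evaluate(X, y)
print(metrics)
```

Balancing and feature ranking are done inside each fold, on the training portion only, so that no information from the held-out fold leaks into the model. Whether the original study followed this exact ordering is not stated in the summary above.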