Severity Detection for the Coronavirus Disease 2019 (COVID-19) Patients Using a Machine Learning Model Based on the Blood and Urine Tests

Haochen Yao, Nan Zhang, Ruochi Zhang, Meiyu Duan, Tianqi Xie, Jiahui Pan, Ejun Peng, Juanjuan Huang, Yingli Zhang, Xiaoming Xu, Hong Xu, Fengfeng Zhou, Guoqing Wang
DOI: https://doi.org/10.3389/fcell.2020.00683
IF: 5.5
2020-07-31
Frontiers in Cell and Developmental Biology
Abstract: The recent outbreak of coronavirus disease 2019 (COVID-19) has posed serious challenges to society in China and across the world. COVID-19 induces pneumonia in human hosts and is highly contagious between people. COVID-19 patients may develop severe symptoms, and some die of major organ failure. This study used machine learning algorithms to build a COVID-19 severity detection model. A support vector machine (SVM) demonstrated promising detection accuracy after 32 features were found to be significantly associated with COVID-19 severity. These 32 features were further screened for inter-feature redundancies. The final SVM model was trained using 28 features and achieved an overall accuracy of 0.8148. This work may facilitate estimating the risk that COVID-19 patients will develop severe symptoms. The 28 severity-associated biomarkers may also be investigated for the underlying mechanisms by which they are involved in COVID-19 infection.
cell biology, developmental biology
What problem does this paper attempt to address?
This paper addresses the problem of predicting disease severity in COVID-19 patients from blood and urine test data using a machine learning model. Specifically, the study aims to distinguish severe from mild cases so that clinicians can better assess patient risk, act in time, and reduce the incidence of severe cases and mortality. The authors built the detection model with the support vector machine (SVM) algorithm. From 100 initial features (8 clinical indicators, 76 blood test values, and 16 urine test values), they identified 32 that were significantly associated with COVID-19 severity. Screening these for inter-feature redundancy reduced the set to 28 features, and the final model reached an overall accuracy of 0.8148. Beyond supporting severity assessment, the study may provide clues for identifying biomarkers of the disease and, in turn, potential targets for drug development.
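The two-stage feature selection described above (keep features associated with severity, then drop redundant ones) can be illustrated with a minimal sketch. This is not the authors' code: the summary does not say which statistical test or redundancy criterion was used, so Welch's t-statistic, Pearson correlation, the cutoffs `t_cutoff` and `r_cutoff`, and the toy feature names are all assumptions made for illustration. The SVM training step is left out; the surviving features would be passed to a classifier afterwards.

```python
import math
import random


def welch_t(a, b):
    """Welch's t-statistic for two independent samples (assumed test)."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    var_a = sum((x - mean_a) ** 2 for x in a) / (len(a) - 1)
    var_b = sum((x - mean_b) ** 2 for x in b) / (len(b) - 1)
    denom = math.sqrt(var_a / len(a) + var_b / len(b))
    return (mean_a - mean_b) / denom if denom > 0 else 0.0


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y) if sd_x > 0 and sd_y > 0 else 0.0


def select_features(columns, labels, t_cutoff=2.0, r_cutoff=0.9):
    """columns: {feature name: values}; labels: 0 = mild, 1 = severe.

    Both cutoffs are hypothetical values, not taken from the paper.
    """
    # Stage 1: keep features whose severe/mild group means differ strongly.
    scores = {}
    for name, values in columns.items():
        severe = [v for v, y in zip(values, labels) if y == 1]
        mild = [v for v, y in zip(values, labels) if y == 0]
        t = abs(welch_t(severe, mild))
        if t >= t_cutoff:
            scores[name] = t

    # Stage 2: visit features from strongest to weakest and skip any
    # feature that is highly correlated with one already kept.
    kept = []
    for name in sorted(scores, key=scores.get, reverse=True):
        if all(abs(pearson(columns[name], columns[k])) < r_cutoff for k in kept):
            kept.append(name)
    return kept


def make_toy_data(n=60, seed=0):
    """Synthetic data with hypothetical feature names, for illustration only."""
    rng = random.Random(seed)
    labels = [i % 2 for i in range(n)]
    marker_a = [rng.gauss(3.0 * y, 1.0) for y in labels]  # differs by group
    marker_b = [2.0 * v + 1.0 for v in marker_a]          # linear copy of marker_a
    flat = [(-1.0) ** (i // 2) for i in range(n)]         # same mean in both groups
    return {"marker_a": marker_a, "marker_b": marker_b, "flat": flat}, labels


if __name__ == "__main__":
    columns, labels = make_toy_data()
    print(select_features(columns, labels))
```

On the toy data, the uninformative feature is removed in the first stage, and only one of the two perfectly correlated markers survives the second stage. Real laboratory data would also need handling for missing values and multiple-testing correction, which this sketch omits.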