Rapid Prediction and Evaluation of COVID-19 Epidemic in the United States Based on Feature Selection and Improved ARIMAX Model

Yichi Li, Shenglan Chu, Huan Zhao, Fumei Rong, Chenglin Liu, Suling Zhao, Zhen Wang, Zhouqiang Xiong
DOI: https://doi.org/10.1145/3469213.3471327
2021-05-28
Abstract: Purpose: To rapidly predict and evaluate the short-term development trend of the COVID-19 epidemic in the United States, and to provide an effective basis for predicting, preventing, and controlling subsequent spread of the epidemic. Method: A feature selection method based on stepwise regression is applied to the United States COVID-19 epidemic data set covering January 13, 2020 to January 16, 2021, and computer programs are used to mine the large number of indicators that reflect the state of the epidemic. After statistical testing, the ARIMA model and an improved ARIMAX model based on feature selection are used to quickly estimate the short-term development trend of the COVID-19 epidemic in the United States. Result: The computed results show that the traditional ARIMA model cannot predict the cumulative number of COVID-19 diagnoses in the United States well, whereas the improved ARIMAX model using features selected by stepwise regression can accurately predict the scale of the epidemic in the United States within the 95% confidence interval. The U.S. epidemic is predicted to show a clear upward trend over the next 60 days; by mid-March, the cumulative number of confirmed diagnoses in the country is predicted to be about 3,724,000, the cumulative death toll about 476,000, and the number of people in ICU wards about 22,118. © 2021 Association for Computing Machinery. All rights reserved.
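The feature selection step described in the Method can be illustrated with a minimal sketch of forward stepwise regression. This is not the authors' implementation: the abstract does not state the selection criterion, so AIC is assumed here, and the indicator names and synthetic data are hypothetical. In a full pipeline, the selected columns would then be supplied as exogenous regressors to an ARIMAX fit, which is not shown.

```python
import numpy as np


def ols_aic(X, y, cols):
    """Fit OLS (intercept plus the given columns) and return the Gaussian AIC."""
    n = len(y)
    A = np.column_stack([np.ones(n)] + [X[:, j] for j in cols])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    rss = float(np.sum((y - A @ beta) ** 2))
    return n * np.log(rss / n) + 2 * A.shape[1]


def forward_stepwise(X, y, names):
    """Greedily add the feature that lowers AIC most; stop when none does."""
    selected = []
    remaining = list(range(X.shape[1]))
    best_aic = ols_aic(X, y, selected)  # intercept-only baseline
    while remaining:
        aic, j = min((ols_aic(X, y, selected + [j]), j) for j in remaining)
        if aic >= best_aic:
            break
        best_aic = aic
        selected.append(j)
        remaining.remove(j)
    return [names[j] for j in selected]


# Synthetic data (assumption): only two of the four indicators drive the target.
rng = np.random.default_rng(0)
names = ["tests", "mobility", "hospitalized", "noise"]  # hypothetical names
X = rng.normal(size=(200, 4))
y = 3.0 * X[:, 0] - 2.0 * X[:, 2] + rng.normal(scale=0.5, size=200)

chosen = forward_stepwise(X, y, names)
print(chosen)
```

Because AIC is a fairly liberal criterion, an uninformative column can occasionally be kept alongside the informative ones; a stricter rule, such as a significance test on each added coefficient, would trade that for a higher risk of dropping weak but real predictors.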
Medicine