Impact analysis of environmental and social factors on early-stage COVID-19 transmission in China by machine learning

Yifei Han,Jinliang Huang,Rendong Li,Qihui Shao,Dongfeng Han,Xiyue Luo,Juan Qiu
DOI: https://doi.org/10.1016/j.envres.2022.112761
IF: 8.3
2022-05-01
Environmental Research
Abstract:As a highly contagious disease, COVID-19 caused a worldwide pandemic and it is still ongoing. However, the infection in China has been successfully controlled although its initial transmission was also nationwide and has caused a serious public health crisis. The analysis on the early-stage COVID-19 transmission in China is worth investigating for its guiding significance on prevention to other countries and regions. In this study, we conducted the experiments from the perspectives of COVID-19 occurrence and intensity. We eliminated unimportant factors from 113 variables and applied four machine learning-based classification and regression models to predict COVID-19 occurrence and intensity, respectively. The influence of each important factor was analysed when applicable. Our optimal model on COVID-19 occurrence prediction presented an accuracy of 91.91% and the best R2 of intensity prediction reached 0.778. Linear regression-based model was identified as unable to fit and predict the intensity, and thus only the variable influence on COVID-19 occurrence can be explained. We found that (1) CO VID-19 was more likely to occur in prosperous cities closer to the epicentre and located on higher altitudes, (2) and the occurrence was higher under extreme weather and high minimum relative humidity. (3) Most air pollutants increased the risk of COVID-19 occurrence except NO2 and O3, and there existed a lag effect of 6-7 days. (4) NPIs (non-pharmaceutical interventions) did not show apparent effect until two weeks after.
environmental sciences,public, environmental & occupational health
What problem does this paper attempt to address?