Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models

Jayaraman J. Thiagarajan,Peer-Timo Bremer,Rushil Anirudh,Timothy C. Germann,Sara Y. Del Valle,Frederick H. Streitz
DOI: https://doi.org/10.48550/arXiv.2010.08478
2020-10-17
Abstract:A crucial aspect of managing a public health crisis is to effectively balance prevention and mitigation strategies, while taking their socio-economic impact into account. In particular, determining the influence of different non-pharmaceutical interventions (NPIs) on the effective use of public resources is an important problem, given the uncertainties on when a vaccine will be made available. In this paper, we propose a new approach for obtaining optimal policy recommendations based on epidemiological models, which can characterize the disease progression under different interventions, and a look-ahead reward optimization strategy to choose the suitable NPI at different stages of an epidemic. Given the time delay inherent in any epidemiological model and the exponential nature especially of an unmanaged epidemic, we find that such a look-ahead strategy infers non-trivial policies that adhere well to the constraints specified. Using two different epidemiological models, namely SEIR and EpiCast, we evaluate the proposed algorithm to determine the optimal NPI policy, under a constraint on the number of daily new cases and the primary reward being the absence of restrictions.
Machine Learning,Computers and Society
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: How to optimize non - pharmaceutical intervention (NPI) policies through machine learning in epidemiological models to balance epidemic prevention and control and economic and social impacts. Specifically, researchers hope to find a method to select appropriate NPIs at different epidemic stages, so as to minimize restrictions on society and the economy on the premise of not exceeding the carrying capacity of medical resources. ### Main problems: 1. **Balancing prevention and control with economic and social impacts**: How to minimize social and economic losses caused by prevention and control measures while effectively controlling the epidemic. 2. **Optimizing non - pharmaceutical intervention (NPI) policies**: Determine the impact of different NPIs (such as social isolation, school closures, etc.) on disease transmission, and formulate optimal prevention and control strategies on this basis. 3. **Coping with time delays and exponential growth**: Considering the time delays (such as the incubation period) in epidemiological models and the exponential growth characteristics when not controlled, how to predict and adjust prevention and control measures in advance to avoid a possible surge in cases in the future. ### Solutions: The author proposes a two - layer look - ahead optimization algorithm based on machine learning to optimize NPI policies in the SEIR and EpiCast epidemiological models. This algorithm combines short - term and long - term rewards to dynamically adjust prevention and control measures, ensuring the maximum freedom of social and economic activities on the premise of not violating the daily new case threshold. ### Key points: - **Short - term rewards**: Evaluate the effect of implementing a certain NPI in the short term (such as 21 days). If it meets the daily new case limit, a corresponding reward will be given; otherwise, no reward will be given. - **Long - term rewards**: Further evaluate the NPI effect in a longer period (such as 35 days), allowing a certain degree of flexibility. Even if some measures may lead to an increase in cases in the future, as long as they can be corrected later, a certain reward can still be obtained. - **Surrogate model**: For the EpiCast model with high computational complexity, use machine learning to construct a surrogate model to accelerate the simulation process and improve the optimization efficiency. Through this method, researchers can dynamically adjust prevention and control policies in a complex epidemiological context to achieve the best balance between epidemic prevention and control and social and economic activities.