Abstract: To mitigate the impact of the pandemic, several measures have been adopted, including lockdowns, rapid vaccination programs, school closures, and economic stimulus. These interventions can have positive or unintended negative consequences. Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies they can explore (discrete vs continuous). We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000 individual) epidemiological agent-based simulation where we perform multi-objective optimization. We determine the optimal policy for lockdown and vaccination in a minimalist age-stratified multi-vaccine scenario with a basic simulation for economic activity. With no lockdown and with vaccination (mid-age and elderly), results show optimal economy (individuals below the poverty line) with balanced health objectives (infection and hospitalization). An in-depth simulation is needed to further validate our results and open-source our framework.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to balance the impacts on public health and economic activities through optimizing intervention measures during the outbreak of large-scale pandemics (such as COVID-19). Specifically, the researchers utilized the Deep Deterministic Policy Gradient (DDPG) algorithm to conduct multi-objective optimization in a large-scale agent-based epidemiological model with 100,000 individuals to determine the optimal lockdown and vaccination strategies.

### Background of the Paper

- **Epidemic Situation Background**: The COVID-19 pandemic has led to more than 460 million infections, 6 million deaths, and a 3% decline in the global GDP in 2020.
- **Existing Intervention Measures**: Governments of various countries have adopted a variety of measures to control the epidemic, including lockdowns, rapid vaccination programs, school closures, and economic stimuli. These measures may bring positive or negative consequences.
- **Limitations of Existing Research**:
  - Limited simulation targets.
  - Small scale (usually only a few thousand individuals).
  - The model type is not suitable for intervention research.
  - Limited number of explored intervention strategies (discrete vs continuous).

### Research Objectives

- **Optimization Objectives**: In a large-scale (100,000 individuals) agent-based epidemiological model, determine the optimal lockdown and vaccination strategies through multi-objective optimization.
- **Specific Problems**:
  - How to maximize economic activities while minimizing infection and hospitalization rates?
  - How to allocate vaccine resources among different age groups?
  - How to determine the best time and duration of lockdown?

### Methods

- **Models**:
  - **SEIR Model**: Extended to 9 compartments, including Susceptible, Exposed, Asymptomatic, Pre-Symptomatic, Infected Mild, Infected Severe, Hospitalized, etc.
  - **Economic Model**: Each household has savings, daily income and expenditure, and the health status of family members affects economic activities.
- **Optimization Algorithm**:
  - **DDPG**: Used to handle continuous optimization states and intervention actions, such as the start and end times of lockdowns, vaccination times for different age groups, etc.
- **Evaluation Criteria**:
  - **Health Indicators**: The number of mildly infected people and the number of hospitalized people.
  - **Economic Indicator**: The number of people below the poverty line.

### Experimental Setup

- **Experimental Parameters**:
  - Initial infection rates: 15% and 1%.
  - Vaccine effectiveness: high-efficiency and low-efficiency.
  - Vaccine availability: high-availability and low-availability.
- **Reward Functions**:
  - Health reward ($H_{Rew}$): $-1 \times (InfectedMild + Hospitalized)$.
  - Economic reward ($E_{Rew}$): $-1 \times (Below\_Poverty\_Line)$.
  - Total reward ($T_{Rew}$): $H_{Rew} + \kappa \times E_{Rew}$, where $\kappa$ is the mixing factor.

### Results

- **Results in Different Scenarios**:
  - **High Initial Infection Rate**: Whether lockdown is implemented or not, vaccination can significantly reduce infection and hospitalization rates, but has a greater impact on the economy.
  - **Low Initial Infection Rate**: Early vaccination can effectively control the epidemic while having a relatively small impact on the economy.
  - **Different Vaccine Availability and Effectiveness**: Even if the vaccine effect is low, extensive vaccination can still significantly reduce infection and hospitalization rates.

### Discussion

- **Main Conclusions**:
  - Optimization algorithms (such as DDPG) show in policy informatics...
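
The reward functions listed under Experimental Setup can be illustrated with a minimal Python sketch. It only mirrors the three formulas given there; the `StepCounts` container, the function names, and the sample numbers are hypothetical, and the summary does not say how the paper's framework actually computes or scales these rewards.

```python
from dataclasses import dataclass


@dataclass
class StepCounts:
    """Population counts at one simulation step (hypothetical container)."""
    infected_mild: int
    hospitalized: int
    below_poverty_line: int


def health_reward(c: StepCounts) -> float:
    # H_Rew = -1 * (InfectedMild + Hospitalized)
    return -1.0 * (c.infected_mild + c.hospitalized)


def economic_reward(c: StepCounts) -> float:
    # E_Rew = -1 * Below_Poverty_Line
    return -1.0 * c.below_poverty_line


def total_reward(c: StepCounts, kappa: float) -> float:
    # T_Rew = H_Rew + kappa * E_Rew, with kappa as the mixing factor
    return health_reward(c) + kappa * economic_reward(c)


# Illustrative numbers only; they are not taken from the paper.
counts = StepCounts(infected_mild=1200, hospitalized=300, below_poverty_line=5000)
print(total_reward(counts, kappa=0.5))  # -4000.0
```

Setting `kappa` to 0 reduces the total reward to the health reward alone, while larger values put more weight on the number of people below the poverty line.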

Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using Deep Deterministic Policy Gradient

Modeling and Optimization of Epidemiological Control Policies Through Reinforcement Learning

PaCAR: COVID-19 Pandemic Control Decision Making via Large-Scale Agent-Based Modeling and Deep Reinforcement Learning

A Game-Theoretic Approach for Hierarchical Epidemic Control

Multilevel Optimization for Policy Design with Agent-Based Epidemic Models

Deep reinforcement learning framework for controlling infectious disease outbreaks in the context of multi-jurisdictions

Epidemiologically and Socio-economically Optimal Policies via Bayesian Optimization

Optimal Policies for a Pandemic: A Stochastic Game Approach and a Deep Learning Algorithm

First 100 days of pandemic; an interplay of pharmaceutical, behavioral and digital interventions -- A study using agent based modeling

City-scale Simulation of Covid-19 Pandemic and Intervention Policies using Agent-based Modelling

Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models

Phase-wise Evaluation and Optimization of Non-Pharmaceutical Interventions to Contain the COVID-19 Pandemic in the U.S.

Data-driven Optimization Model for Global Covid-19 Intervention Plans

Discrete Stochastic Optimization for Public Health Interventions with Constraints

Data-driven Simulation and Optimization for Covid-19 Exit Strategies

Optimization Methods for Disease Prevention and Epidemic Control

A framework for modeling human behavior in large-scale agent-based epidemic simulations

Optimal Control Policies to Address the Pandemic Health-Economy Dilemma

An Integrated Epidemic Simulation Workflow for Submodular Intervention Strategies

A simulation-deep reinforcement learning (SiRL) approach for epidemic control optimization