Development of COVID-19 Booster Vaccine Policy by Microsimulation and Q-learning

Guoxuan Ma, Lili Zhao, Jian Kang
DOI: https://doi.org/10.48550/arXiv.2410.12936
2024-10-17
Abstract: The COVID-19 pandemic highlighted the urgent need for effective vaccine policies, but traditional clinical trials often lack sufficient data to capture the diverse population characteristics necessary for comprehensive public health strategies. Ethical concerns around randomized trials during a pandemic further complicate policy development for public health. Reinforcement Learning (RL) offers a promising alternative for vaccine policy development. However, direct online RL exploration in real-world scenarios can result in suboptimal and potentially harmful decisions. This study proposes a novel framework combining tabular Q-learning with microsimulation (i.e., a Recurrent Neural Network (RNN) environment simulator) to address these challenges in public health vaccine policymaking, which enables effective vaccine policy learning without real-world interaction, addressing both ethical and exploration challenges. The RNN environment simulator captures temporal associations between infection and patient characteristics, generating realistic simulation data. Our tabular Q-learning model produces an interpretable policy table that balances the risks of severe infection against vaccination side effects. Applied to COVID-19 booster policies, the learned Q-learning-based policy outperforms current practices, offering a path toward more effective vaccination strategies.
What problem does this paper attempt to address?
This paper asks how to develop effective COVID-19 booster vaccine policies using Reinforcement Learning (RL) without interacting directly with the real world. Traditional clinical trials often lack sufficient data to capture diverse population characteristics, and conducting randomized trials during a pandemic raises ethical issues. The researchers therefore propose a framework that combines Q-learning with microsimulation (an RNN-based environment simulator) to address these challenges.

### Main problems and solutions

1. **Data insufficiency and representativeness problems**:
   - Data from traditional clinical trials are not sufficient to evaluate vaccine policies comprehensively, because trials usually recruit subjects with specific characteristics who may not represent the entire population.
   - For example, immunosuppressed patients are excluded from some vaccine trials, so effective vaccine policies cannot be developed for this population from trial data alone.
2. **Ethical issues**:
   - Conducting large-scale randomized trials during a pandemic may put unvaccinated groups at risk of infection, raising ethical concerns.
3. **Exploration and safety issues**:
   - Direct online RL exploration may lead to suboptimal or even harmful decisions, especially in the early training stage.

### Solutions

- **Combination of Q-learning and microsimulation**:
  - The Q-learning algorithm learns the policy, while microsimulation (an RNN-based environment simulator) generates a virtual environment, so no direct interaction with the real world is needed.
  - The RNN-based environment simulator captures the temporal associations between infection states and patient characteristics and generates realistic simulation data.
- **Reward function design**:
  - The reward function balances the risk of severe infection against vaccine side effects, so that the learned policy prevents infection effectively while minimizing side effects.

### Specific applications

- **COVID-19 booster policy**:
  - The researchers use this framework to determine whether and when different populations should receive booster shots, thereby optimizing the vaccination strategy.

Through this method, researchers can develop more effective public health vaccine policies using existing data without violating ethical principles.
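To make the framework concrete, the following is a minimal Python sketch of tabular Q-learning trained entirely against a simulator. It is not the authors' implementation. The paper uses an RNN-based environment simulator fitted to data, whereas `simulate_step` here is a hand-written probabilistic stand-in. The state variables (risk group, time since last dose), the probabilities, and the cost weights `SEVERE_COST` and `SIDE_EFFECT_COST` are all assumptions chosen for illustration; only the Q-learning update itself is the standard textbook rule.

```python
import random

# All constants below are hypothetical and chosen only for illustration.
N_RISK = 2               # risk groups: 0 = low, 1 = high
N_WANE = 4               # buckets of time since the last dose
ACTIONS = (0, 1)         # 0 = no booster, 1 = give booster
SEVERE_COST = 10.0       # assumed penalty for a severe infection
SIDE_EFFECT_COST = 1.0   # assumed penalty for a vaccine side effect


def simulate_step(state, action, rng):
    """Toy stand-in for a learned simulator: returns (next_state, reward)."""
    risk, wane = state
    if action == 1:
        wane = 0  # assume a booster resets waning protection
    # Assumed risk model: severe infection is likelier for the high-risk
    # group and as protection wanes; side effects only follow a booster.
    p_severe = 0.02 * (1 + 2 * risk) * (1 + wane)
    p_side = 0.05 if action == 1 else 0.0
    reward = 0.0
    if rng.random() < p_severe:
        reward -= SEVERE_COST
    if rng.random() < p_side:
        reward -= SIDE_EFFECT_COST
    next_state = (risk, min(wane + 1, N_WANE - 1))
    return next_state, reward


def q_update(q, state, action, reward, next_state, alpha, gamma):
    """Apply one standard tabular Q-learning update in place."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


def train(episodes=2000, horizon=12, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a Q-table using only simulated transitions."""
    rng = random.Random(seed)
    q = {(r, w): [0.0, 0.0] for r in range(N_RISK) for w in range(N_WANE)}
    for _ in range(episodes):
        state = (rng.randrange(N_RISK), rng.randrange(N_WANE))
        for _ in range(horizon):
            # Epsilon-greedy exploration happens inside the simulator,
            # so no real patient is exposed to an exploratory action.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[state][a])
            next_state, reward = simulate_step(state, action, rng)
            q_update(q, state, action, reward, next_state, alpha, gamma)
            state = next_state
    return q


def policy_table(q):
    """Collapse the Q-table into a state -> greedy action lookup table."""
    return {s: max(ACTIONS, key=lambda a: q[s][a]) for s in q}


if __name__ == "__main__":
    for state, action in sorted(policy_table(train()).items()):
        print(state, "booster" if action == 1 else "no booster")
```

The resulting dictionary plays the role of the interpretable policy table described in the abstract: each row maps a patient state to a recommended action. Changing the ratio of the two cost constants shifts how readily the learned policy recommends a booster.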