A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions

Luis Miralles-Pechuán,Fernando Jiménez,Hiram Ponce,Lourdes Martínez-Villaseñor
DOI: https://doi.org/10.48550/arXiv.2005.07656
2020-05-16
Abstract:Whenever countries are threatened by a pandemic, as is the case with the COVID-19 virus, governments should take the right actions to safeguard public health as well as to mitigate the negative effects on the economy. In this regard, there are two completely different approaches governments can take: a restrictive one, in which drastic measures such as self-isolation can seriously damage the economy, and a more liberal one, where more relaxed restrictions may put at risk a high percentage of the population. The optimal approach could be somewhere in between, and, in order to make the right decisions, it is necessary to accurately estimate the future effects of taking one or other measures. In this paper, we use the SEIR epidemiological model (Susceptible - Exposed - Infected - Recovered) for infectious diseases to represent the evolution of the virus COVID-19 over time in the population. To optimize the best sequences of actions governments can take, we propose a methodology with two approaches, one based on Deep Q-Learning and another one based on Genetic Algorithms. The sequences of actions (confinement, self-isolation, two-meter distance or not taking restrictions) are evaluated according to a reward system focused on meeting two objectives: firstly, getting few people infected so that hospitals are not overwhelmed with critical patients, and secondly, avoiding taking drastic measures for too long which can potentially cause serious damage to the economy. The conducted experiments prove that our methodology is a valid tool to discover actions governments can take to reduce the negative effects of a pandemic in both senses. We also prove that the approach based on Deep Q-Learning overcomes the one based on Genetic Algorithms for optimizing the sequences of actions.
Machine Learning,Physics and Society
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how the government can take the best actions to protect public health while minimizing the negative impact on the economy in the face of an epidemic such as COVID - 19. Specifically, the paper explores the optimal path between two extreme strategies: one is strict restrictive measures, which may seriously damage the economy; the other is relatively loose restrictions, which may lead to a large number of people facing health risks. In order to find the balance between the two, the author proposes methods based on Deep Q - Learning and Genetic Algorithms to optimize a series of action sequences that the government can take. These action sequences aim to achieve two goals: one is to minimize the number of infected people and avoid overloading the hospital system; the other is to minimize the damage to the economy caused by the long - term implementation of strict measures. By creating three different scenarios and defining a reward system to give priority to action sequences that meet the above - mentioned goals, the paper verifies the effectiveness of the proposed method and proves that the method based on Deep Q - Learning is superior to the method based on Genetic Algorithms in optimizing action sequences.