Reinforcement Learning in Healthcare: A Survey

Chao Yu, Jiming Liu, Shamim Nemati
DOI: https://doi.org/10.48550/arXiv.1908.08796
2020-04-24
Abstract: As a subfield of machine learning, reinforcement learning (RL) aims at empowering one's capabilities in behavioural decision making by using interaction experience with the world and an evaluative feedback. Unlike traditional supervised learning methods that usually rely on one-shot, exhaustive and supervised reward signals, RL tackles with sequential decision making problems with sampled, evaluative and delayed feedback simultaneously. Such distinctive features make RL technique a suitable candidate for developing powerful solutions in a variety of healthcare domains, where diagnosing decisions or treatment regimes are usually characterized by a prolonged and sequential procedure. This survey discusses the broad applications of RL techniques in healthcare domains, in order to provide the research community with systematic understanding of theoretical foundations, enabling methods and techniques, existing challenges, and new insights of this emerging paradigm. By first briefly examining theoretical foundations and key techniques in RL research from efficient and representational directions, we then provide an overview of RL applications in healthcare domains ranging from dynamic treatment regimes in chronic diseases and critical care, automated medical diagnosis from both unstructured and structured clinical data, as well as many other control or scheduling domains that have infiltrated many aspects of a healthcare system. Finally, we summarize the challenges and open issues in current research, and point out some potential solutions and directions for future research.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the challenges and opportunities of applying reinforcement learning (RL) in healthcare. Specifically, it explores how RL can solve healthcare decision-making problems, especially those that require decisions over long time horizons, such as dynamic treatment regimes for chronic diseases, treatment decisions in critical care, and automated medical diagnosis. It also discusses the open challenges and future research directions for RL in these areas, with the aim of giving the research community a systematic understanding of theoretical foundations, enabling methods and techniques, existing challenges, and new insights. By reviewing RL's basic theoretical framework, key techniques, and application examples, the paper shows the advantages of RL for problems with delayed feedback and sequential decision making. These properties make RL a suitable candidate for building decision-making strategies in healthcare, where decisions often unfold over a prolonged, sequential procedure. The paper also emphasizes that RL can learn an optimal policy from past experience alone, without an accurate mathematical model of the underlying biological system, which matters especially for the human body, a system that is complex and difficult to model.
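To illustrate learning from past experience with delayed feedback and no model, here is a minimal sketch of tabular Q-learning over logged transitions. It is not taken from the survey: the five-state chain, the two actions, the reward of 1 paid only on reaching the final state, and every name and hyperparameter below are assumptions chosen for illustration, and the toy has no clinical meaning.

```python
import random

# Hypothetical toy problem: states 0..4, where state 4 is a terminal "goal" state.
N_STATES = 5
ACTIONS = (0, 1)          # 0 = step back, 1 = step forward (illustrative only)
GAMMA, ALPHA = 0.9, 0.5   # discount factor and learning rate (assumed values)


def step(state, action):
    """Toy environment used only to generate experience; the learner never reads it."""
    next_state = min(state + 1, N_STATES - 1) if action == 1 else max(state - 1, 0)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # delayed feedback: reward arrives only at the end
    return next_state, reward, done


def collect_experience(n_episodes, max_steps, rng):
    """Log (s, a, r, s', done) tuples under a uniformly random behaviour policy."""
    log = []
    for _ in range(n_episodes):
        state = 0
        for _ in range(max_steps):
            action = rng.choice(ACTIONS)
            next_state, reward, done = step(state, action)
            log.append((state, action, reward, next_state, done))
            if done:
                break
            state = next_state
    return log


def q_learning(log, sweeps):
    """Estimate action values from logged transitions alone (model-free)."""
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(sweeps):
        for s, a, r, s2, done in log:
            # Bootstrapped target: immediate reward plus discounted best next value.
            target = r if done else r + GAMMA * max(q[s2])
            q[s][a] += ALPHA * (target - q[s][a])
    return q


rng = random.Random(0)
experience = collect_experience(n_episodes=200, max_steps=50, rng=rng)
q_table = q_learning(experience, sweeps=50)

# Greedy action in each non-terminal state.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(greedy_policy)
```

The learner only ever sees the logged tuples, so the transition rule in `step` plays the role of the unknown system. The reward is observed only on the final transition, and the bootstrapped target is what propagates it back to earlier states.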