Identifying Decision Points for Safe and Interpretable Reinforcement Learning in Hypotension Treatment

Kristine Zhang, Yuanheng Wang, Jianzhun Du, Brian Chu, Leo Anthony Celi, Ryan Kindle, Finale Doshi-Velez
DOI: https://doi.org/10.48550/arXiv.2101.03309
IF: 5.414
2021-01-09
Machine Learning
Abstract: Many batch RL health applications first discretize time into fixed intervals. However, this discretization both loses resolution and forces a policy computation at each (potentially fine) interval. In this work, we develop a novel framework to compress continuous trajectories into a few interpretable decision points: places where the batch data support multiple alternatives. We apply our approach to create recommendations from a dataset covering a cohort of hypotensive patients. Our reduced state space results in faster planning and allows easy inspection by a clinical expert.