On Efficient Sampling for Reinforcement Learning with Multiple Constraints*

Qing-Shan Jia,Qi Guo
DOI: https://doi.org/10.1109/case59546.2024.10711482
2024-01-01
Abstract:Reinforcement learning has attracted growing interest in many decision making problems. A special feature in automation is the pervasive existence of multiple simulation-based constraints. Existing algorithms may behave inefficient when evaluating the feasibility of policies in this case. We consider this important problem in this work, and make the following contributions. First, we convert the constrained Q-learning problem to the maximization of the probability of correct selection with simulation budget constraint. Second, we provide an algorithm to solve this problem. Third, we show that this algorithm is asymptotically optimal. We hope this work might strengthen the connection between reinforcement learning and simulation-based optimization.
What problem does this paper attempt to address?