Reinforcement Learning: Stochastic Approximation Algorithms for Markov Decision Processes

Vikram Krishnamurthy
DOI: https://doi.org/10.48550/arXiv.1512.07669
2015-12-24
Abstract:This article presents a short and concise description of stochastic approximation algorithms in reinforcement learning of Markov decision processes. The algorithms can also be used as a suboptimal method for partially observed Markov decision processes.
Optimization and Control
What problem does this paper attempt to address?