Limiting Average Criteria for Nonstationary Markov Decision Processes

XP Guo,P Shi
DOI: https://doi.org/10.1137/s1052623499355235
IF: 2.763
2001-01-01
SIAM Journal on Optimization
Abstract:This paper deals with the so-called limiting average criteria for nonstationary Markov decision processes with (possibly unbounded) rewards and Borel state space. A new set of conditions is provided, under which the existence of both a solution to the optimality equations and the limiting average $\epsilon (\geq 0)$-optimal Markov policies is derived. Also, a rolling horizon algorithm for computing limiting average $\epsilon (>0)$-optimal Markov policies is developed. Furthermore, the results in this paper are illustrated by several examples such as the water regulation problem.
What problem does this paper attempt to address?