Another Set of Verifiable Conditions for Average Markov Decision Processes with Borel Spaces

Xiaolong Zou, Xianping Guo
DOI: https://doi.org/10.14736/kyb-2015-2-0276
2015-01-01
Kybernetika
Abstract: In this paper we give a new set of verifiable conditions for the existence of average optimal stationary policies in discrete-time Markov decision processes with Borel spaces and unbounded reward/cost functions. More precisely, we provide another set of conditions, which consists only of a Lyapunov-type condition and the common continuity-compactness conditions. These conditions are imposed on the primitive data of the Markov decision process model and are thus easy to verify. We also give two examples for which all our conditions are satisfied, but some of the conditions in the related literature fail to hold.