The Average Variance Criterion for Nonstationary MDP with Borel State Space

GUO Xian Ping
DOI: https://doi.org/10.3321/j.issn:0583-1431.2001.02.020
2001-01-01
Abstract:In this paper, we consider the average variance criterion for nonstationary Markov decision processes (MDP) with Borel state space. First, from the optimality equations we prove the existence of optimal Markov policies under ergodic conditions.Secondly, by the theory on Markov processes and structuring a new model we also prove that there exists a Markov policy, which is optimal in an average expected criterion,minimizes the average variance in the class of optimal policies for average expected criterion. So we extend the main results obtained by Dynkin E. B. and Yushkevich A. A.and by Kurano M. etc.
What problem does this paper attempt to address?