Markov Decision Processes with Variance Minimization: A New Condition and Approach

Quanxin Zhu,Xianping Guo
DOI: https://doi.org/10.1080/07362990701282807
2007-01-01
Stochastic Analysis and Applications
Abstract:This article deals with the limiting average variance criterion for discrete-time Markov decision processes in Borel spaces. The costs may have neither upper nor lower bounds. We propose another set of conditions under which we prove the existence of a variance minimal policy in the class of average expected cost optimal stationary policies. Our conditions are weaker than those in the previous literature. Moreover, some sufficient conditions for the existence of a variance minimal policy are imposed on the primitive data of the model. In particular, the stochastic monotonicity condition in this paper has been. rst used to study the limiting average variance criterion. Also, the optimality inequality approach provided here is different from the "optimality equation approach" widely used in the previous literature. Finally, we use a controlled queueing system to illustrate our results.
What problem does this paper attempt to address?