New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces.

Qingda Wei,Xianping Guo
DOI: https://doi.org/10.1007/s10957-012-9986-8
2012-01-01
Journal of Optimization Theory and Applications
Abstract:This paper deals with semi-Markov decision processes under the average expected criterion. The state and action spaces are Borel spaces, and the cost/reward function is allowed to be unbounded from above and from below . We give another set of conditions, under which the existence of an optimal (deterministic) stationary policy is proven by a new technique of two average optimality inequalities. Our conditions are slightly weaker than those in the existing literature, and some new sufficient conditions for the verifications of our assumptions are imposed on the primitive data of the model. Finally, we illustrate our results with three examples.
What problem does this paper attempt to address?