Performance Optimization of Semi-Markov Decision Processes with Discounted-cost Criteria.

Baoqun Yin,Yanjie Li,Yaping Zhou,Hongsheng Xi
DOI: https://doi.org/10.3166/ejc.14.213-222
IF: 2.649
2008-01-01
European Journal of Control
Abstract:We discuss the problems of discounted-cost performance optimization for a class of semi-Markov decision processes (SMDPs). We define a matrix which can be used as the infinitesimal generator of a Markov process. The discounted Poisson equation is proposed for an SMDP by using this matrix, from which the α-potential is defined. The optimality equation satisfied by the optimal stationary policy is given and the relation between discounted model and average model is discussed. Two iteration algorithms to find e-optimal policies are proposed and the proofs of convergence of these two algorithms are given. A numerical example is provided to illustrate the application of the algorithms.
What problem does this paper attempt to address?