Constrained Continuous-Time Markov Decision Processes with Average Criteria

Lanlan Zhang, Xianping Guo
DOI: https://doi.org/10.1007/s00186-007-0154-0
2007-01-01
Mathematical Methods of Operations Research
Abstract: In this paper, we study constrained continuous-time Markov decision processes with a denumerable state space and unbounded reward/cost and transition rates. The criterion to be maximized is the expected average reward, and a constraint is imposed on an expected average cost. We give suitable conditions that ensure the existence of a constrained-optimal policy. Moreover, we show that the constrained-optimal policy randomizes between two stationary policies differing in at most one state. Finally, we use a controlled queueing system to illustrate our conditions.
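For orientation, the constrained problem described in the abstract can be written out roughly as follows. This is a minimal sketch in assumed notation: the symbols x_t, a_t, r, c, \beta, \Pi, f_1, f_2 and p are illustrative and not taken from the paper, and the liminf/limsup convention is a common choice that may differ from the authors'.

```latex
% Assumed notation (not from the paper): x_t is the state and a_t the action
% at time t, r the reward rate, c the cost rate, \beta the constraint level,
% \Pi the class of admissible policies, i the initial state.
\begin{align*}
V_r(i,\pi) &= \liminf_{T\to\infty} \frac{1}{T}\,
  \mathbb{E}_i^{\pi}\!\left[\int_0^T r(x_t,a_t)\,dt\right], \\
V_c(i,\pi) &= \limsup_{T\to\infty} \frac{1}{T}\,
  \mathbb{E}_i^{\pi}\!\left[\int_0^T c(x_t,a_t)\,dt\right], \\
&\text{maximize } V_r(i,\pi) \text{ over } \pi\in\Pi
  \quad\text{subject to}\quad V_c(i,\pi)\le\beta .
\end{align*}
% One possible reading of the structural result: the optimal policy mixes two
% deterministic stationary policies f_1, f_2 with some weight p, where
% f_1(i) = f_2(i) for every state i except at most one.
\[
\pi^{*}(\cdot \mid i) = p\,\delta_{f_1(i)} + (1-p)\,\delta_{f_2(i)},
\qquad p\in[0,1].
\]
```

Under this reading, the mixture is a genuine randomization in at most one state, since the two point masses coincide wherever f_1 and f_2 agree.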