Risk-sensitive Continuous-Time Markov Decision Processes with Unbounded Rates and Borel Spaces

Xianping Guo,Junyu Zhang
DOI: https://doi.org/10.1007/s10626-019-00292-y
2019-01-01
Discrete Event Dynamic Systems
Abstract:This paper considers the finite-horizon risk-sensitive optimality for continuous-time Markov decision processes, and focuses on the more general case that the transition rates are unbounded, cost/reward rates are allowed to be unbounded from below and from above, the policies can be history-dependent, and the state and action spaces are Borel ones. Under mild conditions imposed on the decision process's primitive data, we establish the existence of a solution to the corresponding optimality equation (OE) by a so called approximation technique. Then, using the OE and the extension of Dynkin's formula developed here, we prove the existence of an optimal Markov policy, and verify that the value function is the unique solution to the OE. Finally, we give an example to illustrate the difference between our conditions and those in the previous literature.
What problem does this paper attempt to address?