Markov Decision Processes with State-Dependent Discount Factors and Unbounded Rewards/costs.

Qingda Wei,Xianping Guo
DOI: https://doi.org/10.1016/j.orl.2011.06.014
IF: 1.151
2011-01-01
Operations Research Letters
Abstract:This paper deals with discrete-time Markov decision processes with state-dependent discount factors and unbounded rewards/costs. Under general conditions, we develop an iteration algorithm for computing the optimal value function, and also prove the existence of optimal stationary policies. Furthermore, we illustrate our results with a cash-balance model.
What problem does this paper attempt to address?