A Discount Vanishing Approximation for Markov Decision Processes with Risk Sensitivity

Tanhao Huang, Xiaoyang Lu, Jinwen Chen
DOI: https://doi.org/10.1007/s10883-024-09691-3
IF: 1.27
2024-04-15
Journal of Dynamical and Control Systems
Abstract: In this paper, optimal control of risk-sensitive Markov decision processes with countable states is studied. The state space is not assumed to be communicating. The focus is on the dependence of the optimal values on the transition characteristics: communication, transience, or absorption. A vanishing discount approach is used to introduce a partition of the state space, and a certain transformation of the optimal values under discount is shown to converge to the optimal values under risk sensitivity as the discount factor tends to vanish. The partition of the state space turns out to be closely related to the characteristics of state communication, but places more weight on the values under discount.
mathematics, applied; automation & control systems