Risk-Sensitivity Vanishing Limit for Controlled Markov Processes

Yanan Dai,Jinwen Chen
DOI: https://doi.org/10.1007/s10883-023-09641-5
IF: 1.27
2023-01-01
Journal of Dynamical and Control Systems
Abstract:In this paper, we prove that the optimal risk-sensitive reward for Markov decision processes with compact state space and action space converges to the optimal average reward as the risk-sensitive factor tends to 0. In doing so, a variational formula for the optimal risk-sensitive reward is derived. An extension of the Kreĭn-Rutman Theorem to certain nonlinear operators is involved. Based on these, partially observable Markov decision processes are also investigated. A portfolio optimization problem is presented as an example of an application of the approach, in which a duality-relation between the maximization of risk-sensitive reward and the maximization of upside chance for out-performance over the optimal average reward is established.
What problem does this paper attempt to address?