Duality Between Large Deviation Control and Risk-Sensitive Control for Markov Decision Processes.

Yanan Dai,Jinwen Chen
DOI: https://doi.org/10.1016/j.sysconle.2023.105490
IF: 2.742
2023-01-01
Systems & Control Letters
Abstract:This paper studies the dual relation between large deviations control of maximizing “up-side chance” probability and risk-sensitive control for Markov Decision Processes. To derive the desired duality, we apply a non-linear extension of the Kreĭn–Rutman Theorem to characterize the optimal risk-sensitive value and prove that an optimal policy exists which is stationary and deterministic. Benchmarks in the “up-side chance” probability which make the duality hold are characterized. It is proved that the optimal policy for the “up-side chance” probability can be approximated by the optimal one for the risk-sensitive control. The right-hand derivative of the optimal risk-sensitive value function plays an important role, and a variational formula for the optimal risk-sensitive value is applied to characterize it. Some essential differences between these two types of optimal control problems are presented.
What problem does this paper attempt to address?