Minimum Average Value-at-Risk for Finite Horizon Semi-Markov Decision Processes in Continuous Time.
Yonghui Huang, Xianping Guo
DOI: https://doi.org/10.1137/140976029
IF: 2.763
2016-01-01
SIAM Journal on Optimization
Abstract: This paper studies the average value-at-risk (AVaR) criterion for finite horizon semi-Markov decision processes (SMDPs) in continuous time. Via an alternative representation of AVaR, we reduce the problem of minimizing the AVaR of the finite horizon cost to two subproblems: the first is to minimize, over policies, the expected positive deviation of the finite horizon cost from a given level, which is itself a new and interesting problem in the finite horizon SMDP setting; the second is an ordinary problem of minimizing a function of a single variable. For the first subproblem, by extending the state space to include the cost level, we prove that the value function is a minimum solution to the optimality equation and that an optimal policy exists under suitable conditions. Furthermore, we show that the value function is the unique solution to the optimality equation in a metric space when one more condition is imposed, which plays a key role in the complexity analysis of the algorithm and in the policy improvement algorithm. Based on the solution of the first subproblem, the existence and computation of an AVaR optimal policy are established by solving the second subproblem. To facilitate practical implementation of our results, we derive a value iteration algorithm and a policy improvement algorithm for computing an AVaR optimal policy. We perform a complexity analysis of the value iteration algorithm and discuss Monte Carlo simulation as a method of minimizing AVaR for a finite horizon SMDP. To demonstrate our results, two examples are provided, one on a maintenance system and one on a cash-flow system.
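The two-subproblem reduction described in the abstract can be illustrated with a minimal sketch. It assumes the "alternative representation" is the standard Rockafellar-Uryasev form, AVaR_alpha(C) = min over s of { s + E[(C - s)^+] / (1 - alpha) }; the abstract does not state the formula, so this is an assumption. The sketch does not model an SMDP: each hypothetical policy is represented only by a list of sampled finite horizon costs, and the names `policy_costs`, `expected_positive_deviation`, and `min_avar` are illustrative, not taken from the paper.

```python
def expected_positive_deviation(costs, level):
    """Sample estimate of E[(C - level)^+] for one policy's sampled costs."""
    return sum(max(c - level, 0.0) for c in costs) / len(costs)


def min_avar(policy_costs, alpha):
    """Minimize the empirical AVaR over a finite set of hypothetical policies.

    Assumes the representation AVaR_alpha(C) = min_s { s + E[(C - s)^+] / (1 - alpha) }.
    Returns (avar_value, minimizing_level, policy_name).
    """
    # For empirical distributions the minimum over s is attained at a sample
    # point, so it suffices to search the observed cost values.
    levels = sorted({c for costs in policy_costs.values() for c in costs})
    best = None
    for s in levels:
        # Subproblem 1: minimize the expected positive deviation over policies.
        name, dev = min(
            ((n, expected_positive_deviation(cs, s)) for n, cs in policy_costs.items()),
            key=lambda pair: pair[1],
        )
        # Subproblem 2: one-dimensional minimization over the cost level s.
        value = s + dev / (1.0 - alpha)
        if best is None or value < best[0]:
            best = (value, s, name)
    return best


# Hypothetical sampled costs: "A" has the lower mean but a heavy tail,
# "B" has a higher but constant cost.
policy_costs = {"A": [0.0] * 9 + [20.0], "B": [4.0] * 10}
value, level, policy = min_avar(policy_costs, alpha=0.8)
print(value, level, policy)
```

In this toy data, policy "A" has the smaller expected cost, yet the AVaR criterion at level 0.8 selects policy "B", which is the kind of risk-sensitive trade-off the criterion is meant to capture.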