Abstract:This paper studies the minimizing risk problems in Markov decision processes with countable state space and reward set. The objective is to find a policy which minimizes the probability (risk) that the total discounted rewards do not exceed a specified value (target). In this sort of model, the decision made by the decision maker depends not only on system's states, but also on his target values. By introducing the decision-maker's state, we formulate a framework for minimizing risk models. The policies discussed depend on target values and the rewards may be arbitrary real numbers. For the finite horizon model, the main results obtained are: (i) The optimal value functions are distribution functions of the target, (ii) there exists an optimal deterministic Markov policy, and (iii) a policy is optimal if and only if at each realizable state it always takes optimal action. In addition, we obtain a sufficient condition and a necessary condition for the existence of finite horizon optimal policy independent of targets and we give an algorithm computing finite horizon optimal policies and optimal value functions. For an infinite horizon model, we establish the optimality equation and we obtain the structure property of optimal policy. We prove that the optimal value function is a distribution function of target and we present a new approximation formula which is the generalization of the nonnegative rewards cases. An example which illustrates the mistakes of previous literature shows that the existence of optimal policy has not been proved really. In this paper, we give an existence condition, which is a sufficient and necessary condition for the existence of an infinite horizon optimal policy independent of targets, and we point out that whether there exists an optimal policy remains an open problem in the general case.

Risk-Sensitive Optimal Control in Communicating Average Markov Decision Chains

Solution to the risk-sensitive average cost optimality equation in a class of Markov decision processes with finite state space

Characterization of the optimal average cost in Markov decision chains driven by a risk-seeking controller

Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion

On Average Risk-sensitive Markov Control Processes

Risk-Sensitive Average Markov Decision Processes in General Spaces

Risk-sensitive Markov control processes

Local Poisson Equations Associated with Discrete-Time Markov Control Processes

Risk-Sensitivity Vanishing Limit for Controlled Markov Processes

A Discount Vanishing Approximation for Markov Decision Processes with Risk Sensitivity

Acceptable risks and related decision problems with multiple risk-averse agents

Average Continuous Control of Piecewise Deterministic Markov Processes

Markov Decision Processes under Risk Sensitivity: A Discount Vanishing Approach

Risk-sensitive control of continuous time Markov chains

Risk Aware Minimum Principle for Optimal Control of Stochastic Differential Equations

Average cost optimal control under weak ergodicity hypotheses: Relative value iterations

Risk-Averse Decision Making Under Uncertainty

Discrete-time risk-aware optimal switching with non-adapted costs

Duality Between Large Deviation Control and Risk-Sensitive Control for Markov Decision Processes.

Risk-Averse Control of Undiscounted Transient Markov Models

Minimizing Risk Models in Markov Decision Processes with Policies Depending on Target Values