Analysis for Some Properties of Discrete Time Markov Decision Processes

QY Hu,WI Yue
DOI: https://doi.org/10.1080/02331930310001611493
IF: 2.2
2003-01-01
Optimization
Abstract:This paper investigates properties of the optimality equation and optimal policies in discrete time Markov decision processes with expected discounted total rewards under weak conditions that the model is well defined and the optimality equation is true. The optimal value function is characterized as a solution of the optimality equation and the structure of optimal policies is also given.
What problem does this paper attempt to address?