Potential Based Optimization Algorithm Of Constrained Markov Decision Processes

Li Yanjie,Yin Baoqun,Xi Hongsheng
2005-01-01
Abstract:This paper studies the optimization problem for a class of constrained Markov decision processes when the criterion is average reward functional with the average cost constraints. By using the property that the potential of Markov processes can be estimated by simulating a single sample path, we proposed a online optimization algorithm based on the Lagrange method and proved its convergence under some conditions.
What problem does this paper attempt to address?