Parameter Decision Making in Adaptive Markov Decision Process with Finite Planning Horizon

LI Jiang-hong,HAN Zheng-zhi
DOI: https://doi.org/10.3969/j.issn.0255-8297.2000.04.012
2000-01-01
Journal of Applied Sciences
Abstract:An algorithm is proposed for adaptive MDP with finite planning horizon by reason of the fact that all current algorithms only consider adaptive MDP with infinite planning horizon. Bayes principle is applied to learn an unknown system; and for every decision the probability that the actual decision equals the optimal decision is maximized. Simulation results demonstrate the validity of the new algorithm.
What problem does this paper attempt to address?