A Multi-step Q Reinforcement Learning Algorithm

CHEN Sheng-Lei, WU Hui-Zhong, HAN Xiang-Lan, XIAO Liang
2006-01-01
Computer Science
Abstract: Q-learning is of great importance in reinforcement learning. This paper proposes the MQ algorithm, which has multi-step prediction capability, to compensate for the drawbacks of Q-learning and the Q(γ) algorithm. First, the MDP model is presented. Then, based on analyses of Q-learning and the Q(γ) algorithm, the MQ algorithm is proposed. The algorithm's update strategy and the rule for determining k are also analyzed. Its effectiveness is verified through cliff-walking simulation experiments. Theoretical analysis and the experiments indicate that the algorithm achieves better prediction capability with lower computational complexity, so it balances update speed and complexity effectively.
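
As a rough illustration of what multi-step prediction can mean for Q-learning, the sketch below contrasts the standard one-step Q-learning target with a generic truncated k-step target. It is not the authors' MQ update, which the abstract does not specify. The function names and the parameters `gamma` (discount factor) and `alpha` (learning rate) are assumptions chosen for illustration.

```python
from typing import Sequence


def one_step_target(reward: float, next_q: Sequence[float], gamma: float) -> float:
    """Standard Q-learning target: r + gamma * max_a Q(s', a)."""
    return reward + gamma * max(next_q)


def k_step_target(rewards: Sequence[float], bootstrap_q: Sequence[float], gamma: float) -> float:
    """Generic truncated k-step target (illustrative, not the paper's MQ rule).

    rewards     -- the k rewards observed along the trajectory, in order
    bootstrap_q -- Q-values of all actions in the state reached after k steps
    """
    k = len(rewards)
    # Discounted sum of the k observed rewards.
    discounted = sum((gamma ** i) * r for i, r in enumerate(rewards))
    # Bootstrap from the greedy value of the state reached after k steps.
    return discounted + (gamma ** k) * max(bootstrap_q)


def update(q_value: float, target: float, alpha: float) -> float:
    """Move the current estimate a fraction alpha toward the target."""
    return q_value + alpha * (target - q_value)
```

With a single reward, `k_step_target` reduces to `one_step_target`. A larger k lets reward information reach earlier states in fewer updates but requires storing more of the trajectory, which is one way to read the trade-off between update speed and complexity that the abstract describes.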