Multi-Step Truncated Q Learning Algorithm

Shenglei Chen, HuiZhong Wu, Xianglan Han, Liang Xiao
DOI: https://doi.org/10.1109/icmlc.2005.1526943
2005-01-01
Abstract: Q learning is of great importance in reinforcement learning. To compensate for the drawbacks of Q learning and the Q(lambda) algorithm, this paper proposes the MTQ algorithm. It uses information from the next k steps to update the current Q value, so it can take longer-term benefit into account while also reducing computational complexity. This achieves a good balance between update speed and computational complexity. Experiments demonstrate the effectiveness of the algorithm.
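The abstract does not state the MTQ update rule, so the following is only a minimal sketch of a generic k-step truncated Q update, not the authors' exact formulation. It assumes a tabular Q function stored in a dictionary, a target built from k observed rewards plus a discounted greedy bootstrap at the state reached after k steps, and hypothetical names (`k_step_target`, `update_q`, `alpha`, `gamma`) chosen purely for illustration.

```python
def k_step_target(rewards, bootstrap_value, gamma):
    """Truncated k-step return: sum_i gamma^i * r_i + gamma^k * bootstrap_value.

    `rewards` holds the k rewards observed after the current state-action pair.
    """
    target = bootstrap_value
    # Fold the rewards from the last step back to the first.
    for r in reversed(rewards):
        target = r + gamma * target
    return target


def update_q(q, state, action, rewards, state_after_k, actions,
             alpha=0.1, gamma=0.9):
    """Move Q(state, action) toward the k-step truncated target.

    Assumption: the bootstrap term is the greedy value max_a Q(state_after_k, a),
    and unseen entries of the table default to 0.0.
    """
    bootstrap = max(q.get((state_after_k, a), 0.0) for a in actions)
    target = k_step_target(rewards, bootstrap, gamma)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (target - old)
    return q[(state, action)]


if __name__ == "__main__":
    q_table = {}
    # Two observed rewards (k = 2), then bootstrap from state 2.
    new_value = update_q(q_table, state=0, action="a", rewards=[1.0, 1.0],
                         state_after_k=2, actions=["a", "b"],
                         alpha=0.5, gamma=0.5)
    print(new_value)
```

With k = 1 this sketch reduces to the ordinary one-step Q learning update; larger k lets each update see more of the future reward sequence while keeping the amount of work per update bounded by k.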