A Novel Experience-Based Exploration Method for Q-Learning.

Bohong Yang,Hong Lu,Baogen Li,Zheng Zhang,Wenqiang Zhang
DOI: https://doi.org/10.1007/978-981-13-2203-7_17
2018-01-01
Abstract:Reinforcement learning algorithms are used to deal with a lot of sequential problems, such as playing games, mechanical control, and so on. Q-Learning is a model-free reinforcement learning method. In traditional Q-learning algorithms, the agent stops immediately after it has reached the goal. We propose in this paper a new method—Experience-based Exploration method—in order to sample more efficient state-action pairs for Q-learning updating. In the Experience-based Exploration method, the agent does not stop and continues to search the states with high bellman-error inversely. In this setting, the agent will set the terminal state as a new start point, and generate pairs of action and state which could be useful. The efficacy of the method is proved analytically. And the experimental results verify the hypothesis on Gridworld.
What problem does this paper attempt to address?