A research of an improved R-Learning used in the training of RoboCup

Li Jin, Liu Quan, Yang Xudong, Yang Kai, Weng Dongliang
2012-01-01
Abstract: Reinforcement learning has become a central paradigm for solving learning-control problems in artificial intelligence. In many problems, for example those in which the optimal behavior is a limit cycle, it is more natural and computationally advantageous to formulate the task so that the controller's objective is to maximize the average payoff received per time step. R-Learning is a reinforcement learning algorithm for this average-payoff formulation, but it has shortcomings, such as slow convergence and sensitivity to its parameters. To address the slow convergence, an improved R-Learning algorithm is proposed. The algorithm uses a BP (back-propagation) neural network as the function approximator to generalize over the state space. Experimental results on RoboCup show that the proposed algorithm converges faster and is able to generalize.
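For orientation, the average-payoff objective mentioned in the abstract can be illustrated with a minimal sketch of standard tabular R-Learning. This is not the improved algorithm proposed in the paper, which replaces the table with a BP network whose details are not given in the abstract. The environment `toy_step`, the function name `r_learning`, and all parameter values are assumptions chosen only for illustration.

```python
import random


def toy_step(state, action):
    """Hypothetical 2-state task whose best behavior is the cycle 0 -> 1 -> 0."""
    if state == 0:
        # action 0: stay in state 0 for a small reward; action 1: move to state 1
        return (0, 0.5) if action == 0 else (1, 0.0)
    # action 0: return to state 0 for a large reward; action 1: stay for nothing
    return (0, 2.0) if action == 0 else (1, 0.0)


def r_learning(step, n_states, n_actions, n_steps=20000,
               beta=0.1, alpha=0.01, epsilon=0.1, seed=0):
    """Tabular R-Learning sketch; `step(s, a)` returns (next_state, reward)."""
    rng = random.Random(seed)
    R = [[0.0] * n_actions for _ in range(n_states)]  # relative action values
    rho = 0.0                                         # average-payoff estimate
    s = 0
    for _ in range(n_steps):
        greedy = max(range(n_actions), key=lambda a: R[s][a])
        # epsilon-greedy exploration (an assumed exploration scheme)
        a = rng.randrange(n_actions) if rng.random() < epsilon else greedy
        s_next, r = step(s, a)
        # the error is measured relative to the average payoff, with no discount
        delta = r - rho + max(R[s_next]) - R[s][a]
        R[s][a] += beta * delta
        # rho is adjusted only when the action taken was the greedy one
        if a == greedy:
            rho += alpha * delta
        s = s_next
    return R, rho
```

Calling `r_learning(toy_step, 2, 2)` returns the learned table and the average-payoff estimate. On this toy task the cycle 0 -> 1 -> 0 pays 1 per step on average, against 0.5 per step for staying in state 0, so the greedy policy is expected to settle on the cycle. The two step sizes `beta` and `alpha` are the kind of parameters to which, according to the abstract, R-Learning is sensitive.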