Research on Actor-Critic Reinforcement Learning in RoboCup

He Guo, Tianying Liu, Yuxin Wang, Feng Chen, Jianming Fan
DOI: https://doi.org/10.1109/WCICA.2006.1713783
2006-01-01
Abstract: The Actor-Critic method combines the fast convergence of value-based methods (the Critic) with the directed search of policy-gradient methods (the Actor), which makes it suitable for problems with large state spaces. In this paper, the Actor-Critic method with tile-coding linear function approximation is analysed and applied to a RoboCup simulation subtask named "Soccer Keepaway". Experiments on Soccer Keepaway show that the policy learned by the Actor-Critic method outperforms both the policies learned by value-based Sarsa(lambda) and the benchmark policies.
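To make the combination described in the abstract concrete, the following is a minimal sketch of a one-step actor-critic learner with tile-coding linear function approximation. It is not the authors' implementation and does not model Soccer Keepaway: the environment is a hypothetical one-dimensional corridor (positions 0 to 10, reward 1 on reaching position 10), and the tile-coding resolution, step sizes, discount factor, and all function names are assumptions chosen for illustration. Eligibility traces are omitted for brevity.

```python
import math
import random

N_TILINGS, TILES = 4, 5                 # hypothetical tile-coding resolution
N_FEATURES = N_TILINGS * (TILES + 1)    # one extra tile per tiling covers the offset
N_ACTIONS = 2                           # 0 = step left, 1 = step right


def active_tiles(x):
    """Return one active feature index per tiling for a state x in [0, 1)."""
    feats = []
    for t in range(N_TILINGS):
        offset = t / (N_TILINGS * TILES)   # each tiling is shifted slightly
        idx = int((x + offset) * TILES)    # tile hit within this tiling
        feats.append(t * (TILES + 1) + idx)
    return feats


def policy(theta, feats):
    """Actor: softmax over linear action preferences."""
    prefs = [sum(theta[a][i] for i in feats) for a in range(N_ACTIONS)]
    m = max(prefs)                         # subtract max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    z = sum(exps)
    return [e / z for e in exps]


def train(episodes=1000, alpha_w=0.1, alpha_theta=0.1, gamma=0.9, seed=0):
    """Run one-step actor-critic on the toy corridor; return (w, theta)."""
    rng = random.Random(seed)
    w = [0.0] * N_FEATURES                                   # critic weights
    theta = [[0.0] * N_FEATURES for _ in range(N_ACTIONS)]   # actor weights
    for _ in range(episodes):
        pos = 5                                              # start in the middle
        for _step in range(200):                             # cap episode length
            feats = active_tiles(pos / 10)
            probs = policy(theta, feats)
            a = 0 if rng.random() < probs[0] else 1
            pos = max(0, pos - 1) if a == 0 else pos + 1
            done = pos == 10
            reward = 1.0 if done else 0.0
            # Critic: TD error from the linear state-value estimate.
            v = sum(w[i] for i in feats)
            v_next = 0.0 if done else sum(w[i] for i in active_tiles(pos / 10))
            delta = reward + gamma * v_next - v
            for i in feats:
                w[i] += alpha_w / N_TILINGS * delta
                # Actor: move preferences along the softmax log-likelihood gradient.
                for b in range(N_ACTIONS):
                    grad = (1.0 if b == a else 0.0) - probs[b]
                    theta[b][i] += alpha_theta / N_TILINGS * delta * grad
            if done:
                break
    return w, theta


if __name__ == "__main__":
    w, theta = train()
    print("P(right | x=0.5):", policy(theta, active_tiles(0.5))[1])
```

In this sketch the critic's TD error serves as the learning signal for both components: it corrects the value estimate and scales the actor's policy update, which is the division of labour the abstract attributes to the Critic and the Actor.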