REINFORCEMENT LEARNING-BASED HIGH-LEVEL BALL-STEALING STRATEGY FOR ROBOCUP KEEPAWAY

Li Xuejun,Chen Shiyang,Zhang Yiwen,Li Longshu
DOI: https://doi.org/10.3969/j.issn.1000-386x.2015.10.022
2015-01-01
Abstract:In Robocop Keepaway training task,traditional hand-coded ball-stealing strategies are very subjective and can't adapt well to training situation changes,this leads to the takers taking longer time to complete the tasks and having lower stealing success rate.To solve this problem,we apply the reinforcement learning to high-level action decision-making for stealing takers in Keepaway task.By analysing the characteristic of stealing task,we reasonably design the state space,action space and reward value of the reinforcement learning model of stealing takers,and state a corresponding reinforcement learning algorithm for stealing takers.Experimental results show that after the rein-forced learning the stealing takers can make more objective decisions according to game's situation,the effect of the decisions made are re-markably better than the hand-coded strategies.For typical 4v3 and 5v4 scale Keepaway tasks,with the learned strategy to making decision, the stealing takers shorten 7.1% of the time at least for completing ball -stealing task,and the stealing success rate increases no less than 15.0% as well.
What problem does this paper attempt to address?