Hybrid Attention-Oriented Experience Replay for Deep Reinforcement Learning and Its Application to a Multi-Robot Cooperative Hunting Problem.

Lingli Yu,Shuxin Huo,Zhengjiu Wang,Keyi Li
DOI: https://doi.org/10.1016/j.neucom.2022.12.020
IF: 6
2023-01-01
Neurocomputing
Abstract:Multiple robots complete a cooperative hunting task by obtaining environmental information and autonomously learning hunting decision-making strategies. However, with the increase in the number of environment participants, it becomes difficult for robots to process a large amount of environmental information. Thus, a multi-robot cooperative hunting decision-making method called hybrid attention-oriented experience replay in multi-agent deep deterministic policy gradient (HAER-MADDPG) is proposed. First, a hybrid attention module is designed to pay greater attention to key information in a large amount of environmental information by integrating it with the multi-agent deep deterministic policy gradient (MADDPG). The method then combines hybrid attention and prioritized experience replay to improve the utilization of experience samples. Finally, the proposed algorithm is tested through a predator–prey game. The results show that the effectiveness, convergence speed, and scalability of the proposed algorithm are better than those of the baseline algorithms. In addition, HAER-MADDPG is effectively applied to a hunting task with real robots.
What problem does this paper attempt to address?