Attentional Opponent Modelling for Multi-agent Cooperation

Siyang Tan, Binqiang Chen
DOI: https://doi.org/10.1109/ijcnn54540.2023.10191629
2023-01-01
Abstract: Opponent modelling (OM) enables agents to reason about the behaviors of others, and hence to act accordingly and interact effectively in multi-agent reinforcement learning (MARL). Existing OM approaches commonly assume that opponents' trajectories are available, which requires continual message exchange during execution in partially observable environments. As a result, they are not cost-effective because of the communication overhead, and they are inapplicable to many practical tasks with communication constraints. To address this problem, we propose attentional opponent modelling (ATOM), which infers the beliefs of neighboring entities using only local observations. ATOM requires no communication during execution and can be easily combined with existing MARL methods. As examples, we propose ATOM Actor-Critic and ATOM Q-learning architectures to facilitate multi-agent cooperation. Extensive experiments on several challenging tasks (Cooperative Navigation, Predator-Prey, and the StarCraft Multi-Agent Challenge) show that our methods outperform benchmark OM approaches.