SMART: Sequential Multi-Agent Reinforcement Learning with Role Assignment Using Transformer

Yixing Lan,Hao Gao,Xin Xu,Qiang Fang,Yujun Zeng
DOI: https://doi.org/10.1109/tcds.2024.3504256
IF: 4.546
2024-01-01
IEEE Transactions on Cognitive and Developmental Systems
Abstract:Multi-agent reinforcement learning (MARL) has received increasing attention and been used to solve cooperative multi-agent decision-making and learning control tasks. However, the high complexity of the joint action space and the non-stationary learning process are two major problems that negatively impact on the sample efficiency and solution quality of MARL. To this end, this paper proposes a novel approach named Sequential Multi-Agent reinforcement learning with Role assignment using Transformer (SMART). By learning the effects of different actions on state transitions and rewards, SMART realizes the action abstraction of the original action space and the adaptive role cognitive modeling of multi-agent, which reduces the complexity of the multi-agent exploration and learning process. Meanwhile, SMART uses causal Transformer networks to update role assignment policy and action selection policy sequentially, alleviating the influence of non-stationary multi-agent policy learning. The convergence characteristic of SMART is theoretically analyzed. Extensive experiments on the challenging Google Football and StarCraft Multi-Agent Challenge are conducted, demonstrating that compared with mainstream MARL algorithms such as MAT and HAPPO, SMART achieves a new state-of-the-art performance. Meanwhile, the learned policies through SMART have good generalization ability when the number of agents changes.
What problem does this paper attempt to address?