HiSA: Facilitating Efficient Multi-Agent Coordination and Cooperation by Hierarchical Policy with Shared Attention

Zixuan Chen, Zhirui Zhu, Guang Yang, Yang Gao
DOI: https://doi.org/10.1007/978-3-031-20868-3_6
2022-01-01
Abstract: When many partially observable agents improve their policies through decentralized training, the performance of the multi-agent system suffers from severe non-cooperation and non-stationarity. Most previous works introduce communication into training and optimization to facilitate cooperation between agents, but the noisy messages that communication brings can cause misunderstandings in complex tasks and can even lead to catastrophic failure of long-term training. To alleviate this dilemma, we propose the Hierarchical Structure with Shared Attention Mechanism (HiSA), a novel communication-based approach that improves the efficiency and robustness of coordination and cooperation in multi-agent reinforcement learning (MARL). HiSA not only resists the negative impact of noise in communication but also uses attention as a communication tool to build efficient cooperative hierarchical policies. Experimental results demonstrate that HiSA significantly outperforms existing communication-based MARL methods, especially in long-term complex cooperation scenarios with isomorphic agents.