GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation

Haoqiang Chen,Yadong Liu,Zongtan Zhou,Dewen Hu,Ming Zhang
DOI: https://doi.org/10.1007/s10489-020-01755-8
IF: 5.3
2020-01-01
Applied Intelligence
Abstract:Multi-agent reinforcement learning (MARL) is an important way to realize multi-agent cooperation. But there are still many challenges, including the scalability and the uncertainty of the environment that limit its application. In this paper, we explored to solve those problems through the graph network and the attention mechanism. Finally we succeeded in extending the existing algorithm and obtaining a new algorithm called GAMA. Specifically through the graph network, we made the environment information shared among agents. Meanwhile, the unimportant information was filtered out with the help of the attention mechanism, which helped to improve the communication efficiency. As a result, GAMA obtained the highest mean episode rewards compared to the baselines as well as excellent scalability. The reason why we choose the graph network is that understanding the relationship among agents plays a key role in solving multi-agent problems. And the graph network is very suitable for relational induction bias. Through the integration with the attention mechanism, it was shown that agents could figure out their relationship and focus on the influential environment factors in our experiment.
What problem does this paper attempt to address?