MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

Lianmin Zheng,Jiacheng Yang,Han Cai,Weinan Zhang,Jun Wang,Yong Yu
DOI: https://doi.org/10.48550/arXiv.1712.00600
2017-12-02
Abstract:We introduce MAgent, a platform to support research and development of many-agent reinforcement learning. Unlike previous research platforms on single or multi-agent reinforcement learning, MAgent focuses on supporting the tasks and the applications that require hundreds to millions of agents. Within the interactions among a population of agents, it enables not only the study of learning algorithms for agents' optimal polices, but more importantly, the observation and understanding of individual agent's behaviors and social phenomena emerging from the AI society, including communication languages, leaderships, altruism. MAgent is highly scalable and can host up to one million agents on a single GPU server. MAgent also provides flexible configurations for AI researchers to design their customized environments and agents. In this demo, we present three environments designed on MAgent and show emerged collective intelligence by learning from scratch.
Machine Learning,Artificial Intelligence,Multiagent Systems
What problem does this paper attempt to address?
This paper introduces a platform named MAgent, which aims to support the research and development of large - scale multi - agent reinforcement learning. Different from previous research platforms that focus on single - agent or a small number of agents in reinforcement learning, MAgent pays special attention to tasks and applications that require hundreds, thousands or even millions of agents. Through the interactions between agents, MAgent can not only study the learning algorithms of agents' optimal strategies, but more importantly, it can also observe and understand individual behaviors and social phenomena emerging from the AI society, such as communication languages, leadership and altruism, etc. Specifically, the problems that the paper attempts to solve include: 1. **Challenges of large - scale multi - agent systems**: Existing experimental platforms such as ALE, OpenAI Gym/Universe, Malmo, ELF and SC2LE have begun to attempt to cover multi - agent scenarios, but their designs can basically only handle situations with no more than dozens of agents. Therefore, there is a need for a platform specifically for large - scale multi - agent reinforcement learning, which is crucial for artificial collective intelligence (ACI). 2. **Limitations of existing algorithms**: Most state - of - the - art multi - agent reinforcement learning algorithms are also limited by scale and can usually only handle scenarios with dozens of agents. This poses a huge challenge to the research community. 3. **Promoting the research of artificial collective intelligence**: By providing a platform that can support hundreds, thousands or even millions of agents, MAgent aims to promote the research of artificial collective intelligence, including the exploration of phenomena such as cooperation, competition and social dynamics of agents in complex environments. The MAgent platform achieves high scalability through the ideas of network sharing and ID embedding, and can support up to one million agents on a single GPU server. In addition, MAgent also provides a flexible environment/agent configuration and reward description language to support researchers in designing customized environments and agents. The platform also includes a simple and visually appealing renderer for interactively displaying the states of the environment and agents. Users can explore the grid world by adjusting the window range, zooming, etc., and discover intelligent patterns and the diversity of reinforcement - learning agents.