Abstract: We introduce MAgent, a platform to support research and development of many-agent reinforcement learning. Unlike previous research platforms for single-agent or multi-agent reinforcement learning, MAgent focuses on tasks and applications that require hundreds to millions of agents. Through the interactions among a population of agents, it enables not only the study of learning algorithms for agents' optimal policies but, more importantly, the observation and understanding of individual agents' behaviors and of social phenomena emerging from the AI society, including communication languages, leadership, and altruism. MAgent is highly scalable and can host up to one million agents on a single GPU server. MAgent also provides flexible configurations for AI researchers to design their customized environments and agents. In this demo, we present three environments designed on MAgent and show the collective intelligence that emerges when agents learn from scratch.

What problem does this paper attempt to address?

This paper introduces MAgent, a platform that supports research and development of large-scale multi-agent reinforcement learning. Unlike previous research platforms, which target a single agent or a small number of agents, MAgent focuses on tasks and applications that require hundreds, thousands, or even millions of agents. Through the interactions among agents, MAgent enables not only the study of learning algorithms for agents' optimal policies but, more importantly, the observation and understanding of individual behaviors and of social phenomena emerging from the AI society, such as communication languages, leadership, and altruism. Specifically, the problems the paper attempts to address are:

1. **Challenges of large-scale multi-agent systems**: Existing experimental platforms such as ALE, OpenAI Gym/Universe, Malmo, ELF, and SC2LE have begun to cover multi-agent scenarios, but their designs can generally handle no more than dozens of agents. A platform built specifically for large-scale multi-agent reinforcement learning is therefore needed, and it is crucial for artificial collective intelligence (ACI).
2. **Limitations of existing algorithms**: Most state-of-the-art multi-agent reinforcement learning algorithms are also limited in scale and can usually handle only scenarios with dozens of agents. This poses a major challenge to the research community.
3. **Promoting research on artificial collective intelligence**: By supporting hundreds, thousands, or even millions of agents, MAgent aims to promote research on artificial collective intelligence, including the study of cooperation, competition, and social dynamics among agents in complex environments.

MAgent achieves its scalability through network sharing and ID embedding, and it can host up to one million agents on a single GPU server. It also provides a flexible environment/agent configuration and a reward description language so that researchers can design customized environments and agents. The platform includes a simple, visually appealing renderer that interactively displays the state of the environment and of the agents. Users can explore the grid world by moving and zooming the view window, and can observe the intelligent patterns and the diversity of behavior among the reinforcement-learning agents.
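The summary names network sharing and ID embedding but gives no implementation details, so the following is only a minimal NumPy sketch of the general idea, not MAgent's actual code or API. All names and sizes (`N_AGENTS`, `OBS_DIM`, `ID_DIM`, `id_embedding`, `act`) are hypothetical, and the binary ID encoding is one illustrative choice among many. The point it illustrates is that every agent uses the same weights, so the parameter count does not grow with the population, while the ID vector appended to each observation still lets the shared network distinguish agents.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes chosen for illustration only.
N_AGENTS = 10_000   # population size
OBS_DIM = 16        # length of each agent's local observation
ID_DIM = 14         # bits in the ID embedding (2**14 > N_AGENTS, so IDs stay unique)
HIDDEN = 32
N_ACTIONS = 5

# One set of weights shared by every agent ("network sharing").
W1 = rng.normal(0.0, 0.1, size=(OBS_DIM + ID_DIM, HIDDEN))
b1 = np.zeros(HIDDEN)
W2 = rng.normal(0.0, 0.1, size=(HIDDEN, N_ACTIONS))
b2 = np.zeros(N_ACTIONS)


def id_embedding(agent_ids, dim=ID_DIM):
    """Encode integer agent IDs as fixed binary vectors (an assumed encoding)."""
    bits = (agent_ids[:, None] >> np.arange(dim)) & 1
    return bits.astype(np.float64)


def act(observations, agent_ids):
    """Pick greedy actions for all agents with one batched forward pass."""
    x = np.concatenate([observations, id_embedding(agent_ids)], axis=1)
    h = np.maximum(x @ W1 + b1, 0.0)  # ReLU hidden layer
    q = h @ W2 + b2                   # one row of action values per agent
    return q.argmax(axis=1)


observations = rng.normal(size=(N_AGENTS, OBS_DIM))
agent_ids = np.arange(N_AGENTS)
actions = act(observations, agent_ids)
```

Because the whole population is processed as one batch through the same matrices, adding agents increases only the batch size, which is the kind of workload a single GPU handles well.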

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence
