Abstract:Multi-agent reinforcement learning (MARL) has received increasing attention for its applications in various domains. Researchers have paid much attention on its partially observable and cooperative settings for meeting real-world requirements. For testing performance of different algorithms, standardized environments are designed such as the StarCraft Multi-Agent Challenge, which is one of the most successful MARL benchmarks. To our best knowledge, most of current environments are synchronous, where agents execute actions in the same pace. However, heterogeneous agents usually have their own action spaces and there is no guarantee for actions from different agents to have the same executed cycle, which leads to asynchronous multi-agent cooperation. Inspired from the Wargame, a confrontation game between two armies abstracted from real world environment, we propose the first Partially Observable Asynchronous multi-agent Cooperation challenge (POAC) for the MARL community. Specifically, POAC supports two teams of heterogeneous agents to fight with each other, where an agent selects actions based on its own observations and cooperates asynchronously with its allies. Moreover, POAC is a light weight, flexible and easy to use environment, which can be configured by users to meet different experimental requirements such as self-play model, human-AI model and so on. Along with our benchmark, we offer six game scenarios of varying difficulties with the built-in rule-based AI as opponents. Finally, since most MARL algorithms are designed for synchronous agents, we revise several representatives to meet the asynchronous setting, and the relatively poor experimental results validate the challenge of POAC. Source code is released in \url{<a class="link-external link-http" href="http://turingai.ia.ac.cn/data" rel="external noopener nofollow">this http URL</a>\_center/show}.

Attentional Opponent Modelling for Multi-agent Cooperation

P-CMOMMT Algorithm For The Cooperative Multi-Robot Observation of Multiple Moving Targets

Model-Based Opponent Modeling

Decision-making with Speculative Opponent Models

Learning to Model Opponent Learning

Modeling opponent learning in multiagent repeated games

A brain-inspired theory of mind spiking neural network improves multi-agent cooperation and competition

Multi-agent Actor-Critic with Time Dynamical Opponent Model

Opponent portrait for multiagent reinforcement learning in competitive environment

Attention-Guided Contrastive Role Representations for Multi-Agent Reinforcement Learning

Learning Attentional Communication for Multi-Agent Cooperation

ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind

Emergence of Theory of Mind Collaboration in Multiagent Systems

Multi-Agent Combat in Non-Stationary Environments

Inverse Attention Agent for Multi-Agent System

Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object Tracking

Modeling and reinforcement learning in partially observable many-agent systems

Opponent Modeling in Deep Reinforcement Learning

Optimistic sequential multi-agent reinforcement learning with motivational communication

The Partially Observable Asynchronous Multi-Agent Cooperation Challenge

Theory of Mind for Multi-Agent Collaboration via Large Language Models