Abstract: The Centralized Training and Decentralized Execution (CTDE) paradigm, in which a centralized critic can access global information during training while the learned policies are executed in a decentralized way using only local information, has made great progress in recent years. Despite this progress, CTDE may suffer from Centralized-Decentralized Mismatch (CDM): the suboptimality of one agent's policy can degrade the policy learning of other agents through the centralized joint critic. In contrast to centralized learning, the cooperative model that most closely resembles how humans cooperate in nature is fully decentralized, i.e., Independent Learning (IL). However, two issues must be addressed before agents can coordinate through IL: (1) how agents become aware of the presence of other agents, and (2) how agents coordinate with one another to improve the joint policy under IL. In this paper, we propose an inference-based coordinated MARL method, the Deep Motor System (DMS). First, DMS introduces individual intention inference, which allows agents to disentangle other agents from their environment. Second, DMS uses causal inference to enhance coordination by reasoning about each agent's effect on the behavior of others. We evaluate the proposed model extensively on a series of Multi-Agent MuJoCo and StarCraft II tasks. The results show that the proposed method outperforms independent learning algorithms and that coordinated behavior among agents can be learned even without the CTDE paradigm, in comparison with state-of-the-art baselines including IPPO and HAPPO.

Consciousness-Aware Multi-Agent Reinforcement Learning

Complementary Attention for Multi-Agent Reinforcement Learning

S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning

Enhancing cooperation by cognition differences and consistent representation in multi-agent reinforcement learning

Multiagent Q-learning with Sub-Team Coordination

Attention Enhanced Reinforcement Learning for Multi agent Cooperation

Cooperative multi-agent game based on reinforcement learning

Multi-Agent Concentrative Coordination with Decentralized Task Representation

Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks

ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency

Optimistic sequential multi-agent reinforcement learning with motivational communication

Self-Motivated Multi-Agent Exploration

Coordination as inference in multi-agent reinforcement learning

Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning

Multi-agent Continual Coordination Via Progressive Task Contextualization

Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey
