Abstract:Multi-agent reinforcement learning (MARL) is a challenging branch of reinforcement learning that requires cooperation of interactive learning agents to achieve individual objectives as well as shared team objectives. Existing MARL algorithms generally use either centralized global state representation or decentralized local observation to perform training and execution. In this paper, we introduce a novel MARL learning paradigm, centralized training with semi-centralized execution (CTSCE), and present a new MARL algorithm for addressing multi-agent problems: Semi-Centralized Multi-Agent Imitation Reinforcement Learning (SC-MAIRL). The semi-centralized approach aggregated with agents' spatial and temporal information serves as a joint knowledge base to facilitate a learning agent to discover team objectives and make fine-grained decisions. We also utilize a pre-trained performant teacher policy to guide an untrained model towards positive game states as a form of imitation learning, significantly increasing the agent's learning speed. In addition, to encourage agents to learn both offensive and defensive behaviors and smooth the high-dimensional learning curve, we present a new set of reward-shaping functions to further improve SC-MAIRL's learning performance. Our approach is evaluated using one of the most challenging scenarios within the StarCraft Multi-Agent Challenge environment, and the results show that SC-MAIRL outperforms the state-of-the-art MARL algorithm MAPPO in several metrics and allows our agents to learn and employ novel, complex macro strategies more effectively.

Cooperative multi-agent game based on reinforcement learning

Learning Intra-group Cooperation in Multi-agent Systems.

Efficient Multi-Agent Exploration with Mutual-Guided Actor-Critic

Deep reinforcement learning algorithm based on multi-agent parallelism and its application in game environment

Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning

Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning

ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency

APC: Predict Global Representation from Local Observation in Multi-Agent Reinforcement Learning

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

SC-MAIRL: Semi-Centralized Multi-Agent Imitation Reinforcement Learning

Consciousness-Aware Multi-Agent Reinforcement Learning

F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning

Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition

Attention-Guided Contrastive Role Representations for Multi-Agent Reinforcement Learning

CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning

Enhancing cooperation by cognition differences and consistent representation in multi-agent reinforcement learning

Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes

An Advanced Actor-Critic Algorithm for Training Video Game AI

Multi-Agent Game Abstraction Via Graph Attention Neural Network.

Self-attention-based multi-agent continuous control method in cooperative environments

Counterfactual Multi-Agent Policy Gradients