Abstract:Multi-Agent Reinforcement Learning (MARL) is extensively utilized for addressing intricate tasks that involve cooperation and competition among agents in Multi-Agent Systems (MAS). However, learning such tasks from scratch is challenging and often unfeasible, especially for MASs with a large number of agents. Hence, leveraging knowledge from prior experiences can effectively expedite the MARL learning process. Prior work has shown that we successfully facilitated transfer learning for MARL by consolidating various state spaces into fixed-size inputs, enabling a single unified deep-learning policy applicable to several scenarios within the StarCraft Multi-Agent Challenge (SMAC) environment. In this study, we expand SMAC to Multi-Player enabled SMAC (MP-SMAC) by enabling the dynamic selection of training opponents and introducing a co-evolving MARL framework, which creates a co-evolutionary arena where multiple policies learn simultaneously. Our arena comprised the simultaneous training of multiple policies in diverse scenarios, pitting them against both static AI opponents and their peers within MP-SMAC. Furthermore, we integrate co-evolution with curriculum transfer learning into Co-MACTRL framework, enabling our MARL policies to systematically acquire knowledge and skills across predetermined scenarios organized by varying difficulty levels, including evolving opponents. The results revealed significant enhancements in MARL learning performance, demonstrating the advantage of leveraging the co-evolving opponents and maneuvering skills obtained from different scenarios. Additionally, the Co-MACTRL learners consistently attained high performance across a range of SMAC scenarios, showcasing the robustness and generalizability of Co-MACTRL.

Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

S2rl

Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration

State-based episodic memory for multi-agent reinforcement learning

S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?

Off-Beat Multi-Agent Reinforcement Learning

Continuous Policy Multi-Agent Deep Reinforcement Learning with Generalizable Episodic Memory

Multiexperience-Assisted Efficient Multiagent Reinforcement Learning

SC-MAIRL: Semi-Centralized Multi-Agent Imitation Reinforcement Learning

Co-Evolving Multi-Agent Transfer Reinforcement Learning Via Scenario Independent Representation

Priority over Quantity: A Self-Incentive Credit Assignment Scheme for Cooperative Multiagent Reinforcement Learning

Boosting Value Decomposition Via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning

ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency

Sample-efficient multi-agent reinforcement learning with masked reconstruction

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning

SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward

LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning

Two-Memory Reinforcement Learning

Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning.

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning