Abstract:Multi-agent reinforcement learning shines as the pinnacle of multi-agent systems, conquering intricate real-world challenges, fostering collaboration and coordination among agents, and unleashing the potential for intelligent decision-making across domains. However, training a multi-agent reinforcement learning network is a formidable endeavor, demanding substantial computational resources to interact with diverse environmental variables, extract state representations, and acquire decision-making knowledge. The recent breakthroughs in large-scale pre-trained models ignite our curiosity: Can we uncover shared knowledge in multi-agent reinforcement learning and leverage pre-trained models to expedite training for future tasks? Addressing this issue, we present an innovative multi-task learning approach that aims to extract and harness common decision-making knowledge, like cooperation and competition, across different tasks. Our approach involves concurrent training of multiple multi-agent tasks, with each task employing independent front-end perception layers while sharing back-end decision-making layers. This effective decoupling of state representation extraction from decision-making allows for more efficient training and better transferability. To evaluate the efficacy of our proposed approach, we conduct comprehensive experiments in two distinct environments: the StarCraft Multi-agent Challenge (SMAC) and the Google Research Football (GRF) environments. The experimental results unequivocally demonstrate the smooth transferability of the shared decision-making network to other tasks, thereby significantly reducing training costs and improving final performance. Furthermore, visualizations authenticate the presence of general multi-agent decision-making knowledge within the shared network layers, further validating the effectiveness of our approach.

MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning Via Mixing Recurrent Soft Decision Trees

S2rl

S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?

MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep Reinforcement Learning

Correcting Biased Value Estimation in Mixing Value-Based Multi-Agent Reinforcement Learning by Multiple Choice Learning.

MAR2MIX: A Novel Model for Dynamic Problem in Multi-agent Reinforcement Learning.

Heterogeneous Multi-Robot Cooperation With Asynchronous Multi-Agent Reinforcement Learning

Boosting Value Decomposition Via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

CDT: Cascading Decision Trees for Explainable Reinforcement Learning

BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions

Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning

Embedding multi-agent reinforcement learning into behavior trees with unexpected interruptions

A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models

Multi-Agent Collaboration via Reward Attribution Decomposition

Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination

TLMIX: Twin Leader Mixing Network for Cooperative Multi-Agent Reinforcement Learning.

Multi-trainer binary feedback interactive reinforcement learning

Celebrating Diversity in Shared Multi-Agent Reinforcement Learning

Conservative Q-Improvement: Reinforcement Learning for an Interpretable Decision-Tree Policy

Learning Multi-Agent Cooperation via Considering Actions of Teammates