Abstract:Effective collaboration in multi-agent systems requires communicating goals and intentions between agents. Current agent frameworks often suffer from dependencies on single-agent execution and lack robust inter-module communication, frequently leading to suboptimal multi-agent reinforcement learning (MARL) policies and inadequate task coordination. To address these challenges, we present a framework for training large language models (LLMs) as collaborative agents to enable coordinated behaviors in cooperative MARL. Each agent maintains a private intention consisting of its current goal and associated sub-tasks. Agents broadcast their intentions periodically, allowing other agents to infer coordination tasks. A propagation network transforms broadcast intentions into teammate-specific communication messages, sharing relevant goals with designated teammates. The architecture of our framework is structured into planning, grounding, and execution modules. During execution, multiple agents interact in a downstream environment and communicate intentions to enable coordinated behaviors. The grounding module dynamically adapts comprehension strategies based on emerging coordination patterns, while feedback from execution agents influnces the planning module, enabling the dynamic re-planning of sub-tasks. Results in collaborative environment simulation demonstrate intention propagation reduces miscoordination errors by aligning sub-task dependencies between agents. Agents learn when to communicate intentions and which teammates require task details, resulting in emergent coordinated behaviors. This demonstrates the efficacy of intention sharing for cooperative multi-agent RL based on LLMs.

Closely Cooperative Multi-Agent Reinforcement Learning Based on Intention Sharing and Credit Assignment

S2rl

Situation-Dependent Causal Influence-Based Cooperative Multi-agent Reinforcement Learning

Credit assignment in heterogeneous multi-agent reinforcement learning for fully cooperative tasks

A Cooperative Multi-Agent Reinforcement Learning Method Based on Coordination Degree

Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions

SC-MAIRL: Semi-Centralized Multi-Agent Imitation Reinforcement Learning

Modeling the Interaction Between Agents in Cooperative Multi-Agent Reinforcement Learning

Multi-Agent Concentrative Coordination with Decentralized Task Representation

Multi-agent Continual Coordination Via Progressive Task Contextualization

Hemipotassium hemirubidium digallium(III) manganese(II) tris(phosphate) dihydrate

Multi-Task Multi-Agent Reinforcement Learning With Interaction and Task Representations

Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models

Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning

LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning

Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks

Multi-Agent Collaboration via Reward Attribution Decomposition

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning

MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment

Learning Reward Machines in Cooperative Multi-Agent Tasks

A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning