Abstract:Language models (LMs) possess a strong capability to comprehend natural language, making them effective in translating human instructions into detailed plans for simple robot tasks. Nevertheless, it remains a significant challenge to handle long-horizon tasks, especially in subtask identification and allocation for cooperative heterogeneous robot teams. To address this issue, we propose a Language Model-Driven Multi-Agent PDDL Planner (LaMMA-P), a novel multi-agent task planning framework that achieves state-of-the-art performance on long-horizon tasks. LaMMA-P integrates the strengths of the LMs' reasoning capability and the traditional heuristic search planner to achieve a high success rate and efficiency while demonstrating strong generalization across tasks. Additionally, we create MAT-THOR, a comprehensive benchmark that features household tasks with two different levels of complexity based on the AI2-THOR environment. The experimental results demonstrate that LaMMA-P achieves a 105% higher success rate and 36% higher efficiency than existing LM-based multi-agent planners. The experimental videos, code, and datasets of this work as well as the detailed prompts used in each module are available at <a class="link-external link-https" href="https://lamma-p.github.io" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve The paper aims to address the challenges faced by multi-robot systems in executing long-term tasks, particularly the issues of subtask identification and allocation. Specifically, the paper focuses on how to effectively handle complex long-term tasks in heterogeneous robot teams, where multiple robots need to work collaboratively, and each robot has different skills and capabilities. ### Main Issues and Challenges 1. **Long-term Task Planning**: Traditional methods face difficulties in handling long-term tasks, especially in task decomposition, subtask allocation, and coordination. 2. **Task Generalization**: Existing methods perform poorly when dealing with tasks of varying difficulty, lacking generalization capabilities. 3. **Efficient Subtask Allocation**: How to efficiently allocate subtasks based on each robot's capabilities and the complexity of the tasks. 4. **Optimized Coordination**: Ensuring coordination among robots to maximize performance, particularly in tasks that require parallel execution. ### Solution To address the above issues, the paper proposes a new framework called LaMMA-P (Language Model-Driven Multi-Agent PDDL Planner). LaMMA-P combines the reasoning capabilities of large language models (LLMs) with the advantages of traditional heuristic search planners (such as PDDL) to achieve efficient long-term task planning and execution. ### Main Contributions 1. **Proposing the LaMMA-P Framework**: This framework is the first to combine PDDL with LLMs for handling multi-agent task planning problems with any number of agents. 2. **Modular Design**: LaMMA-P adopts a modular design, allowing seamless integration of LLMs, PDDL planning systems, and simulation environments, thereby enabling flexible task decomposition and efficient subtask allocation based on robot skills. 3. **Creating the MAT-THOR Benchmark**: This benchmark, based on the AI2-THOR simulator, includes household tasks of varying complexity to evaluate the effectiveness and robustness of multi-agent planning methods. Experimental results show that LaMMA-P outperforms existing LLM-based multi-agent planning methods in terms of success rate and efficiency. ### Summary By combining the strong reasoning capabilities of LLMs with the structured problem-solving approach of PDDL, LaMMA-P effectively addresses the challenges faced by multi-robot systems in executing long-term tasks, particularly excelling in task decomposition, subtask allocation, and optimized coordination.

LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner

MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

RoboGPT: an intelligent agent of making embodied long-term decisions for daily instruction tasks

Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents

SMART-LLM: Smart Multi-Agent Robot Task Planning using Large Language Models

Leveraging the Efficiency of Multi-Task Robot Manipulation Via Task-Evoked Planner and Reinforcement Learning

Nl2Hltl2Plan: Scaling Up Natural Language Understanding for Multi-Robots Through Hierarchical Temporal Logic Task Representation

COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models

SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models

DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models

DART-LLM: Dependency-Aware Multi-Robot Task Decomposition and Execution using Large Language Models

LiP-LLM: Integrating Linear Programming and dependency graph with Large Language Models for multi-robot task planning

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots

AdaPlanner: Adaptive Planning from Feedback with Language Models

LLM A: Human in the Loop Large Language Models Enabled A Search for Robotics

LASP: Surveying the State-of-the-Art in Large Language Model-Assisted AI Planning