LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner

Xiaopan Zhang,Hao Qin,Fuquan Wang,Yue Dong,Jiachen Li
2024-10-01
Abstract:Language models (LMs) possess a strong capability to comprehend natural language, making them effective in translating human instructions into detailed plans for simple robot tasks. Nevertheless, it remains a significant challenge to handle long-horizon tasks, especially in subtask identification and allocation for cooperative heterogeneous robot teams. To address this issue, we propose a Language Model-Driven Multi-Agent PDDL Planner (LaMMA-P), a novel multi-agent task planning framework that achieves state-of-the-art performance on long-horizon tasks. LaMMA-P integrates the strengths of the LMs' reasoning capability and the traditional heuristic search planner to achieve a high success rate and efficiency while demonstrating strong generalization across tasks. Additionally, we create MAT-THOR, a comprehensive benchmark that features household tasks with two different levels of complexity based on the AI2-THOR environment. The experimental results demonstrate that LaMMA-P achieves a 105% higher success rate and 36% higher efficiency than existing LM-based multi-agent planners. The experimental videos, code, and datasets of this work as well as the detailed prompts used in each module are available at <a class="link-external link-https" href="https://lamma-p.github.io" rel="external noopener nofollow">this https URL</a>.
Robotics,Artificial Intelligence,Computer Vision and Pattern Recognition,Machine Learning,Multiagent Systems
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve The paper aims to address the challenges faced by multi-robot systems in executing long-term tasks, particularly the issues of subtask identification and allocation. Specifically, the paper focuses on how to effectively handle complex long-term tasks in heterogeneous robot teams, where multiple robots need to work collaboratively, and each robot has different skills and capabilities. ### Main Issues and Challenges 1. **Long-term Task Planning**: Traditional methods face difficulties in handling long-term tasks, especially in task decomposition, subtask allocation, and coordination. 2. **Task Generalization**: Existing methods perform poorly when dealing with tasks of varying difficulty, lacking generalization capabilities. 3. **Efficient Subtask Allocation**: How to efficiently allocate subtasks based on each robot's capabilities and the complexity of the tasks. 4. **Optimized Coordination**: Ensuring coordination among robots to maximize performance, particularly in tasks that require parallel execution. ### Solution To address the above issues, the paper proposes a new framework called LaMMA-P (Language Model-Driven Multi-Agent PDDL Planner). LaMMA-P combines the reasoning capabilities of large language models (LLMs) with the advantages of traditional heuristic search planners (such as PDDL) to achieve efficient long-term task planning and execution. ### Main Contributions 1. **Proposing the LaMMA-P Framework**: This framework is the first to combine PDDL with LLMs for handling multi-agent task planning problems with any number of agents. 2. **Modular Design**: LaMMA-P adopts a modular design, allowing seamless integration of LLMs, PDDL planning systems, and simulation environments, thereby enabling flexible task decomposition and efficient subtask allocation based on robot skills. 3. **Creating the MAT-THOR Benchmark**: This benchmark, based on the AI2-THOR simulator, includes household tasks of varying complexity to evaluate the effectiveness and robustness of multi-agent planning methods. Experimental results show that LaMMA-P outperforms existing LLM-based multi-agent planning methods in terms of success rate and efficiency. ### Summary By combining the strong reasoning capabilities of LLMs with the structured problem-solving approach of PDDL, LaMMA-P effectively addresses the challenges faced by multi-robot systems in executing long-term tasks, particularly excelling in task decomposition, subtask allocation, and optimized coordination.