Tree-Planner: Efficient Close-loop Task Planning with Large Language Models

Mengkang Hu,Yao Mu,Xinmiao Yu,Mingyu Ding,Shiguang Wu,Wenqi Shao,Qiguang Chen,Bin Wang,Yu Qiao,Ping Luo
2024-07-24
Abstract:This paper studies close-loop task planning, which refers to the process of generating a sequence of skills (a plan) to accomplish a specific goal while adapting the plan based on real-time observations. Recently, prompting Large Language Models (LLMs) to generate actions iteratively has become a prevalent paradigm due to its superior performance and user-friendliness. However, this paradigm is plagued by two inefficiencies: high token consumption and redundant error correction, both of which hinder its scalability for large-scale testing and applications. To address these issues, we propose Tree-Planner, which reframes task planning with LLMs into three distinct phases: plan sampling, action tree construction, and grounded deciding. Tree-Planner starts by using an LLM to sample a set of potential plans before execution, followed by the aggregation of them to form an action tree. Finally, the LLM performs a top-down decision-making process on the tree, taking into account real-time environmental information. Experiments show that Tree-Planner achieves state-of-the-art performance while maintaining high efficiency. By decomposing LLM queries into a single plan-sampling call and multiple grounded-deciding calls, a considerable part of the prompt are less likely to be repeatedly consumed. As a result, token consumption is reduced by 92.2% compared to the previously best-performing model. Additionally, by enabling backtracking on the action tree as needed, the correction process becomes more flexible, leading to a 40.5% decrease in error corrections.
Computation and Language,Artificial Intelligence,Machine Learning,Robotics
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address two main issues in closed-loop task planning: high token consumption and redundant error correction. Specifically: 1. **High Token Consumption**: Existing task planning methods based on large language models (LLMs) require repeated provision of environment information, instructions, etc., for generating each action, leading to significant token consumption and increased costs. 2. **Redundant Error Correction**: When an action execution error occurs, existing methods typically need to regenerate the entire plan or regenerate actions from the current time step, which is time-consuming and further increases token consumption. To tackle these issues, the paper proposes a new framework called **TREE-PLANNER**. This framework decomposes the task planning process into three stages: plan sampling, action tree construction, and grounded deciding. Through this approach, TREE-PLANNER can significantly reduce token consumption and improve error correction efficiency. ### Main Contributions 1. **Reduced Token Consumption**: By separating plan sampling and grounded deciding, TREE-PLANNER only needs to provide global information and context examples once during the plan sampling stage, without repeating this information in subsequent decision processes. Experimental results show that token consumption is reduced by 92.2% compared to the best existing methods. 2. **Improved Error Correction Efficiency**: By backtracking on the action tree, TREE-PLANNER can more flexibly correct errors, reducing redundant decisions. Experimental results indicate that the number of error corrections is reduced by 40.5%. 3. **Superior Performance**: Experiments conducted in the VirtualHome simulation environment demonstrate that TREE-PLANNER achieves state-of-the-art performance in both settings of no error correction and with error correction, surpassing the best baseline model by 1.29% and 3.65% in success rate, respectively. ### Experimental Validation The paper conducts experiments in the VirtualHome simulation environment to validate the effectiveness and efficiency of TREE-PLANNER. The results show that TREE-PLANNER not only outperforms existing methods in terms of performance but also excels in token consumption and error correction efficiency. ### Conclusion By introducing TREE-PLANNER, the paper successfully addresses the issues of high token consumption and redundant error correction present in existing LLM-based task planning methods, providing a more efficient and feasible solution for large-scale testing and practical applications.