Abstract:This paper studies close-loop task planning, which refers to the process of generating a sequence of skills (a plan) to accomplish a specific goal while adapting the plan based on real-time observations. Recently, prompting Large Language Models (LLMs) to generate actions iteratively has become a prevalent paradigm due to its superior performance and user-friendliness. However, this paradigm is plagued by two inefficiencies: high token consumption and redundant error correction, both of which hinder its scalability for large-scale testing and applications. To address these issues, we propose Tree-Planner, which reframes task planning with LLMs into three distinct phases: plan sampling, action tree construction, and grounded deciding. Tree-Planner starts by using an LLM to sample a set of potential plans before execution, followed by the aggregation of them to form an action tree. Finally, the LLM performs a top-down decision-making process on the tree, taking into account real-time environmental information. Experiments show that Tree-Planner achieves state-of-the-art performance while maintaining high efficiency. By decomposing LLM queries into a single plan-sampling call and multiple grounded-deciding calls, a considerable part of the prompt are less likely to be repeatedly consumed. As a result, token consumption is reduced by 92.2% compared to the previously best-performing model. Additionally, by enabling backtracking on the action tree as needed, the correction process becomes more flexible, leading to a 40.5% decrease in error corrections.

What problem does this paper attempt to address?

### Problems the Paper Aims to Solve The paper aims to address two main issues in closed-loop task planning: high token consumption and redundant error correction. Specifically: 1. **High Token Consumption**: Existing task planning methods based on large language models (LLMs) require repeated provision of environment information, instructions, etc., for generating each action, leading to significant token consumption and increased costs. 2. **Redundant Error Correction**: When an action execution error occurs, existing methods typically need to regenerate the entire plan or regenerate actions from the current time step, which is time-consuming and further increases token consumption. To tackle these issues, the paper proposes a new framework called **TREE-PLANNER**. This framework decomposes the task planning process into three stages: plan sampling, action tree construction, and grounded deciding. Through this approach, TREE-PLANNER can significantly reduce token consumption and improve error correction efficiency. ### Main Contributions 1. **Reduced Token Consumption**: By separating plan sampling and grounded deciding, TREE-PLANNER only needs to provide global information and context examples once during the plan sampling stage, without repeating this information in subsequent decision processes. Experimental results show that token consumption is reduced by 92.2% compared to the best existing methods. 2. **Improved Error Correction Efficiency**: By backtracking on the action tree, TREE-PLANNER can more flexibly correct errors, reducing redundant decisions. Experimental results indicate that the number of error corrections is reduced by 40.5%. 3. **Superior Performance**: Experiments conducted in the VirtualHome simulation environment demonstrate that TREE-PLANNER achieves state-of-the-art performance in both settings of no error correction and with error correction, surpassing the best baseline model by 1.29% and 3.65% in success rate, respectively. ### Experimental Validation The paper conducts experiments in the VirtualHome simulation environment to validate the effectiveness and efficiency of TREE-PLANNER. The results show that TREE-PLANNER not only outperforms existing methods in terms of performance but also excels in token consumption and error correction efficiency. ### Conclusion By introducing TREE-PLANNER, the paper successfully addresses the issues of high token consumption and redundant error correction present in existing LLM-based task planning methods, providing a more efficient and feasible solution for large-scale testing and practical applications.

Tree-Planner: Efficient Close-loop Task Planning with Large Language Models

AdaPlanner: Adaptive Planning from Feedback with Language Models

FLTRNN: Faithful Long-Horizon Task Planning for Robotics with Large Language Models

Learning to Plan by Updating Natural Language

LLM as BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning

Learning to Plan with Natural Language

Consolidating Trees of Robotic Plans Generated Using Large Language Models to Improve Reliability

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

Improving Planning with Large Language Models: A Modular Agentic Architecture

Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoning

Interactive and Expressive Code-Augmented Planning with Large Language Models

Tool-Planner: Task Planning with Clusters across Multiple Tools

Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents

Autonomous Tree-search Ability of Large Language Models

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Exploring and Benchmarking the Planning Capabilities of Large Language Models

Planning-Driven Programming: A Large Language Model Programming Workflow

DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models

SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator