Consolidating Trees of Robotic Plans Generated Using Large Language Models to Improve Reliability

Md Sadman Sakib,Yu Sun
DOI: https://doi.org/10.1142/S2972335324500029
2024-01-16
Abstract:The inherent probabilistic nature of Large Language Models (LLMs) introduces an element of unpredictability, raising concerns about potential discrepancies in their output. This paper introduces an innovative approach aims to generate correct and optimal robotic task plans for diverse real-world demands and scenarios. LLMs have been used to generate task plans, but they are unreliable and may contain wrong, questionable, or high-cost steps. The proposed approach uses LLM to generate a number of task plans as trees and amalgamates them into a graph by removing questionable paths. Then an optimal task tree can be retrieved to circumvent questionable and high-cost nodes, thereby improving planning accuracy and execution efficiency. The approach is further improved by incorporating a large knowledge network. Leveraging GPT-4 further, the high-level task plan is converted into a low-level Planning Domain Definition Language (PDDL) plan executable by a robot. Evaluation results highlight the superior accuracy and efficiency of our approach compared to previous methodologies in the field of task planning.
Robotics,Artificial Intelligence,Computation and Language
What problem does this paper attempt to address?
The main problem addressed in this paper is the unreliability and potential errors of large-scale language models (LLMs) in generating robot task plans. The author proposes an innovative approach to enhance the ability of LLMs to generate correct and optimal robot task plans in varied real-world scenarios. Specifically, the method described in the paper includes the following steps: 1. Using LLMs like GPT-4 to generate multiple task trees. 2. Merging these trees into a network to eliminate unreliable paths. 3. Obtaining an optimized task tree through a search algorithm to avoid high-cost nodes and improve planning accuracy and execution efficiency. 4. Utilizing a large knowledge network to further enhance the method and translating advanced task plans into low-level Planning Domain Definition Language (PDDL) plans for execution by the robot. By comparing and analyzing with existing task planning methods, the paper demonstrates the superiority of the proposed method in accuracy and efficiency, especially in the experiment of cooking robot task planning. Additionally, the paper discusses how to integrate LLMs with external knowledge resources to reduce errors and improve the reliability of the plans.