Abstract:Large language models have found utility in the domain of robot task planning and task decomposition. Nevertheless, the direct application of these models for instructing robots in task execution is not without its challenges. Limitations arise in handling more intricate tasks, encountering difficulties in effective interaction with the environment, and facing constraints in the practical executability of machine control instructions directly generated by such models. In response to these challenges, this research advocates for the implementation of a multi-layer large language model to augment a robot's proficiency in handling complex tasks. The proposed model facilitates a meticulous layer-by-layer decomposition of tasks through the integration of multiple large language models, with the overarching goal of enhancing the accuracy of task planning. Within the task decomposition process, a visual language model is introduced as a sensor for environment perception. The outcomes of this perception process are subsequently assimilated into the large language model, thereby amalgamating the task objectives with environmental information. This integration, in turn, results in the generation of robot motion planning tailored to the specific characteristics of the current environment. Furthermore, to enhance the executability of task planning outputs from the large language model, a semantic alignment method is introduced. This method aligns task planning descriptions with the functional requirements of robot motion, thereby refining the overall compatibility and coherence of the generated instructions. To validate the efficacy of the proposed approach, an experimental platform is established utilizing an intelligent unmanned vehicle. This platform serves as a means to empirically verify the proficiency of the multi-layer large language model in addressing the intricate challenges associated with both robot task planning and execution.

Task Planning for Robot Manipulator Using Natural Language Task Input with Large Language Models

Robot Task Planning Based on Large Language Model Representing Knowledge with Directed Graph Structures

MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model

Enhancing Robot Task Planning and Execution through Multi-Layer Large Language Models

LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots

Creative Robot Tool Use with Large Language Models

DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models

Interactively Robot Action Planning with Uncertainty Analysis and Active Questioning by Large Language Model

LiP-LLM: Integrating Linear Programming and dependency graph with Large Language Models for multi-robot task planning

Decision-Making in Robotic Grasping with Large Language Models.

ProgPrompt: Generating Situated Robot Task Plans using Large Language Models

Towards Human Awareness in Robot Task Planning with Large Language Models

Task and Motion Planning with Large Language Models for Object Rearrangement

Large Language Models as Commonsense Knowledge for Large-Scale Task Planning

Large Language Models for Robotics: Opportunities, Challenges, and Perspectives

TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage

Combining Ontological Knowledge and Large Language Model for User-Friendly Service Robots

Behavior Tree Generation using Large Language Models for Sequential Manipulation Planning with Human Instructions and Feedback

Interactive Task Planning with Language Models