A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics

Jonas Bode,Bastian Pätzold,Raphael Memmesheimer,Sven Behnke

2024-10-30

Abstract:Recent advances in LLM have been instrumental in autonomous robot control and human-robot interaction by leveraging their vast general knowledge and capabilities to understand and reason across a wide range of tasks and scenarios. Previous works have investigated various prompt engineering techniques for improving the performance of \glspl{LLM} to accomplish tasks, while others have proposed methods that utilize LLMs to plan and execute tasks based on the available functionalities of a given robot platform. In this work, we consider both lines of research by comparing prompt engineering techniques and combinations thereof within the application of high-level task planning and execution in service robotics. We define a diverse set of tasks and a simple set of functionalities in simulation, and measure task completion accuracy and execution time for several state-of-the-art models.

Robotics,Artificial Intelligence

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to improve the accuracy and efficiency of task completion through different prompt engineering techniques when using large - language models (LLMs) for task planning and execution in service robots. Specifically, the paper explores how to use LLMs to understand users' natural - language commands and plan and execute tasks accordingly, given the robot's function set and environmental state. The focus of the research is on comparing the effects of different prompt engineering techniques to determine which techniques can most effectively enhance the performance of LLMs in service robot tasks. The paper defines a series of tasks, from simple object fetching and delivering (such as "Please bring me a pen from the study") to more complex conditional tasks (such as "Check if there is a spoon in the kitchen. If there is, bring me a pen from the study; if not, bring me a comb from the bedroom"), and tasks that require repetitive operations and logical loops (such as "Move every orange from the kitchen to the living room"). These tasks are designed to test the ability of LLMs to understand and execute different types of tasks, especially their performance in the face of complex, dynamic or unknown environments. Through systematic experiments and evaluations of multiple prompt engineering techniques in a simulated environment, the paper hopes to provide effective solutions for advanced task planning and execution in service robots, thereby promoting research and technological development in this field.

A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics

ProgPrompt: Generating Situated Robot Task Plans using Large Language Models

LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots

ProgPrompt: program generation for situated robot task planning using large language models

A Comprehensive Review of Task Understanding of Command-Triggered Execution of Tasks for Service Robots

Ontology-driven Prompt Tuning for LLM-based Task and Motion Planning

Robot Task Planning Based on Large Language Model Representing Knowledge with Directed Graph Structures

Deploying and Evaluating LLMs to Program Service Mobile Robots

What Should We Engineer in Prompts? Training Humans in Requirement-Driven LLM Use

Human Planning of Robot Actions through LLM-guided State Machine Synthesis

ConceptAgent: LLM-Driven Precondition Grounding and Tree Search for Robust Task Planning and Execution

RePrompt: Planning by Automatic Prompt Engineering for Large Language Models Agents

Grounding LLMs For Robot Task Planning Using Closed-loop State Feedback

SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models

Empowering Large Language Models on Robotic Manipulation with Affordance Prompting

Task Planning in Robotics: an Empirical Comparison of PDDL-based and ASP-based Systems

Enhancing Robot Task Planning: Integrating Environmental Information and Feedback Insights Through Large Language Models

Text2Reaction : Enabling Reactive Task Planning Using Large Language Models

Prompter: Utilizing Large Language Model Prompting for a Data Efficient Embodied Instruction Following

Task and Motion Planning for Execution in the Real

Autonomous Prompt Engineering in Large Language Models