A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics

Jonas Bode, Bastian Pätzold, Raphael Memmesheimer, Sven Behnke
2024-10-30
Abstract: Recent advances in large language models (LLMs) have been instrumental in autonomous robot control and human-robot interaction by leveraging their vast general knowledge and capabilities to understand and reason across a wide range of tasks and scenarios. Previous works have investigated various prompt engineering techniques for improving the performance of LLMs to accomplish tasks, while others have proposed methods that utilize LLMs to plan and execute tasks based on the available functionalities of a given robot platform. In this work, we consider both lines of research by comparing prompt engineering techniques and combinations thereof within the application of high-level task planning and execution in service robotics. We define a diverse set of tasks and a simple set of functionalities in simulation, and measure task completion accuracy and execution time for several state-of-the-art models.
Robotics, Artificial Intelligence
What problem does this paper attempt to address?
This paper asks which prompt engineering techniques most improve the accuracy and efficiency of task completion when large language models (LLMs) are used for task planning and execution in service robots. Specifically, it examines how an LLM can interpret a user's natural-language command and then plan and execute the corresponding task, given the robot's set of available functions and the state of the environment. The focus is on comparing prompt engineering techniques, alone and in combination, to determine which ones most effectively enhance LLM performance on service robot tasks.

The paper defines a series of tasks of increasing difficulty: simple object fetching and delivery (such as "Please bring me a pen from the study"), more complex conditional tasks (such as "Check if there is a spoon in the kitchen. If there is, bring me a pen from the study; if not, bring me a comb from the bedroom"), and tasks that require repeated operations and logical loops (such as "Move every orange from the kitchen to the living room"). These tasks are designed to test how well LLMs understand and execute different types of tasks, especially in complex, dynamic, or unknown environments.

By systematically evaluating multiple prompt engineering techniques in a simulated environment, measuring task completion accuracy and execution time, the paper aims to identify effective approaches to high-level task planning and execution in service robotics.
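
To make the three task categories concrete, the following is a minimal sketch of what a simulated function set and the corresponding plans might look like. It is not the paper's implementation: the class `SimWorld`, the functions `go_to`, `is_present`, `pick`, and `place`, the room layout in `make_world`, and the assumption that "bring me" means delivering to the living room are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SimWorld:
    """Hypothetical toy simulation: maps each room to the objects it contains."""
    rooms: dict
    robot_room: str = "living room"
    holding: Optional[str] = None

    def go_to(self, room: str) -> None:
        if room not in self.rooms:
            raise ValueError(f"unknown room: {room}")
        self.robot_room = room

    def is_present(self, obj: str) -> bool:
        # The robot can only observe the room it is currently in.
        return obj in self.rooms[self.robot_room]

    def pick(self, obj: str) -> bool:
        if self.holding is not None or not self.is_present(obj):
            return False
        self.rooms[self.robot_room].remove(obj)
        self.holding = obj
        return True

    def place(self) -> bool:
        if self.holding is None:
            return False
        self.rooms[self.robot_room].append(self.holding)
        self.holding = None
        return True


def fetch(world: SimWorld, obj: str, source: str, target: str) -> bool:
    """Simple task: fetch one object from `source` and deliver it to `target`."""
    world.go_to(source)
    if not world.pick(obj):
        return False
    world.go_to(target)
    return world.place()


def conditional_fetch(world: SimWorld, user_room: str) -> bool:
    """Conditional task: the object to fetch depends on an observation."""
    world.go_to("kitchen")
    if world.is_present("spoon"):
        return fetch(world, "pen", "study", user_room)
    return fetch(world, "comb", "bedroom", user_room)


def move_all(world: SimWorld, obj: str, source: str, target: str) -> int:
    """Loop task: repeat fetch-and-deliver until no `obj` remains in `source`."""
    moved = 0
    world.go_to(source)
    while world.is_present(obj):
        world.pick(obj)
        world.go_to(target)
        world.place()
        moved += 1
        world.go_to(source)
    return moved


def make_world() -> SimWorld:
    """Assumed example layout; not taken from the paper."""
    return SimWorld(rooms={
        "kitchen": ["spoon", "orange", "orange"],
        "study": ["pen"],
        "bedroom": ["comb"],
        "living room": [],
    })
```

In a setup like this, the LLM's job would be to produce the sequence of function calls (or a program like `conditional_fetch`) from the user's command, and task completion could then be checked by comparing the final world state against the expected one.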