NARRATE: Versatile Language Architecture for Optimal Control in Robotics

Seif Ismail,Antonio Arbues,Ryan Cotterell,René Zurbrügg,Carmen Amo Alonso

2024-03-16

Abstract:The impressive capabilities of Large Language Models (LLMs) have led to various efforts to enable robots to be controlled through natural language instructions, opening exciting possibilities for human-robot interaction The goal is for the motor-control task to be performed accurately, efficiently and safely while also enjoying the flexibility imparted by LLMs to specify and adjust the task through natural language. In this work, we demonstrate how a careful layering of an LLM in combination with a Model Predictive Control (MPC) formulation allows for accurate and flexible robotic control via natural language while taking into consideration safety constraints. In particular, we rely on the LLM to effectively frame constraints and objective functions as mathematical expressions, which are later used in the motor-control module via MPC. The transparency of the optimization formulation allows for interpretability of the task and enables adjustments through human feedback. We demonstrate the validity of our method through extensive experiments on long-horizon reasoning, contact-rich, and multi-object interaction tasks. Our evaluations show that NARRATE outperforms current existing methods on these benchmarks and effectively transfers to the real world on two different embodiments. Videos, Code and Prompts at

Robotics

What problem does this paper attempt to address?

The problem this paper attempts to address is how to efficiently, accurately, and safely control robots to perform complex operational tasks through natural language instructions. Specifically, the researchers aim to develop a system architecture that can translate high-level natural language instructions into precise low-level actions, enabling robots to understand human language descriptions and autonomously complete tasks based on these descriptions, while ensuring the safety and accuracy of the tasks. The challenge lies in combining the flexibility of natural language with the precise control of robots, especially when dealing with complex tasks such as long-term planning and multi-object interactions. The method proposed in the paper is called NARRATE (Natural Language Robot Control and Adaptive Task Execution Architecture), which achieves this goal by combining large language models (LLMs) with model predictive control (MPC)-based optimization controllers. The core contribution of NARRATE is the use of LLMs to generate free-form mathematical expressions that can be directly used in MPC to solve manipulation tasks specified by natural language. This approach not only overcomes the limitations of using predefined primitives but also does not require specific robot control data, thus ensuring task accuracy and safety while maintaining flexibility. Additionally, the design of NARRATE considers system interpretability and human-robot interaction, allowing task adjustments through natural language feedback, thereby improving system adaptability and performance. The researchers validated the effectiveness of NARRATE through a series of experiments, including long-term reasoning tasks in simulated environments, contact-rich tasks, and multi-robot collaboration tasks, demonstrating its potential for real-world applications.

NARRATE: Versatile Language Architecture for Optimal Control in Robotics

Language to Rewards for Robotic Skill Synthesis

Hey Robot! Personalizing Robot Navigation through Model Predictive Control with a Large Language Model

Language, Camera, Autonomy! Prompt-engineered Robot Control for Rapidly Evolving Deployment

One to rule them all: natural language to bind communication, perception and action

MHRC: Closed-loop Decentralized Multi-Heterogeneous Robot Collaboration with Large Language Models

Verifiably Following Complex Robot Instructions with Foundation Models

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

REAL: Resilience and Adaptation using Large Language Models on Autonomous Aerial Robots

A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs

Large Language Models for Orchestrating Bimanual Robots

Leveraging Large Language Models for Comprehensive Locomotion Control in Humanoid Robots Design

Natural Language as Policies: Reasoning for Coordinate-Level Embodied Control with LLMs

Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

Grounding Language Models in Autonomous Loco-manipulation Tasks

A Smart Interactive Camera Robot Based on Large Language Models

LLM-Based Human-Robot Collaboration Framework for Manipulation Tasks

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Action Contextualization: Adaptive Task Planning and Action Tuning using Large Language Models

Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models