RobotGPT: Robot Manipulation Learning from ChatGPT

Yixiang Jin,Dingzhe Li,Yong A,Jun Shi,Peng Hao,Fuchun Sun,Jianwei Zhang,Bin Fang

2023-12-03

Abstract:We present RobotGPT, an innovative decision framework for robotic manipulation that prioritizes stability and safety. The execution code generated by ChatGPT cannot guarantee the stability and safety of the system. ChatGPT may provide different answers for the same task, leading to unpredictability. This instability prevents the direct integration of ChatGPT into the robot manipulation loop. Although setting the temperature to 0 can generate more consistent outputs, it may cause ChatGPT to lose diversity and creativity. Our objective is to leverage ChatGPT's problem-solving capabilities in robot manipulation and train a reliable agent. The framework includes an effective prompt structure and a robust learning model. Additionally, we introduce a metric for measuring task difficulty to evaluate ChatGPT's performance in robot manipulation. Furthermore, we evaluate RobotGPT in both simulation and real-world environments. Compared to directly using ChatGPT to generate code, our framework significantly improves task success rates, with an average increase from 38.5% to 91.5%. Therefore, training a RobotGPT by utilizing ChatGPT as an expert is a more stable approach compared to directly using ChatGPT as a task planner.

Robotics

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? This paper aims to address the following issues: 1. **Improving the stability and safety of robot operations**: - Directly using code generated by ChatGPT cannot guarantee the stability and safety of robot operations. Due to the instability of ChatGPT's output, directly integrating it into the robot control loop poses risks. 2. **Utilizing ChatGPT's capabilities for robot task planning**: - The paper proposes a framework (RobotGPT) that leverages ChatGPT's problem-solving abilities to train a reliable robot agent. This framework includes effective prompt structures and robust learning models. 3. **Evaluating ChatGPT's performance in robot operations**: - A method for measuring task difficulty is proposed, and ChatGPT's performance in different tasks is evaluated through simulations and real-world experiments. 4. **Developing a stable robot learning framework**: - By improving the code generated by ChatGPT through a self-correction mechanism, the framework ensures that the code can be successfully executed. This not only increases the task success rate but also demonstrates feasibility in practical applications. 5. **Validating the advantages of LLM-driven robots**: - Through comparative experiments (A/B testing), the superiority of LLM-based robots (such as RobotGPT) in complex tasks is demonstrated, particularly showing a stronger knowledge base advantage in open tasks (such as tidying up a room, word games). Overall, this paper significantly improves the stability and success rate of robot operations by developing the RobotGPT framework and demonstrates the potential of LLM in robot task planning.

RobotGPT: Robot Manipulation Learning from ChatGPT

KGGPT: Empowering Robots with OpenAI's ChatGPT and Knowledge Graph.

ChatGPT for Robotics: Design Principles and Model Abilities

Mani-GPT: A Generative Model for Interactive Robotic Manipulation

ChatGPT Empowered Long-Step Robot Control in Various Environments: A Case Application

Improved Trust in Human-Robot Collaboration with ChatGPT

Robot-Enabled Construction Assembly with Automated Sequence Planning based on ChatGPT: RoboGPT

GPT Models Meet Robotic Applications: Co-Speech Gesturing Chat System

Toward Automated Programming for Robotic Assembly Using ChatGPT

InterAct: Exploring the Potentials of ChatGPT as a Cooperative Agent

Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V

AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation

SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer

RoboGPT: an intelligent agent of making embodied long-term decisions for daily instruction tasks

Next‐generation human‐robot interaction with ChatGPT and robot operating system

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective

A Closer Look at Different Difficulty Levels Code Generation Abilities of ChatGPT.

Extending the Frontier of ChatGPT: Code Generation and Debugging

ChatGPT: A Study on its Utility for Ubiquitous Software Engineering Tasks

Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving