RobotGPT: Robot Manipulation Learning from ChatGPT

Yixiang Jin,Dingzhe Li,Yong A,Jun Shi,Peng Hao,Fuchun Sun,Jianwei Zhang,Bin Fang
2023-12-03
Abstract:We present RobotGPT, an innovative decision framework for robotic manipulation that prioritizes stability and safety. The execution code generated by ChatGPT cannot guarantee the stability and safety of the system. ChatGPT may provide different answers for the same task, leading to unpredictability. This instability prevents the direct integration of ChatGPT into the robot manipulation loop. Although setting the temperature to 0 can generate more consistent outputs, it may cause ChatGPT to lose diversity and creativity. Our objective is to leverage ChatGPT's problem-solving capabilities in robot manipulation and train a reliable agent. The framework includes an effective prompt structure and a robust learning model. Additionally, we introduce a metric for measuring task difficulty to evaluate ChatGPT's performance in robot manipulation. Furthermore, we evaluate RobotGPT in both simulation and real-world environments. Compared to directly using ChatGPT to generate code, our framework significantly improves task success rates, with an average increase from 38.5% to 91.5%. Therefore, training a RobotGPT by utilizing ChatGPT as an expert is a more stable approach compared to directly using ChatGPT as a task planner.
Robotics
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to address the following issues: 1. **Improving the stability and safety of robot operations**: - Directly using code generated by ChatGPT cannot guarantee the stability and safety of robot operations. Due to the instability of ChatGPT's output, directly integrating it into the robot control loop poses risks. 2. **Utilizing ChatGPT's capabilities for robot task planning**: - The paper proposes a framework (RobotGPT) that leverages ChatGPT's problem-solving abilities to train a reliable robot agent. This framework includes effective prompt structures and robust learning models. 3. **Evaluating ChatGPT's performance in robot operations**: - A method for measuring task difficulty is proposed, and ChatGPT's performance in different tasks is evaluated through simulations and real-world experiments. 4. **Developing a stable robot learning framework**: - By improving the code generated by ChatGPT through a self-correction mechanism, the framework ensures that the code can be successfully executed. This not only increases the task success rate but also demonstrates feasibility in practical applications. 5. **Validating the advantages of LLM-driven robots**: - Through comparative experiments (A/B testing), the superiority of LLM-based robots (such as RobotGPT) in complex tasks is demonstrated, particularly showing a stronger knowledge base advantage in open tasks (such as tidying up a room, word games). Overall, this paper significantly improves the stability and success rate of robot operations by developing the RobotGPT framework and demonstrates the potential of LLM in robot task planning.