DrEureka: Language Model Guided Sim-To-Real Transfer

Yecheng Jason Ma,William Liang,Hung-Ju Wang,Sam Wang,Yuke Zhu,Linxi Fan,Osbert Bastani,Dinesh Jayaraman
2024-06-04
Abstract:Transferring policies learned in simulation to the real world is a promising strategy for acquiring robot skills at scale. However, sim-to-real approaches typically rely on manual design and tuning of the task reward function as well as the simulation physics parameters, rendering the process slow and human-labor intensive. In this paper, we investigate using Large Language Models (LLMs) to automate and accelerate sim-to-real design. Our LLM-guided sim-to-real approach, DrEureka, requires only the physics simulation for the target task and automatically constructs suitable reward functions and domain randomization distributions to support real-world transfer. We first demonstrate that our approach can discover sim-to-real configurations that are competitive with existing human-designed ones on quadruped locomotion and dexterous manipulation tasks. Then, we showcase that our approach is capable of solving novel robot tasks, such as quadruped balancing and walking atop a yoga ball, without iterative manual design.
Robotics,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The main goal of this paper is to address the problem of sim-to-real robot skill transfer. Specifically, existing sim-to-real methods typically rely on manually designing and tuning task reward functions and simulation physical parameters, making the entire process both time-consuming and labor-intensive. Therefore, the paper proposes a new algorithm called DrEureka, which leverages large-scale language models (LLMs) to automate and accelerate the sim-to-real design process. The main contributions of the DrEureka algorithm include: 1. **Automated Reward Design and Domain Randomization Configuration**: By combining the capabilities of large language models, DrEureka can automatically generate appropriate reward functions and domain randomization distributions based on the physical simulation of the given target task, thereby supporting skill transfer to the real world. 2. **Experimental Validation**: Extensive real-world validation was conducted on quadrupedal walking and dexterous manipulation tasks, demonstrating the effectiveness and generality of the method. 3. **New Task Application Demonstration**: Showcased how DrEureka can accelerate the sim-to-real process for new tasks (such as walking on a yoga ball) without pre-existing sim-to-real configurations. Through these contributions, DrEureka aims to reduce the need for human intervention and improve the efficiency and effectiveness of the sim-to-real process.