Learning dexterous in-hand manipulation
OpenAI: Marcin Andrychowicz,Bowen Baker,Maciek Chociej,Rafal Józefowicz,Bob McGrew,Jakub Pachocki,Arthur Petron,Matthias Plappert,Glenn Powell,Alex Ray,Jonas Schneider,Szymon Sidor,Josh Tobin,Peter Welinder,Lilian Weng,Wojciech Zaremba
DOI: https://doi.org/10.1177/0278364919887447
2019-11-18
The International Journal of Robotics Research
Abstract:We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies that can perform vision-based object reorientation on a physical Shadow Dexterous Hand. The training is performed in a simulated environment in which we randomize many of the physical properties of the system such as friction coefficients and an object’s appearance. Our policies transfer to the physical robot despite being trained entirely in simulation. Our method does not rely on any human demonstrations, but many behaviors found in human manipulation emerge naturally, including finger gaiting, multi-finger coordination, and the controlled use of gravity. Our results were obtained using the same distributed RL system that was used to train OpenAI Five. We also include a video of our results: https://youtu.be/jwSbzNHGflM .
robotics