GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks

Kaylee Burns,Ajinkya Jain,Keegan Go,Fei Xia,Michael Stark,Stefan Schaal,Karol Hausman
2024-04-10
Abstract:Large Language Models (LLMs) have been successful at generating robot policy code, but so far these results have been limited to high-level tasks that do not require precise movement. It is an open question how well such approaches work for tasks that require reasoning over contact forces and working within tight success tolerances. We find that, with the right action space, LLMs are capable of successfully generating policies for a variety of contact-rich and high-precision manipulation tasks, even under noisy conditions, such as perceptual errors or grasping inaccuracies. Specifically, we reparameterize the action space to include compliance with constraints on the interaction forces and stiffnesses involved in reaching a target pose. We validate this approach on subtasks derived from the Functional Manipulation Benchmark (FMB) and NIST Task Board Benchmarks. Exposing this action space alongside methods for estimating object poses improves policy generation with an LLM by greater than 3x and 4x when compared to non-compliant action spaces
Robotics,Artificial Intelligence
What problem does this paper attempt to address?
The paper attempts to address the problem of how to utilize large language models (LLMs) to generate effective robot policy code when robots perform high-precision, contact-rich manipulation tasks. Currently, large language models have achieved success in generating robot policy code for high-level tasks such as navigation and open-vocabulary pick-and-place, but challenges remain for tasks requiring precise motion and contact force control. Therefore, this paper proposes a new method called GenCHiP, which re-parameterizes the action space to include constraints on interaction forces and stiffness, enabling large language models to generate policy code suitable for various high-precision and contact-rich manipulation tasks, even under noisy conditions such as perception errors or inaccurate grasping. The researchers validated the effectiveness of this method through a series of experiments and demonstrated its superior performance on the Functional Manipulation Benchmark (FMB) and the National Institute of Standards and Technology (NIST) task board benchmark.