Imitation learning for sim-to-real transfer of robotic cutting policies based on residual Gaussian process disturbance force model

Jamie Hathaway,Rustam Stolkin,Alireza Rastegarpanah
2024-09-06
Abstract:Robotic cutting, or milling, plays a significant role in applications such as disassembly, decommissioning, and demolition. Planning and control of cutting in real-world scenarios in uncertain environments is a complex task, with the potential to benefit from simulated training environments. This letter focuses on sim-to-real transfer for robotic cutting policies, addressing the need for effective policy transfer from simulation to practical implementation. We extend our previous domain generalisation approach to learning cutting tasks based on a mechanistic model-based simulation framework, by proposing a hybrid approach for sim-to-real transfer based on a milling process force model and residual Gaussian process (GP) force model, learned from either single or multiple real-world cutting force examples. We demonstrate successful sim-to-real transfer of a robotic cutting policy without the need for fine-tuning on the real robot setup. The proposed approach autonomously adapts to materials with differing structural and mechanical properties. Furthermore, we demonstrate the proposed method outperforms fine-tuning or re-training alone.
Robotics
What problem does this paper attempt to address?
The paper aims to address the issue of sim-to-real adaptation in robotic cutting tasks. Specifically, the researchers propose a novel approach that enhances the transferability of strategies from simulated environments to real-world scenarios by modeling the disturbance forces encountered during the cutting process using Gaussian Process (GP) regression. This method leverages a limited amount of real-world experimental data to capture the residual dynamic processes, enabling the algorithm to effectively adapt to different materials without the need for fine-tuning on the physical robot. The core of the research lies in using imitation learning to align the action distributions in both simulated and real environments by pairing the expert operations in the simulated environment with the observations corrected by GP. This promotes robust strategy transfer. The effectiveness of this method was validated through actual cutting experiments, and the results showed that this approach not only outperformed retraining methods but also provided effects similar to direct fine-tuning. Notably, the strategies transferred using this method exhibited stronger noise resistance and robustness to disturbances while maintaining consistency with the source domain expert behavior.