Abstract: Reinforcement Learning has shown great potential for robotics. It has demonstrated the capability to solve complex manipulation and locomotion tasks, even by learning end-to-end policies that operate directly on visual input, removing the need for custom perception systems. However, for practical robotics applications, its poor sample efficiency and its need for large amounts of resources, data, and computation time can be an insurmountable obstacle. One potential solution to this sample efficiency issue is the use of simulated environments. However, the discrepancy in visual and physical characteristics between reality and simulation, known as the sim-to-real gap, often significantly reduces the real-world performance of policies trained within a simulator. In this work we propose a sim-to-real technique that trains a Soft Actor-Critic agent together with a decoupled feature extractor and a latent-space dynamics model. The decoupled nature of the method allows the sim-to-real transfer of the feature extractor and of the control policy to be performed independently, and the dynamics model acts as a constraint on the latent representation when finetuning the feature extractor on real-world data. We show how this architecture allows a trained agent to be transferred from simulation to reality without retraining or finetuning the control policy, using real-world data only to adapt the feature extractor. By not training the control policy in the real domain, we avoid the need to apply Reinforcement Learning to real-world data. Instead, we focus only on the unsupervised training of the feature extractor, which considerably reduces real-world experience collection requirements. We evaluate the method on sim-to-sim and sim-to-real transfer of a policy for table-top robotic object pushing. We demonstrate that the method can adapt to considerable variations in the task observations, such as changes in point of view, colors, and lighting, while substantially reducing training time compared with policies trained directly in the real world.
