Abstract: A novel, efficient reinforcement learning paradigm that combines human knowledge with model-based and model-free methods is presented for optimal robot manipulation control in complex multi-phase manipulation tasks, e.g., tight-fit peg-in-hole tasks and nut-and-bolt assembly. First, human demonstrations are conducted to collect data during successful robot manipulation, and a manipulation phase estimation method that integrates human knowledge is presented to obtain the higher-level plan of the multi-phase task. Typical robot manipulation tasks can usually be decomposed into three types of phases: free motion, discontinuous contact, and continuous contact. For phases with free motion, a motion planning method is used to generate a smooth trajectory. For phases with discontinuous contact in the axes of interest during the pre-manipulation process, a rule-based model-free method, namely Policy Gradients with Human-Guided Parameter-based Exploration (PGHGPE), is used. For manipulation phases with continuous contact, a model-based method is used because of its higher sample efficiency. Finally, simulation and experimental studies verify the effectiveness of the presented algorithm.

Note to Practitioners: An important premise for future robot assistants is that robots should have some ability to learn complex manipulation skills. Complex manipulation tasks can be decomposed into multiple stages, and hierarchical reinforcement learning (HRL) is a suitable method for this kind of problem. However, HRL faces the challenge of low computational efficiency. To this end, efficient manipulation skill learning for complex manipulation tasks via human knowledge and model-based and model-free reinforcement learning methods is presented, which improves the efficiency of the skill learning process to a practical level.
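The abstract pairs each of the three phase types with a family of methods. That pairing can be illustrated with a minimal Python sketch. Only the phase names and the phase-to-method mapping come from the abstract. The force-threshold heuristic in `classify_phase`, its parameter names, and its default values are hypothetical stand-ins, not the paper's phase estimation method, which relies on human demonstrations and human knowledge.

```python
from enum import Enum
from typing import Sequence


class Phase(Enum):
    FREE_MOTION = "free motion"
    DISCONTINUOUS_CONTACT = "discontinuous contact"
    CONTINUOUS_CONTACT = "continuous contact"


# Phase-to-method mapping as stated in the abstract.
METHOD_FOR_PHASE = {
    Phase.FREE_MOTION: "motion planning",
    Phase.DISCONTINUOUS_CONTACT: "model-free RL (PGHGPE)",
    Phase.CONTINUOUS_CONTACT: "model-based RL",
}


def classify_phase(
    forces: Sequence[float],
    contact_threshold: float = 1.0,   # hypothetical force threshold
    continuous_ratio: float = 0.9,    # hypothetical fraction of samples in contact
) -> Phase:
    """Label a window of contact-force samples with a phase type.

    Illustrative heuristic only: a sample counts as "in contact" when its
    magnitude exceeds contact_threshold.
    """
    if not forces:
        raise ValueError("forces must be non-empty")
    in_contact = [abs(f) > contact_threshold for f in forces]
    ratio = sum(in_contact) / len(in_contact)
    if ratio == 0.0:
        return Phase.FREE_MOTION
    if ratio >= continuous_ratio:
        return Phase.CONTINUOUS_CONTACT
    return Phase.DISCONTINUOUS_CONTACT


def select_method(forces: Sequence[float]) -> str:
    """Return the method family the abstract assigns to the detected phase."""
    return METHOD_FOR_PHASE[classify_phase(forces)]
```

For example, `select_method([0.0, 5.0, 0.0, 4.0])` treats the window as intermittent contact and selects the model-free branch, while a window that stays above the threshold selects the model-based branch.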

Concept2Robot: Learning Manipulation Concepts from Instructions and Human Demonstrations

Learning Robot Manipulation Skills from Human Demonstration Videos Using Two-Stream 2-D/3-D Residual Networks with Self-Attention

Watch and Act: Learning Robotic Manipulation from Visual Demonstration

Efficient Robot Skill Learning with Imitation from a Single Video for Contact-Rich Fabric Manipulation

Robust and High-Precision End-to-End Control Policy for Multi-stage Manipulation Task with Behavioral Cloning

Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

Vision-based Robot Manipulation Learning via Human Demonstrations

Giving Robots a Hand: Learning Generalizable Manipulation with Eye-in-Hand Human Video Demonstrations

Learning Multi-Step Manipulation Tasks from A Single Human Demonstration

Learning to Design and Use Tools for Robotic Manipulation

Language-Conditioned Imitation Learning for Robot Manipulation Tasks

One-Shot Imitation Learning with Invariance Matching for Robotic Manipulation

Learning to combine primitive skills: A step towards versatile robotic manipulation

Efficient Reinforcement Learning Method for Multi-Phase Robot Manipulation Skill Acquisition Via Human Knowledge, Model-Based, and Model-Free Methods

Cooperative Manipulation for a Mobile Dual-Arm Robot Using Sequences of Dynamic Movement Primitives

Unified Learning from Demonstrations, Corrections, and Preferences during Physical Human-Robot Interaction

Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained Representations

Object-Centric Dexterous Manipulation from Human Motion Data

Learning Robotic Manipulation through Visual Planning and Acting

From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation From Single-Camera Teleoperation