Abstract: Reinforcement learning shows significant potential for automatically building control policies in numerous domains, but it is inefficient when applied to robot manipulation tasks because of the curse of dimensionality. Prior knowledge or heuristics that incorporate inherent simplifications can effectively improve learning performance on such tasks. This paper aims to define and incorporate the natural symmetry present in physical robotic environments. Sample-efficient policies are then trained by exploiting expert demonstrations in symmetrical environments through a combination of reinforcement learning and behavior cloning, which gives the off-policy learning process a diverse yet compact initialization. Furthermore, the paper presents a rigorous framework for a recent concept and explores its scope for robot manipulation tasks. The proposed method is validated in a simulation study on two point-to-point reaching tasks of an industrial arm, with and without an obstacle. Demonstrations are generated by a PID controller that tracks linear joint-space trajectories, with hard-coded temporal logic producing interim midpoints. The results show the effect of the number of demonstrations and quantify the magnitude of behavior cloning, illustrating the possible improvement of model-free reinforcement learning in common manipulation tasks. A comparison between the proposed method and a traditional off-policy reinforcement learning algorithm indicates its advantage in learning performance and its potential value for applications.
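
As a rough illustration of how demonstrations might be reused across symmetrical environments, the sketch below mirrors a demonstration across a plane through the origin. It is not the paper's implementation: it assumes states are Cartesian end-effector positions and actions are Cartesian displacements, and the names `reflect_demo` and `augment` are hypothetical.

```python
import numpy as np

def reflect_demo(states, actions, normal):
    """Mirror one demonstration across the plane through the origin with the given normal.

    Assumption (not from the paper): states and actions are row vectors
    expressed in the same Cartesian frame.
    """
    n = np.asarray(normal, dtype=float)
    n = n / np.linalg.norm(n)
    # Householder reflection matrix: H = I - 2 n n^T
    H = np.eye(n.size) - 2.0 * np.outer(n, n)
    return np.asarray(states, dtype=float) @ H.T, np.asarray(actions, dtype=float) @ H.T

def augment(demos, normal):
    """Return the original demonstrations followed by their mirrored copies."""
    out = list(demos)
    for states, actions in demos:
        out.append(reflect_demo(states, actions, normal))
    return out

# Toy demonstration: two end-effector positions and the displacement taken at each.
states = np.array([[0.4, 0.1, 0.3], [0.4, 0.2, 0.3]])
actions = np.array([[0.0, 0.1, 0.0], [0.0, 0.1, 0.0]])

# Mirror across the x-z plane (normal along y), doubling the demonstration set.
demos = augment([(states, actions)], normal=[0.0, 1.0, 0.0])
```

In an off-policy setting, the augmented set could seed the replay buffer before training starts. A joint-space formulation would need a robot-specific mapping in place of the Cartesian reflection used here.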

Discovering Synergies for Robot Manipulation with Multi-Task Reinforcement Learning

Learning Robot Manipulation Skills from Human Demonstration Videos Using Two-Stream 2-D/3-D Residual Networks with Self-Attention

Extracting bimanual synergies with reinforcement learning

Leveraging the Efficiency of Multi-Task Robot Manipulation Via Task-Evoked Planner and Reinforcement Learning

Exploiting Symmetry and Heuristic Demonstrations in Off-policy Reinforcement Learning for Robotic Manipulation

Acquisition of synergy for low-dimensional control of multi-fingered hands by reinforcement learning

Enabling Multi-Robot Collaboration from Single-Human Guidance

Disentangled Attention As Intrinsic Regularization for Bimanual Multi-Object Manipulation

Object Manipulation with an Anthropomorphic Robotic Hand via Deep Reinforcement Learning with a Synergy Space of Natural Hand Poses

Leveraging Kernelized Synergies on Shared Subspace for Precision Grasp and Dexterous Manipulation

Multi-target Approaching Control of Hyper-redundant Manipulators Using Reinforcement Learning

A Dual-Arm Collaborative Framework for Dexterous Manipulation in Unstructured Environments with Contrastive Planning

Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation

Diffusion Co-Policy for Synergistic Human-Robot Collaborative Tasks

Enhancing Dexterity in Robotic Manipulation via Hierarchical Contact Exploration

Sampling-based Exploration for Reinforcement Learning of Dexterous Manipulation

Learning Dual-Arm Push and Grasp Synergy in Dense Clutter

A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation

DAIR: Disentangled Attention Intrinsic Regularization for Safe and Efficient Bimanual Manipulation

Enhancing Task Performance of Learned Simplified Models via Reinforcement Learning