Abstract:Closed-chain manipulation occurs when several robot arms perform tasks in cooperation. It is complex to control a dual-arm system because it requires flexible and adaptable operation ability to realize closed-chain manipulation. In this study, a deep reinforcement learning (DRL) framework based on actor-critic algorithm is proposed to drive the closed-chain manipulation of a dual-arm robotic system. The proposed framework is designed to train dual robot arms to transport a large object cooperatively. In order to sustain strict constraints of closed-chain manipulation, the actor part of the proposed framework is designed in a leader-follower mode. The leader part consists of a policy trained from the DRL algorithm and works on the leader arm. The follower part consists of an inverse kinematics solver based on Damped Least Squares (DLS) and works on the follower arm. Two experiments are designed to prove the task adaptability, one of which is manipulating an object to a random pose within a defined range, the other is manipulating a delicate structural object within a narrow space Note to Practitioners—In common industrial manipulation scenarios, there are requirements to employ robotic arms to transport a large object relative to the robotic arm, e.g., moving a payload onto a loader and assembling big craft parts. It is a cost-effective way to use a dual-arm system to extend the loading capacity of robotic arms while preserving the flexibility of manipulation. Moreover, the dual-arm system is expected to manipulate different objects without complicated reprogramming, especially in small batch production scenarios. This study proposes a task-adaptive deep reinforcement learning framework for dual-arm robot manipulation. The task adaptability includes two specific aspects, one being adaptability in targeting the pose, such as manipulating an object to a random pose within a specified range. The other is the adaptivity on the task prerequisites such as manipulating a delicate structural object within a narrow space. For future research, the dual-arm system may autonomously plan the grab positions, and additional investigations should address more common scenarios involving various object shapes.

A hierarchical deep reinforcement learning algorithm for typing with a dual-arm humanoid robot

Ensemble Bootstrapped Deep Deterministic Policy Gradient For Vision-Based Robotic Grasping

Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

A Collaborative Control Method of Dual-Arm Robots Based on Deep Reinforcement Learning

A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control

ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning

Robot Control in Human Environment Using Deep Reinforcement Learning and Convolutional Neural Network.

A Hierarchical Reinforcement Learning Approach to Control Legged Mobile Manipulators

Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning

A Modified Convergence DDPG Algorithm for Robotic Manipulation

Reinforcement learning of dual-arm cooperation for a mobile manipulator with sequences of dynamical movement primitives

Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks With Base Controllers

A Task-Adaptive Deep Reinforcement Learning Framework for Dual-Arm Robot Manipulation

A High-Efficient Reinforcement Learning Approach for Dexterous Manipulation

Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost

A Deep Reinforcement Learning Approach for Dynamically Stable Inverse Kinematics of Humanoid Robots

Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction

DA-VIL: Adaptive Dual-Arm Manipulation with Reinforcement Learning and Variable Impedance Control

HDPG: hyperdimensional policy-based reinforcement learning for continuous control

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation