Abstract: Mobile Manipulation (MM) systems are ideal candidates for taking up the role of a personal assistant in unstructured real-world environments. Among other challenges, MM requires effective coordination of the robot's embodiments for executing tasks that require both mobility and manipulation. Reinforcement Learning (RL) holds the promise of endowing robots with adaptive behaviors, but most methods require prohibitively large amounts of data for learning a useful control policy. In this work, we study the integration of robotic reachability priors in actor-critic RL methods for accelerating the learning of MM for reaching and fetching tasks. Namely, we consider the problem of optimal base placement and the subsequent decision of whether to activate the arm for reaching a 6D target. For this, we devise a novel Hybrid RL method that handles discrete and continuous actions jointly, resorting to the Gumbel-Softmax reparameterization. Next, we train a reachability prior using data from the operational robot workspace, inspired by classical methods. Subsequently, we derive Boosted Hybrid RL (BHyRL), a novel algorithm for learning Q-functions by modeling them as a sum of residual approximators. Every time a new task needs to be learned, we can transfer our learned residuals and learn only the task-specific component of the Q-function, thereby maintaining the task structure from prior behaviors. Moreover, we find that regularizing the target policy with a prior policy yields more expressive behaviors. We evaluate our method in simulation in reaching and fetching tasks of increasing difficulty, and we show the superior performance of BHyRL against baseline methods. Finally, we zero-shot transfer our 6D fetching policy learned with BHyRL to our MM robot TIAGo++. For more details and code release, please refer to our project site: http://irosalab.com/rlmmbp
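
The two mechanisms named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `gumbel_softmax_sample` and `boosted_q`, the temperature value, and the toy residuals are all hypothetical, chosen only to show (a) how a Gumbel-Softmax relaxation turns a discrete choice, such as whether to activate the arm, into a smooth probability vector, and (b) how a Q-value can be formed as a sum of residual approximators, with earlier residuals reused and only the last one learned for a new task.

```python
import math
import random


def gumbel_softmax_sample(logits, temperature=1.0, rng=random):
    """Return a relaxed one-hot sample over len(logits) discrete choices."""
    perturbed = []
    for logit in logits:
        # Clamp the uniform draw away from 0 and 1 so both logs stay finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel_noise = -math.log(-math.log(u))
        perturbed.append((logit + gumbel_noise) / temperature)
    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    largest = max(perturbed)
    exps = [math.exp(p - largest) for p in perturbed]
    total = sum(exps)
    return [e / total for e in exps]


def boosted_q(residuals, state, action):
    """Q-value modeled as a sum of residual approximators (hypothetical form)."""
    return sum(residual(state, action) for residual in residuals)


# Toy residuals (assumptions for illustration only): a transferred prior
# and a task-specific term that would be the only part trained on a new task.
prior_residual = lambda state, action: 1.0
task_residual = lambda state, action: 0.5 * action

arm_activation_probs = gumbel_softmax_sample([2.0, -1.0], temperature=0.5)
q_value = boosted_q([prior_residual, task_residual], state=None, action=2.0)
```

In this sketch a lower temperature pushes the sample closer to a hard one-hot decision, while a higher one keeps it smoother; in a learning setting the same computation would be written in an automatic-differentiation framework so that gradients can flow through the discrete choice.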

Stage-Wise Learning of Reaching Using Little Prior Knowledge

A learning framework for semantic reach-to-grasp tasks integrating machine learning and optimization

Vision-Based Robotic Object Grasping—A Deep Reinforcement Learning Approach

Learning Efficient Robot Arm Reaching

Developing Robot Reaching Skill Via Look-ahead Planning

Robot Learning of Mobile Manipulation with Reachability Behavior Priors

Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

Learn to grasp unknown objects in robotic manipulation

Automatic Acquisition of a Repertoire of Diverse Grasping Trajectories through Behavior Shaping and Novelty Search

Developing Robot Reaching Skill with Relative-Location Based Approximating

Guiding real-world reinforcement learning for in-contact manipulation tasks with Shared Control Templates

Learning Robotic Manipulation through Visual Planning and Acting

Note: Rapid Genotyping of Human ERCC1 Exon 4 Polymorphism with Fluorescence Analysis Using Fluorophore-Labeled Hybridization Probes and a LightCycler

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks With Base Controllers

Learning to Scaffold the Development of Robotic Manipulation Skills

Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

How Does a Robot Develop Its Reaching Ability Like Human Infants Do?

Exploiting Symmetry and Heuristic Demonstrations in Off-policy Reinforcement Learning for Robotic Manipulation

Learning Kinematic Feasibility for Mobile Manipulation through Deep Reinforcement Learning

Fully Autonomous Real-World Reinforcement Learning with Applications to Mobile Manipulation