A Hierarchical Reinforcement Learning Approach to Control Legged Mobile Manipulators

Abstract:: Recent years have seen a Cambrian explosion of robotic systems yield-1 ing ever more capable and affordable systems, with quadrupedal robotic platforms 2 emerging as a commercially-viable base to perform a wide variety of tasks across 3 uneven terrain. Augmenting these with a robotic arm allows the possibility of even 4 more complex interactions. At the same time, there has been a growing body of 5 research into using deep reinforcement learning (DRL) for embodied agent navi-6 gation and object manipulation, which promises a more sample-efficient, flexible, 7 and robust approach to learning such policies than existing classical methods. Re-8 cent works have shown a functional approach for learning a joint base and arm 9 policy with DRL but have not yet demonstrated how the result can be used in 10 downstream tasks. In this work, we investigate the problem of learning an ob-11 ject manipulation and navigation policy for a quadrupedal robot with a mounted 12 robotic arm - specifically, we address the problem of fetching stationary and mov-13 ing objects autonomously (“playing fetch” with the robot dog). Our method con-14 sists of (a) a low-level policy that moves the base and arm and (b) a high-level 15 policy that generates the commands for the low-level policy. The low-level policy 16 is jointly learned for both the arm and the base which generates joint torques for 17 directional commands. The high-level policy is task-specific, translates the ball 18 position to directional commands for the low-level policy, and deals with accel-19 eration/deceleration and stability. We demonstrate that our high-level policy can 20 outperform a tuned Proportional-Derivative (PD) controller. 21
Engineering,Computer Science
What problem does this paper attempt to address?