Abstract:Recent advances of locomotion controllers utilizing deep reinforcement learning (RL) have yielded impressive results in terms of achieving rapid and robust locomotion across challenging terrain, such as rugged rocks, non-rigid ground, and slippery surfaces. However, while these controllers primarily address challenges underneath the robot, relatively little research has investigated legged mobility through confined 3D spaces, such as narrow tunnels or irregular voids, which impose all-around constraints. The cyclic gait patterns resulted from existing RL-based methods to learn parameterized locomotion skills characterized by motion parameters, such as velocity and body height, may not be adequate to navigate robots through challenging confined 3D spaces, requiring both agile 3D obstacle avoidance and robust legged locomotion. Instead, we propose to learn locomotion skills end-to-end from goal-oriented navigation in confined 3D spaces. To address the inefficiency of tracking distant navigation goals, we introduce a hierarchical locomotion controller that combines a classical planner tasked with planning waypoints to reach a faraway global goal location, and an RL-based policy trained to follow these waypoints by generating low-level motion commands. This approach allows the policy to explore its own locomotion skills within the entire solution space and facilitates smooth transitions between local goals, enabling long-term navigation towards distant goals. In simulation, our hierarchical approach succeeds at navigating through demanding confined 3D environments, outperforming both pure end-to-end learning approaches and parameterized locomotion skills. We further demonstrate the successful real-world deployment of our simulation-trained controller on a real robot.

Demonstrating A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning

Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning

Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models

Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning

DeepWalk: Omnidirectional Bipedal Gait by Deep Reinforcement Learning

Walking with Terrain Reconstruction: Learning to Traverse Risky Sparse Footholds

Learning Robust, Agile, Natural Legged Locomotion Skills in the Wild

Learning Bipedal Walking On Planned Footsteps For Humanoid Robots

Bipedal Walking Robot using Deep Deterministic Policy Gradient

Modelling Human Kinetics and Kinematics during Walking using Reinforcement Learning

Learning Bipedal Walking for Humanoids with Current Feedback

Latent Action Priors From a Single Gait Cycle Demonstration for Online Imitation Learning

Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning

Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion

Behavior evolution-inspired approach to walking gait reinforcement training for quadruped robots

A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots

Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning

Dynamic Bipedal Maneuvers through Sim-to-Real Reinforcement Learning

Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning

Reinforcement Learning With Evolutionary Trajectory Generator: A General Approach for Quadrupedal Locomotion