Abstract:Legged robots are better able to adapt to different terrains compared with wheeled robots. However, traditional motion controllers suffer from extremely complex dynamics properties. Reinforcement learning (RL) helps to overcome the complications of dynamics design and calculation. In addition, the high autonomy of the RL controller results in a more robust response to complex environments and terrains compared with traditional controllers. However, RL algorithms are limited by the problems of convergence and training efficiency due to the complexity of the task. Learn and outperform the reference motion (LORM), an RL based framework for gait controlling of biped robot is proposed leveraging the prior knowledge of reference motion. The proposed trained agent outperformed the reference motion and existing motion-based methods. The RL environment was finely crafted for optimal performance, including the pruning of state space and action space, reward shaping, and design of episode criterion. Several improvements were implemented to further improve the training efficiency and performance including: random state initialization (RSI), the noise of joint angles, and a novel improvement based on symmetrization of gait. To validate the proposed method, the Darwin-op robot was set as the target platform and two different tasks were designed: (I) Walking as fast as possible and (II) Tracking specific velocity. In task (I), the proposed method resulted in the walking velocity of 0.488 m/s, with a 5.8 times improvement compared with the original traditional reference controller. The directional accuracy improved by 87.3%. The velocity performance achieved 2× compared with the rated max velocity and more than 8× compared with other recent works. To our knowledge, our work achieved the best velocity performance on the platform Darwin-op. In task (II), the proposed method achieved a tracking accuracy of over 95%. Different environments are introduced including plains, slopes, uneven terrains, and walking with external force, where the robot was expected to maintain walking stability with ideal speed and little direction deviation, to validate the performance and robustness of the proposed method.

A Heuristics-Based Reinforcement Learning Method to Control Bipedal Robots

Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control

Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics

Agile and versatile bipedal robot tracking control through reinforcement learning

A Balance Control Method for Wheeled Bipedal Robot Based on Reinforcement Learning

RL + Model-based Control: Using On-demand Optimal Control to Learn Versatile Legged Locomotion

CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning.

Modeling and reinforcement learning-based locomotion control for a humanoid robot with kinematic loop closures

LORM: a Novel Reinforcement Learning Framework for Biped Gait Control

Structure Modeling and Iterative Learning Control Simulation of Biped Robot with Heterogeneous Legs

Learning Bipedal Walking On Planned Footsteps For Humanoid Robots

Learning Agile, Robust Locomotion Skills for Quadruped Robot.

Hybrid Autonomous Controller for Bipedal Robot Balance with Deep Reinforcement Learning and Pattern Generators

Research of gait planning and control for biped robot with heterogeneous legs

A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots

Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots

Robust Optimal Control of Point-Feet Biped Robots Using a Reinforcement Learning Approach

Fusing Dynamics and Reinforcement Learning for Control Strategy: Achieving Precise Gait and High Robustness in Humanoid Robot Locomotion*

Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning

A Hierarchical Framework for Quadruped Locomotion Based on Reinforcement Learning