Abstract:Reinforcement learning method is extremely competitive in gait generation techniques for quadrupedal robot, which is mainly due to the fact that stochastic exploration in reinforcement training is beneficial to achieve an autonomous gait. Nevertheless, although incremental reinforcement learning is employed to improve training success and movement smoothness by relying on the continuity inherent during limb movements, challenges remain in adapting gait policy to diverse terrain and external disturbance. Inspired by the association between reinforcement learning and the evolution of animal motion behavior, a self-improvement mechanism for reference gait is introduced in this paper to enable incremental learning of action and self-improvement of reference action together to imitate the evolution of animal motion behavior. Further, a new framework for reinforcement training of quadruped gait is proposed. In this framework, genetic algorithm is specifically adopted to perform global probabilistic search for the initial value of the arbitrary foot trajectory to update the reference trajectory with better fitness. Subsequently, the improved reference gait is used for incremental reinforcement learning of gait. The above process is repeatedly and alternatively executed to finally train the gait policy. The analysis considering terrain, model dimensions, and locomotion condition is presented in detail based on simulation, and the results show that the framework is significantly more adaptive to terrain compared to regular incremental reinforcement learning.

Heuristic Gait Learning of Quadruped Robot Based on Deep Deterministic Policy Gradient Algorithm

Quadruped Robot Locomotion in Unknown Terrain Using Deep Reinforcement Learning

Learning Gait-conditioned Bipedal Locomotion with Motor Adaptation

Gait Learning of Quadruped Robot Based on Deep Arbitration Strategy

A Hierarchical Framework for Quadruped Robots Gait Planning Based on DDPG

Agile Control for Quadruped Robot in Complex Environment Based on Deep Reinforcement Learning Method.

Bipedal Walking Robot using Deep Deterministic Policy Gradient

A parallel heterogeneous policy deep reinforcement learning algorithm for bipedal walking motion design

A Motion Planning and Control Method of Quadruped Robot Based on Deep Reinforcement Learning

Reinforcement Learning based Control of a Quadruped Robot

Energy Consumption Minimization of Quadruped Robot Based on Reinforcement Learning of DDPG Algorithm

Reward-Adaptive Reinforcement Learning: Dynamic Policy Gradient Optimization for Bipedal Locomotion

Learning the Quadruped Robot by Reinforcement Learning (RL)

Gait Learning Reproduction for Quadruped Robots Based on Experience Evolution Proximal Policy Optimization

Economical Quadrupedal Multi-Gait Locomotion via Gait-Heuristic Reinforcement Learning

Behavior evolution-inspired approach to walking gait reinforcement training for quadruped robots

A Novel Framework for Adaptive Quadruped Robot Locomotion Learning in Uncertain Environments.

Path Following for Autonomous Ground Vehicle Using DDPG Algorithm: A Reinforcement Learning Approach

CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning.

Hybrid and dynamic policy gradient optimization for bipedal robot locomotion

Stable Skill Improvement of Quadruped Robot Based on Privileged Information and Curriculum Guidance