Abstract:This paper proposes the implementation of fuzzy motion control based on reinforcement learning (RL) and Lagrange polynomial interpolation (LPI) for gait synthesis of biped robots. First, the procedure of a walking gait is redefined into three states, and the parameters of this designed walking gait are determined. Then, the machine learning approach applied to adjusting the walking parameters is policy gradient RL (PGRL), which can execute real-time performance and directly modify the policy without calculating the dynamic function. Given a parameterized walking motion designed for biped robots, the PGRL algorithm automatically searches the set of possible parameters and finds the fastest possible walking motion. The reward function mainly considered is first the walking speed, which can be estimated from the vision system. However, the experiment illustrates that there are some stability problems in this kind of learning process. To solve these problems, the desired zero moment point trajectory is added to the reward function. The results show that the robot not only has more stable walking but also increases its walking speed after learning. This is more effective and attractive than manual trial-and-error tuning. LPI, moreover, is employed to transform the existing motions to the motion which has a revised angle determined by the fuzzy motion controller. Then, the biped robot can continuously walk in any desired direction through this fuzzy motion control. Finally, the fuzzy-based gait synthesis control is demonstrated by tasks and point- and line-target tracking. The experiments show the feasibility and effectiveness of gait learning with PGRL and the practicability of the proposed fuzzy motion control scheme.

Bipedal Walking Robot using Deep Deterministic Policy Gradient

Learning Gait-conditioned Bipedal Locomotion with Motor Adaptation

DeepWalk: Omnidirectional Bipedal Gait by Deep Reinforcement Learning

Soft Soil Gait Planning and Control for Biped Robot using Deep Deterministic Policy Gradient Approach

A parallel heterogeneous policy deep reinforcement learning algorithm for bipedal walking motion design

Reinforcement Learning based Control of a Quadruped Robot

Heuristic Gait Learning of Quadruped Robot Based on Deep Deterministic Policy Gradient Algorithm

Gait Learning of Quadruped Robot Based on Deep Arbitration Strategy

Learning Bipedal Walking for Humanoids with Current Feedback

Quadruped Robot Locomotion in Unknown Terrain Using Deep Reinforcement Learning

Learning Bipedal Walking On Planned Footsteps For Humanoid Robots

Learning the Quadruped Robot by Reinforcement Learning (RL)

A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots

Biped Robots Control in Gusty Environments with Adaptive Exploration Based DDPG

A Biped Robot Learning to Walk like Human by Reinforcement Learning.

Learning Bipedal Walking for Humanoid Robots in Challenging Environments with Obstacle Avoidance

Walking motion generation, synthesis, and control for biped robot by using PGRL, LPI, and fuzzy logic

Robust biped locomotion using deep reinforcement learning on top of an analytical control approach

Learning Generic and Dynamic Locomotion of Humanoids Across Discrete Terrains

Teach Biped Robots to Walk via Gait Principles and Reinforcement Learning with Adversarial Critics

CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning.