Abstract: Because bipedal robots are nonlinear and underactuated, developing efficient jumping strategies for them remains challenging. To address this, a multiobjective collaborative deep reinforcement learning algorithm based on the actor-critic framework is presented. First, two deep deterministic policy gradient (DDPG) networks are established to train the jumping motion; each focuses on a different objective, and the two collaboratively learn the optimal jumping policy. Second, a recovery experience replay mechanism based on dynamic time warping is integrated into DDPG to improve sample utilization efficiency. Third, a timely adjustment unit is incorporated, which works in tandem with the training frequency to improve the convergence accuracy of the algorithm. In addition, a Markov decision process is designed to manage the complexity and parameter uncertainty in the dynamic model of the bipedal robot. Finally, the proposed method is validated on a PyBullet platform. The results show that the method outperforms baseline methods: it learns faster and enables robust jumps with greater height and distance.
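The abstract does not say how dynamic time warping (DTW) enters the recovery experience replay mechanism, so the following is only a minimal sketch of the textbook DTW distance itself, the kind of similarity score such a mechanism could build on. The function name `dtw_distance` and the sample trajectories are hypothetical and are not taken from the paper.

```python
def dtw_distance(a, b):
    """Textbook dynamic time warping distance between two 1-D sequences.

    Illustrative only: the paper's actual use of DTW inside its replay
    mechanism is not described in the abstract.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local pointwise distance
            # extend the cheapest of the three allowed alignment steps
            cost[i][j] = d + min(cost[i - 1][j],      # repeat b[j-1]
                                 cost[i][j - 1],      # repeat a[i-1]
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]


# Hypothetical height trajectories of two jump attempts (arbitrary units).
reference = [0.0, 0.2, 0.5, 0.2, 0.0]
slower = [0.0, 0.2, 0.2, 0.5, 0.5, 0.2, 0.0]  # same shape, stretched in time
print(dtw_distance(reference, slower))
```

Because DTW allows samples to be repeated during alignment, the time-stretched trajectory above aligns with the reference at zero cost, whereas a pointwise comparison could not even be applied to sequences of different lengths. This tolerance to timing differences is the usual reason to pick DTW for comparing motion trajectories.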

Ramp Jump Control of Single-track Two-wheeled Robot Using Reinforcement Learning with Demonstration Data

Continuous Reinforcement Learning Based Ramp Jump Control for Single-Track Two-Wheeled Robots

High Maneuverability Control of Single-track Two-wheeled Robot in Narrow Terrain Based on Reinforcement Learning

Reinforcement Learning-Based Control of Single-Track Two-Wheeled Robots in Narrow Terrain

A Deep Reinforcement Learning Control Method for a Four-Link Brachiation Robot

Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning

Stable Jumping Control Based on Deep Reinforcement Learning for a Locust-Inspired Robot

A Multiobjective Collaborative Deep Reinforcement Learning Algorithm for Jumping Optimization of Bipedal Robot

Deep Reinforcement Learning-Based Control of Bicycle Robots on Rough Terrain

Continuous Versatile Jumping Using Learned Action Residuals

Perception-Driven Learning of High-Dynamic Jumping Motions for Single-Legged Robots

An advanced reinforcement learning control method for quadruped robots in typical urban terrains

Agile Continuous Jumping in Discontinuous Terrains

A Motion Planning and Control Method of Quadruped Robot Based on Deep Reinforcement Learning

Research on Dynamic Path Planning of Wheeled Robot Based on Deep Reinforcement Learning on the Slope Ground

Motion Simulation of Flying Quadruped Robot Based on Deep Reinforcement Learning

Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain

Robust Quadruped Jumping via Deep Reinforcement Learning

Curriculum-Based Reinforcement Learning for Quadrupedal Jumping: A Reference-free Design

End-to-End Reinforcement Learning for Torque Based Variable Height Hopping

Agile and versatile bipedal robot tracking control through reinforcement learning