AllGaits: Learning All Quadruped Gaits and Transitions

Guillaume Bellegarda, Milad Shafiee, Auke Ijspeert
2024-11-07
Abstract: We present a framework for learning a single policy capable of producing all quadruped gaits and transitions. The framework consists of a policy trained with deep reinforcement learning (DRL) to modulate the parameters of a system of abstract oscillators (i.e. Central Pattern Generator), whose output is mapped to joint commands through a pattern formation layer that sets the gait style, i.e. body height, swing foot ground clearance height, and foot offset. Different gaits are formed by changing the coupling between different oscillators, which can be instantaneously selected at any velocity by a user. With this framework, we systematically investigate which gait should be used at which velocity, and when gait transitions should occur from a Cost of Transport (COT), i.e. energy-efficiency, point of view. Additionally, we note how gait style changes as a function of locomotion speed for each gait to keep the most energy-efficient locomotion. While the currently most popular gait (trot) does not result in the lowest COT, we find that considering different co-dependent metrics such as mean base velocity and joint acceleration results in different 'optimal' gaits than those that minimize COT. We deploy our controller in various hardware experiments, showing all 9 typical quadruped animal gaits, and demonstrate generalizability to unseen gaits during training, and robustness to leg failures. Video results can be found at https://youtu.be/OLoWSX_R868.
Robotics
What problem does this paper attempt to address?
This paper asks how a single policy can produce all quadruped gaits and the transitions between them, and which gaits and transition timings are most effective at different speeds. Specifically, it aims to answer three key questions:

1. **Which gaits are most efficient at which speeds, and when should gait transitions occur?**
2. **How should the gait style parameters (such as body height, posture, and swing-foot trajectory) change with speed to keep locomotion as energy-efficient as possible?**
3. **Can gaits not seen in training be generated, and how robust is the policy to leg failures?**

To answer these questions, the authors propose a hierarchical bio-inspired architecture that combines deep reinforcement learning (DRL) with a Central Pattern Generator (CPG), and controls the gait by adjusting the coupling matrix between oscillators and the pattern-formation parameters. The method can generate all common quadruped gaits and also determine the most energy-efficient gait-transition timings and styles.

### Main contributions:

- **No need for manual parameter tuning or expert demonstration**: Unlike previous work, the method achieves all gaits and their transitions without explicit reward-function parameter tuning or expert demonstrations.
- **Explore the efficiency of multiple gaits**: The energy consumption (Cost of Transport, COT) of different gaits is studied across speeds, alongside other indicators such as the average base angular velocity and joint acceleration.
- **Robustness and generalization ability**: The policy is shown to generate gaits not seen in training and to remain robust to leg failures.
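To make the role of the coupling matrix concrete, the following is a minimal sketch of gait selection in a system of four coupled phase oscillators. It is not the authors' implementation: the Kuramoto-style update rule, the leg ordering (FR, FL, RR, RL), the phase offsets, and all names (`GAIT_OFFSETS`, `coupling_matrix`, `step_phases`, `simulate`) are assumptions chosen for illustration, and the learned policy and pattern formation layer are omitted entirely.

```python
import numpy as np

# Assumed leg order: FR, FL, RR, RL. Offsets are fractions of one gait cycle.
# These are textbook idealizations, not values taken from the paper.
GAIT_OFFSETS = {
    "trot": np.array([0.0, 0.5, 0.5, 0.0]),   # diagonal legs in phase
    "pace": np.array([0.0, 0.5, 0.0, 0.5]),   # lateral legs in phase
    "bound": np.array([0.0, 0.0, 0.5, 0.5]),  # front pair vs. rear pair
}


def coupling_matrix(offsets):
    """Desired phase differences phi[i, j] = psi_j - psi_i, in radians."""
    phase = 2.0 * np.pi * offsets
    return phase[None, :] - phase[:, None]


def step_phases(theta, phi, freq_hz, weight, dt):
    """One explicit Euler step of Kuramoto-style coupled phase oscillators."""
    # diff[i, j] = theta_j - theta_i - phi[i, j]; it is zero once the gait is locked.
    diff = theta[None, :] - theta[:, None] - phi
    dtheta = 2.0 * np.pi * freq_hz + weight * np.sin(diff).sum(axis=1)
    return (theta + dt * dtheta) % (2.0 * np.pi)


def simulate(gait, steps=5000, dt=0.001, freq_hz=2.0, weight=5.0, seed=0):
    """Start from random phases and integrate until they lock into the gait."""
    rng = np.random.default_rng(seed)
    theta = rng.uniform(0.0, 2.0 * np.pi, size=4)
    phi = coupling_matrix(GAIT_OFFSETS[gait])
    for _ in range(steps):
        theta = step_phases(theta, phi, freq_hz, weight, dt)
    return theta
```

In this sketch, switching gaits amounts to swapping the matrix passed to `step_phases` while the phases keep evolving, which mirrors the idea that a user can select a different coupling at any time.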
### Experimental results:

- **The influence of gait and style parameters on COT**: Body height, swing-foot ground clearance, and hip-joint offset all significantly affect COT, and the best parameter combination differs between gaits and between speeds.
- **The most energy-efficient gaits**: Walk is the most energy-efficient gait in the speed range of 0.3 to 0.9 m/s, while Pace is the most energy-efficient in the speed range of 0.9 to 3.0 m/s.
- **Multi-objective optimization**: A gait that minimizes COT at a given speed is not necessarily the best choice once other factors, such as base stability and joint acceleration, are taken into account.

In conclusion, this research provides a single-policy approach to quadruped gait control and a systematic comparison of the strengths and weaknesses of different gaits across speeds, which can inform further optimization of quadruped locomotion performance.
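The COT-based comparison above can be sketched as follows. This assumes the common definition COT = P / (m g v), with mean power P, robot mass m, gravitational acceleration g, and forward speed v; the summary does not state the exact formula used in the paper. The function names and the table layout (`{speed: {gait: cot}}`) are hypothetical.

```python
G = 9.81  # gravitational acceleration in m/s^2


def cost_of_transport(mean_power_w, mass_kg, speed_mps):
    """Dimensionless COT = P / (m * g * v), assuming the common definition."""
    if speed_mps <= 0.0:
        raise ValueError("speed must be positive")
    return mean_power_w / (mass_kg * G * speed_mps)


def best_gait_per_speed(cot_table):
    """Map each speed to the gait with the lowest COT.

    cot_table has the hypothetical layout {speed_mps: {gait_name: cot}}.
    """
    return {
        speed: min(cots, key=cots.get)
        for speed, cots in sorted(cot_table.items())
    }


def transition_speeds(best):
    """List (speed, previous_gait, new_gait) wherever the min-COT gait changes."""
    transitions = []
    previous = None
    for speed, gait in sorted(best.items()):
        if previous is not None and gait != previous:
            transitions.append((speed, previous, gait))
        previous = gait
    return transitions
```

Given measured COT values per gait on a grid of speeds, `transition_speeds(best_gait_per_speed(table))` would report the sampled speeds at which the most energy-efficient gait changes. A multi-objective variant could replace the COT entries with a weighted sum that also penalizes base angular velocity or joint acceleration.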