Abstract: In recent years, reinforcement learning and imitation learning have shown great potential for controlling the motion of humanoid robots. However, these methods typically build simulation environments and rewards for specific tasks, so they require multiple policies and have limited capability for tackling complex and unknown tasks. To overcome these issues, we present a novel approach that combines adversarial imitation learning with large language models (LLMs). This method enables the agent to learn reusable skills with a single policy and to solve zero-shot tasks under the guidance of LLMs. In particular, we use the LLM as a strategic planner that applies previously learned skills to novel tasks by interpreting task-specific prompts, which allows the robot to perform the specified actions in sequence. To improve our model, we incorporate codebook-based vector quantization, allowing the agent to generate suitable actions in response to unseen textual commands from LLMs. Furthermore, we design general reward functions that account for the distinct motion features of humanoid robots, ensuring that the agent imitates the motion data while remaining goal-oriented, without additional direction-guiding approaches or policies. To the best of our knowledge, this is the first framework that controls humanoid robots using a single learning-based policy network with an LLM as a planner. Extensive experiments demonstrate that our method is efficient and adaptive in complicated motion tasks.
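
To make the codebook-based vector quantization mentioned in the abstract concrete, here is a minimal sketch of the general technique, not the authors' implementation. It assumes a small fixed codebook and Euclidean nearest-neighbor lookup. The names `quantize` and `CODEBOOK` and the 2-D vectors are hypothetical and chosen only for illustration. In a learned system the codebook entries would be trained, and the input would be an embedding of a command or skill.

```python
import math

def quantize(embedding, codebook):
    """Return (index, code vector) of the codebook entry nearest to `embedding`.

    Nearest is measured by Euclidean distance (an assumption of this sketch).
    """
    best_index = min(
        range(len(codebook)),
        key=lambda i: math.dist(embedding, codebook[i]),
    )
    return best_index, codebook[best_index]

# Hypothetical 2-D codebook; real entries would be learned during training.
CODEBOOK = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]

# An unseen input embedding is snapped to its closest known code vector.
index, code = quantize((0.9, 0.2), CODEBOOK)
```

Because every input is mapped to one of a finite set of code vectors, an embedding that never appeared during training still resolves to a known entry. This is one plausible reading of how quantization helps with unseen textual commands.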

Prompt, Plan, Perform: LLM-based Humanoid Control via Quantized Imitation Learning
