HumanMimic: Learning Natural Locomotion and Transitions for Humanoid Robot via Wasserstein Adversarial Imitation

Annan Tang, Takuma Hiraoka, Naoki Hiraoka, Fan Shi, Kento Kawaharazuka, Kunio Kojima, Kei Okada, Masayuki Inaba

2024-04-23

Abstract: Transferring human motion skills to humanoid robots remains a significant challenge. In this study, we introduce a Wasserstein adversarial imitation learning system, allowing humanoid robots to replicate natural whole-body locomotion patterns and execute seamless transitions by mimicking human motions. First, we present a unified primitive-skeleton motion retargeting to mitigate morphological differences between arbitrary human demonstrators and humanoid robots. An adversarial critic component is integrated with Reinforcement Learning (RL) to guide the control policy to produce behaviors aligned with the data distribution of mixed reference motions. Additionally, we employ a specific Integral Probabilistic Metric (IPM), namely the Wasserstein-1 distance with a novel soft boundary constraint, to stabilize the training process and prevent mode collapse. Our system is evaluated on the full-sized humanoid JAXON in simulation. The resulting control policy demonstrates a wide range of locomotion patterns, including standing, push-recovery, squat walking, human-like straight-leg walking, and dynamic running. Notably, even in the absence of transition motions in the demonstration dataset, the robot shows an emergent ability to transition naturally between distinct locomotion patterns as the desired speed changes.

Robotics

What problem does this paper attempt to address?

This paper addresses the transfer of natural human motion skills to humanoid robots: reproducing natural whole-body locomotion patterns and transitioning smoothly between them. It proposes a system based on Wasserstein adversarial imitation learning that lets a humanoid robot replicate natural locomotion by imitating human motions and switch seamlessly between locomotion patterns as the desired speed changes. The difficulty stems from the complexity of control and motion design, and in particular from the significant morphological differences between humans and humanoid robots, which make direct imitation of human demonstrations hard. The paper mitigates this morphological gap with a unified primitive-skeleton motion retargeting step, and reports that natural transitions emerge even though the demonstration dataset contains no transition motions.
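To make the Wasserstein-1 distance mentioned in the abstract concrete, the following is a minimal sketch of the quantity in one dimension, together with the dual-form estimate that an adversarial critic typically maximizes. It is an illustration under stated assumptions, not the paper's implementation: the function names `wasserstein1_1d` and `critic_objective` are hypothetical, the samples are toy scalars rather than robot states, and the paper's soft boundary constraint is not reproduced because its form is not given here.

```python
from statistics import fmean


def wasserstein1_1d(xs, ys):
    """Empirical Wasserstein-1 distance between two equal-sized 1-D samples.

    In one dimension the optimal transport plan matches sorted samples,
    so the distance is the mean absolute gap between order statistics.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equal sample sizes")
    gaps = [abs(a - b) for a, b in zip(sorted(xs), sorted(ys))]
    return fmean(gaps)


def critic_objective(demo_scores, policy_scores):
    """Dual-form estimate E_demo[f] - E_policy[f] for a given critic f.

    By Kantorovich-Rubinstein duality, the supremum of this value over
    1-Lipschitz critics f equals the Wasserstein-1 distance.
    """
    return fmean(demo_scores) - fmean(policy_scores)


# Toy data (assumed for illustration): scalar "features" of demonstration
# and policy samples.
demo = [3.0, 1.0, 2.0]
policy = [0.0, 2.0, 1.0]

# The identity map f(x) = x is 1-Lipschitz, so it is an admissible critic.
primal = wasserstein1_1d(demo, policy)
dual = critic_objective(demo, policy)
```

For this toy pair of samples the identity critic already attains the primal value, which is why the two quantities agree. In a learned setting the critic is a neural network, and its Lipschitz behavior has to be enforced by some regularizer or constraint.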

Learning Gait-conditioned Bipedal Locomotion with Motor Adaptation

Learning Responsive Humanoid Motion Skills from Graph-Powered Motion Matching

Bi-Level Motion Imitation for Humanoid Robots

FastMimic: Model-Based Motion Imitation for Agile, Diverse and Generalizable Quadrupedal Locomotion

Fast Human Whole Body Motion Imitation Algorithm For Humanoid Robots

Whole-body Humanoid Robot Locomotion with Human Reference

Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion

SLoMo: A General System for Legged Robot Motion Imitation from Casual Videos

Robust Real-Time Upper Limb Motion Transfer from Human to Humanoid for Manipulation Skill Teaching

Learning Agile Bipedal Motions on a Quadrupedal Robot

Motion Segmentation and Balancing for a Biped Robot's Imitation Learning

Imitation of Dynamic Walking with BSN for Humanoid Robot

Real-Time Whole-Body Imitation by Humanoid Robots and Task-Oriented Teleoperation Using an Analytical Mapping Method and Quantitative Evaluation

Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior

Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation

Learning to Walk and Fly with Adversarial Motion Priors

Skeleton2Humanoid: Animating Simulated Characters for Physically-plausible Motion In-betweening

Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis

Learning Humanoid Locomotion over Challenging Terrain

Learning Bipedal Robot Locomotion from Human Movement