LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion

Firas Al-Hafez,Guoping Zhao,Jan Peters,Davide Tateo

2023-12-01

Abstract:Imitation Learning (IL) holds great promise for enabling agile locomotion in embodied agents. However, many existing locomotion benchmarks primarily focus on simplified toy tasks, often failing to capture the complexity of real-world scenarios and steering research toward unrealistic domains. To advance research in IL for locomotion, we present a novel benchmark designed to facilitate rigorous evaluation and comparison of IL algorithms. This benchmark encompasses a diverse set of environments, including quadrupeds, bipeds, and musculoskeletal human models, each accompanied by comprehensive datasets, such as real noisy motion capture data, ground truth expert data, and ground truth sub-optimal data, enabling evaluation across a spectrum of difficulty levels. To increase the robustness of learned agents, we provide an easy interface for dynamics randomization and offer a wide range of partially observable tasks to train agents across different embodiments. Finally, we provide handcrafted metrics for each task and ship our benchmark with state-of-the-art baseline algorithms to ease evaluation and enable fast benchmarking.

Machine Learning,Robotics

What problem does this paper attempt to address?

The main goal of this paper is to address the lack of standardized benchmarks in the field of robotic locomotion. Specifically: 1. **Existing Issues**: Many current gait simulation benchmarks focus on overly simplified tasks that do not adequately reflect the complexity of the real world, leading research to be directed towards impractical application scenarios. Additionally, due to the lack of unified standards, it is difficult to effectively compare and validate results between different studies. 2. **Proposed Solution**: To advance research in imitation learning (IL) within gait simulation, the authors propose a new benchmarking framework—LocoMuJoCo. This framework includes various environments (such as quadrupedal, bipedal, and musculoskeletal human models), each equipped with extensive datasets (such as real and noisy motion capture data, expert-level data, and suboptimal data) to support evaluations at different difficulty levels. 3. **Key Features**: - Provides a diverse range of tasks and datasets, covering tasks from simple to highly challenging. - Supports dynamics randomization, which helps improve the robustness of the trained agents. - Offers partially observable tasks (POMDP), enhancing the adaptability of algorithms. - Provides handcrafted evaluation metrics for each task, along with the latest baseline algorithms, enabling users to quickly assess the effectiveness of their methods. Through these measures, LocoMuJoCo aims to fill the current gap in benchmarking complex gait tasks in the field of imitation learning, promoting further development in this area.

LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion

HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation

FastMimic: Model-Based Motion Imitation for Agile, Diverse and Generalizable Quadrupedal Locomotion

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

MetaLoco: Universal Quadrupedal Locomotion with Meta-Reinforcement Learning and Motion Imitation

Open-Source Reinforcement Learning Environments Implemented in MuJoCo with Franka Manipulator

Imitation Learning from Observations under Transition Model Disparity

Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior

PUMA: Deep Metric Imitation Learning for Stable Motion Primitives

BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents

Bi-Level Motion Imitation for Humanoid Robots

BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation

Off-Policy Imitation Learning from Observations

Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion

Barkour: Benchmarking Animal-level Agility with Quadruped Robots

HumanMimic: Learning Natural Locomotion and Transitions for Humanoid Robot via Wasserstein Adversarial Imitation

Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI

Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response

SLoMo: A General System for Legged Robot Motion Imitation from Casual Videos

MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBench

FMB: A functional manipulation benchmark for generalizable robotic learning