LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion

Firas Al-Hafez,Guoping Zhao,Jan Peters,Davide Tateo
2023-12-01
Abstract:Imitation Learning (IL) holds great promise for enabling agile locomotion in embodied agents. However, many existing locomotion benchmarks primarily focus on simplified toy tasks, often failing to capture the complexity of real-world scenarios and steering research toward unrealistic domains. To advance research in IL for locomotion, we present a novel benchmark designed to facilitate rigorous evaluation and comparison of IL algorithms. This benchmark encompasses a diverse set of environments, including quadrupeds, bipeds, and musculoskeletal human models, each accompanied by comprehensive datasets, such as real noisy motion capture data, ground truth expert data, and ground truth sub-optimal data, enabling evaluation across a spectrum of difficulty levels. To increase the robustness of learned agents, we provide an easy interface for dynamics randomization and offer a wide range of partially observable tasks to train agents across different embodiments. Finally, we provide handcrafted metrics for each task and ship our benchmark with state-of-the-art baseline algorithms to ease evaluation and enable fast benchmarking.
Machine Learning,Robotics
What problem does this paper attempt to address?
The main goal of this paper is to address the lack of standardized benchmarks in the field of robotic locomotion. Specifically: 1. **Existing Issues**: Many current gait simulation benchmarks focus on overly simplified tasks that do not adequately reflect the complexity of the real world, leading research to be directed towards impractical application scenarios. Additionally, due to the lack of unified standards, it is difficult to effectively compare and validate results between different studies. 2. **Proposed Solution**: To advance research in imitation learning (IL) within gait simulation, the authors propose a new benchmarking framework—LocoMuJoCo. This framework includes various environments (such as quadrupedal, bipedal, and musculoskeletal human models), each equipped with extensive datasets (such as real and noisy motion capture data, expert-level data, and suboptimal data) to support evaluations at different difficulty levels. 3. **Key Features**: - Provides a diverse range of tasks and datasets, covering tasks from simple to highly challenging. - Supports dynamics randomization, which helps improve the robustness of the trained agents. - Offers partially observable tasks (POMDP), enhancing the adaptability of algorithms. - Provides handcrafted evaluation metrics for each task, along with the latest baseline algorithms, enabling users to quickly assess the effectiveness of their methods. Through these measures, LocoMuJoCo aims to fill the current gap in benchmarking complex gait tasks in the field of imitation learning, promoting further development in this area.