Abstract:Simulated humanoids are an appealing research domain due to their physical capabilities. Nonetheless, they are also challenging to control, as a policy must drive an unstable, discontinuous, and high-dimensional physical system. One widely studied approach is to utilize motion capture (MoCap) data to teach the humanoid agent low-level skills (e.g., standing, walking, and running) that can then be re-used to synthesize high-level behaviors. However, even with MoCap data, controlling simulated humanoids remains very hard, as MoCap data offers only kinematic information. Finding physical control inputs to realize the demonstrated motions requires computationally intensive methods like reinforcement learning. Thus, despite the publicly available MoCap data, its utility has been limited to institutions with large-scale compute. In this work, we dramatically lower the barrier for productive research on this topic by training and releasing high-quality agents that can track over three hours of MoCap data for a simulated humanoid in the dm_control physics-based environment. We release MoCapAct (Motion Capture with Actions), a dataset of these expert agents and their rollouts, which contain proprioceptive observations and actions. We demonstrate the utility of MoCapAct by using it to train a single hierarchical policy capable of tracking the entire MoCap dataset within dm_control and show the learned low-level component can be re-used to efficiently learn downstream high-level tasks. Finally, we use MoCapAct to train an autoregressive GPT model and show that it can control a simulated humanoid to perform natural motion completion given a motion prompt. Videos of the results and links to the code and dataset are available at https://microsoft.github.io/MoCapAct.

MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

Cascaded Compositional Residual Learning for Complex Interactive Behaviors

Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation

PoCo: Policy Composition from and for Heterogeneous Robot Learning

A Multi-Agent Approach for Adaptive Finger Cooperation in Learning-based In-Hand Manipulation

Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation

Skill Learning Strategy Based on Dynamic Motion Primitives for Human–Robot Cooperative Manipulation

Prompt, Plan, Perform: LLM-based Humanoid Control via Quantized Imitation Learning

Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation

Multi-Agent Behavior Retrieval: Retrieval-Augmented Policy Training for Cooperative Push Manipulation by Mobile Robots

Simulator Predictive Control: Using Learned Task Representations and MPC for Zero-Shot Generalization and Sequencing

Composable Part-Based Manipulation

HYPERmotion: Learning Hybrid Behavior Planning for Autonomous Loco-manipulation

Learning Reusable Manipulation Strategies

An Approach for Robotic Leaning Inspired by Biomimetic Adaptive Control

Representing, learning, and controlling complex object interactions

MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control

Scaling simulation-to-real transfer by learning composable robot skills

A Hierarchical Compliance-Based Contextual Policy Search for Robotic Manipulation Tasks With Multiple Objectives