Abstract:Imitation Learning (IL) is a promising paradigm for teaching robots to perform novel tasks using demonstrations. Most existing approaches for IL utilize neural networks (NN), however, these methods suffer from several well-known limitations: they 1) require large amounts of training data, 2) are hard to interpret, and 3) are hard to repair and adapt. There is an emerging interest in programmatic imitation learning (PIL), which offers significant promise in addressing the above limitations. In PIL, the learned policy is represented in a programming language, making it amenable to interpretation and repair. However, state-of-the-art PIL algorithms assume access to action labels and struggle to learn from noisy real-world demonstrations. In this paper, we propose PLUNDER, a novel PIL algorithm that integrates a probabilistic program synthesizer in an iterative Expectation-Maximization (EM) framework to address these shortcomings. Unlike existing PIL approaches, PLUNDER synthesizes probabilistic programmatic policies that are particularly well-suited for modeling the uncertainties inherent in real-world demonstrations. Our approach leverages an EM loop to simultaneously infer the missing action labels and the most likely probabilistic policy. We benchmark PLUNDER against several established IL techniques, and demonstrate its superiority across five challenging imitation learning tasks under noise. PLUNDER policies achieve 95% accuracy in matching the given demonstrations, outperforming the next best baseline by 19%. Additionally, policies generated by PLUNDER successfully complete the tasks 17% more frequently than the nearest baseline.

Third-Person Imitation Learning Via Image Difference and Variational Discriminator Bottleneck (student Abstract)

CEIL: Generalized Contextual Imitation Learning

TiLD: Third-person Imitation Learning by Estimating Domain Cognitive Differences of Visual Demonstrations

Active Third-Person Imitation Learning

Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

Imitation Learning from Observations under Transition Model Disparity

Extraneousness-Aware Imitation Learning

Off-policy Imitation Learning from Visual Inputs

3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction

Programmatic Imitation Learning from Unlabeled and Noisy Demonstrations

Resolving Copycat Problems in Visual Imitation Learning Via Residual Action Prediction

Manipulator-Independent Representations for Visual Imitation

Adversarial Imitation Learning from Visual Observations using Latent Information

Learning from demonstrations: An intuitive VR environment for imitation learning of construction robots

Co-Imitation Learning without Expert Demonstration

Robust Visual Imitation Learning with Inverse Dynamics Representations

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow

Deconfounding Imitation Learning with Variational Inference

What I See Is What You See: Joint Attention Learning for First and Third Person Video Co-analysis

Offline Imitation Learning with Variational Counterfactual Reasoning