Abstract: Behavior cloning is a common imitation learning paradigm. Under behavior cloning the robot collects expert demonstrations, and then trains a policy to match the actions taken by the expert. This works well when the robot learner visits states where the expert has already demonstrated the correct action, but inevitably the robot will also encounter new states outside of its training dataset. If the robot learner takes the wrong action at these new states it could move farther from the training data, which in turn leads to increasingly incorrect actions and compounding errors. Existing works try to address this fundamental challenge by augmenting or enhancing the training data. By contrast, in our paper we develop the control-theoretic properties of behavior-cloned policies. Specifically, we consider the error dynamics between the system's current state and the states in the expert dataset. From the error dynamics we derive model-based and model-free conditions for stability: under these conditions the robot shapes its policy so that its current behavior converges towards example behaviors in the expert dataset. In practice, this results in Stable-BC, an easy-to-implement extension of standard behavior cloning that is provably robust to covariate shift. We demonstrate the effectiveness of our algorithm in simulations with interactive, nonlinear, and visual environments. We also conduct experiments where a robot arm uses Stable-BC to play air hockey. See our website here: https://collab.me.vt.edu/Stable-BC/
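To make the idea of error dynamics concrete, the following is a minimal sketch and not the Stable-BC algorithm itself. It assumes a hypothetical scalar linear system `x_dot = a*x + b*u` and a linear policy `u = k*x`. None of these names or values come from the paper. Under those assumptions, the error `e` between the robot's state and an expert state evolves as `e_dot = (a + b*k) * e`, so the error shrinks when `a + b*k < 0` and compounds when it is positive.

```python
import random


def fit_linear_policy(states, actions):
    """Standard behavior cloning for a scalar policy u = k * x (least squares)."""
    num = sum(x * u for x, u in zip(states, actions))
    den = sum(x * x for x in states)
    return num / den


def error_rate(a, b, k):
    """Coefficient of the error dynamics e_dot = (a + b*k) * e.

    Assumes the hypothetical system x_dot = a*x + b*u with policy u = k*x.
    """
    return a + b * k


def rollout_error(a, b, k, e0, dt=0.01, steps=500):
    """Integrate the error dynamics with forward Euler and return the final error."""
    e = e0
    for _ in range(steps):
        e += dt * error_rate(a, b, k) * e
    return e


# Hypothetical open-loop-unstable system and expert gain (illustrative values only).
a, b = 1.0, 1.0
k_expert = -3.0

# Noise-free expert demonstrations: states and the actions the expert took.
rng = random.Random(0)
states = [rng.uniform(-1.0, 1.0) for _ in range(50)]
actions = [k_expert * x for x in states]

k_hat = fit_linear_policy(states, actions)

# Stability check on the cloned policy: negative rate means errors decay.
is_stable = error_rate(a, b, k_hat) < 0
final_error_good = rollout_error(a, b, k_hat, e0=0.5)

# A poorly cloned gain (assumed for contrast) leaves the error dynamics unstable.
k_bad = -0.5
final_error_bad = rollout_error(a, b, k_bad, e0=0.5)

print(f"cloned gain: {k_hat:.3f}, stable: {is_stable}")
print(f"final error (cloned): {final_error_good:.2e}")
print(f"final error (poor gain): {final_error_bad:.2e}")
```

In this toy setting the stability condition is a single inequality on the policy gain. The abstract's model-based and model-free conditions address the general nonlinear case, which this sketch does not attempt to reproduce.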

Expert Data Augmentation in Imitation Learning (Student Abstract)

Diffusion Model-Augmented Behavioral Cloning

Data augmentation for efficient learning from parametric experts

Imitation Learning from Imperfection: Theoretical Justifications and Algorithms

Theoretical Analysis of Offline Imitation With Supplementary Dataset

Behavioral Cloning from Observation

Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning

Fighting Copycat Agents in Behavioral Cloning from Observation Histories

CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning

Agnostic Interactive Imitation Learning: New Theory and Practical Algorithms

DABI: Evaluation of Data Augmentation Methods Using Downsampling in Bilateral Control-Based Imitation Learning with Images

Constrained Behavior Cloning for Robotic Learning

MEGA-DAgger: Imitation Learning with Multiple Imperfect Experts

Improving Generalization in Game Agents with Data Augmentation in Imitation Learning

Adaptive t-Momentum-based Optimization for Unknown Ratio of Outliers in Amateur Data in Imitation Learning

Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels

SAFE-GIL: SAFEty Guided Imitation Learning

Stable-BC: Controlling Covariate Shift with Stable Behavior Cloning

On Generalization of Adversarial Imitation Learning and Beyond

ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning