Abstract:Imitation learning, which learns agent policy by mimicking expert demonstration, has shown promising results in many applications such as medical treatment regimes and self-driving vehicles. However, it remains a difficult task to interpret control policies learned by the agent. Difficulties mainly come from two aspects: 1) agents in imitation learning are usually implemented as deep neural networks, which are black-box models and lack interpretability; 2) the latent causal mechanism behind agents' decisions may vary along the trajectory, rather than staying static throughout time steps. To increase transparency and offer better interpretability of the neural agent, we propose to expose its captured knowledge in the form of a directed acyclic causal graph, with nodes being action and state variables and edges denoting the causal relations behind predictions. Furthermore, we design this causal discovery process to be state-dependent, enabling it to model the dynamics in latent causal graphs. Concretely, we conduct causal discovery from the perspective of Granger causality and propose a self-explainable imitation learning framework, {\method}. The proposed framework is composed of three parts: a dynamic causal discovery module, a causality encoding module, and a prediction module, and is trained in an end-to-end manner. After the model is learned, we can obtain causal relations among states and action variables behind its decisions, exposing policies learned by it. Experimental results on both synthetic and real-world datasets demonstrate the effectiveness of the proposed {\method} in learning the dynamic causal graphs for understanding the decision-making of imitation learning meanwhile maintaining high prediction accuracy.

Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing Maps

Intrinsic Motivation Driven Intuitive Physics Learning using Deep Reinforcement Learning with Intrinsic Reward Normalization

Learning Representative Trajectories of Dynamical Systems via Domain-Adaptive Imitation

Learning Generative State Space Models for Active Inference

Toward an AI Physicist for Unsupervised Learning

Interpretable Imitation Learning with Dynamic Causal Relations

Learning Non-Markovian Decision-Making from State-only Sequences

Curiosity-driven Intuitive Physics Learning

Inference of Affordances and Active Motor Control in Simulated Agents

Learning Dynamic Cognitive Map with Autonomous Navigation

MimicPlay: Long-Horizon Imitation Learning by Watching Human Play

Learning One-Shot Imitation From Humans Without Humans

Imitating by Generating: Deep Generative Models for Imitation of Interactive Tasks

One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning

Efficient Intrinsically Motivated Robotic Grasping with Learning-Adaptive Imagination in Latent Space

Learning Latent Plans from Play

A structured prediction approach for robot imitation learning

Deep Imitative Models for Flexible Inference, Planning, and Control

Language-Conditioned Imitation Learning for Robot Manipulation Tasks

Interactive Imitation Learning in State-Space

Learning Efficient Representation for Intrinsic Motivation