Abstract: To make daily decisions, human agents devise their own "strategies" governing their mobility dynamics (e.g., taxi drivers have preferred working regions and times, and urban commuters have preferred routes and transit modes). Recent research such as generative adversarial imitation learning (GAIL) has demonstrated success in learning human decision-making strategies from behavior data using deep neural networks (DNNs), which can accurately mimic how humans behave in various scenarios, e.g., playing video games. However, such DNN-based models are "black box" models in nature, making it hard to explain what knowledge the models have learned from humans and how the models make their decisions, a problem that has not been addressed in the imitation learning literature. This paper addresses this research gap by proposing xGAIL, the first explainable generative adversarial imitation learning framework. The proposed xGAIL framework consists of two novel components, Spatial Activation Maximization (SpatialAM) and Spatial Randomized Input Sampling Explanation (SpatialRISE), which extract both global and local knowledge from a well-trained GAIL model to explain how a human agent makes decisions. Specifically, we take taxi drivers' passenger-seeking strategy as an example to validate the effectiveness of the proposed xGAIL framework. Our analysis of large-scale real-world taxi trajectory data shows promising results from two aspects: i) global explainable knowledge of what nearby traffic conditions impel a taxi driver to choose a particular direction to find the next passenger, and ii) local explainable knowledge of what key (sometimes hidden) factors a taxi driver considers when making a particular decision.
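To give a rough sense of the kind of local explanation the abstract refers to, the sketch below implements a simplified, generic form of randomized input sampling (RISE-style saliency): random binary masks hide parts of a spatial input, the model is scored on each masked input, and each cell's importance is the mask-weighted average score. This is not the authors' SpatialRISE. The function names, the per-cell masking (rather than upsampled low-resolution masks), and the toy scoring function standing in for a trained GAIL policy are all assumptions made for illustration.

```python
import numpy as np

def rise_saliency(score_fn, state, n_masks=2000, keep_prob=0.5, seed=0):
    """Generic RISE-style importance map for a 2-D input grid (illustrative only)."""
    rng = np.random.default_rng(seed)
    saliency = np.zeros_like(state, dtype=float)
    for _ in range(n_masks):
        # Keep each cell independently with probability keep_prob, zero out the rest.
        mask = (rng.random(state.shape) < keep_prob).astype(float)
        # Credit the score of the masked input to every cell that stayed visible.
        saliency += score_fn(state * mask) * mask
    # Normalize by the expected number of times each cell was visible.
    return saliency / (n_masks * keep_prob)

# Hypothetical stand-in for a trained policy's action score:
# it depends only on the grid cell at row 1, column 2.
def toy_score(grid):
    return float(grid[1, 2])

state = np.ones((4, 4))  # hypothetical 4x4 grid of spatial features
saliency = rise_saliency(toy_score, state)
top_cell = np.unravel_index(np.argmax(saliency), saliency.shape)
```

Under these assumptions, the cell the toy score actually depends on receives the highest importance, which is the behavior a local explanation method is expected to recover from a black-box model.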

Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective

When Will Generative Adversarial Imitation Learning Algorithms Attain Global Convergence

On Computation and Generalization of Generative Adversarial Imitation Learning.

Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate

C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory

Improve generated adversarial imitation learning with reward variance regularization

GAILPG: Multi-Agent Policy Gradient with Generative Adversarial Imitation Learning

Distributional generative adversarial imitation learning with reproducing kernel generalization

Generative Adversarial Imitation Learning from Failed Experiences

Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration

f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning

Diffusion-Reward Adversarial Imitation Learning

Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning

Ranking-Based Generative Adversarial Imitation Learning

On Value Discrepancy of Imitation Learning

A Perturbation-Based Policy Distillation Framework with Generative Adversarial Nets

xGAIL: Explainable Generative Adversarial Imitation Learning for Explainable Human Decision Analysis

Adaptive Generative Adversarial Maximum Entropy Inverse Reinforcement Learning

Diffusing States and Matching Scores: A New Framework for Imitation Learning

Interaction Matters: A Note on Non-asymptotic Local Convergence of Generative Adversarial Networks