Abstract:Recently, explanatory natural language inference has attracted much attention for the interpretability of logic relationship prediction, which is also known as explanation generation for Natural Language Inference (NLI). Existing explanation generators based on discriminative Encoder-Decoder architecture have achieved noticeable results. However, we find that these discriminative generators usually generate explanations with correct evidence but incorrect logic semantic. It is due to that logic information is implicitly encoded in the premise-hypothesis pairs and difficult to model. Actually, logic information identically exists between premise-hypothesis pair and explanation. And it is easy to extract logic information that is explicitly contained in the target explanation. Hence we assume that there exists a latent space of logic information while generating explanations. Specifically, we propose a generative model called Variational Explanation Generator (VariationalEG) with a latent variable to model this space. Training with the guide of explicit logic information in target explanations, latent variable in VariationalEG could capture the implicit logic information in premise-hypothesis pairs effectively. Additionally, to tackle the problem of posterior collapse while training VariaztionalEG, we propose a simple yet effective approach called Logic Supervision on the latent variable to force it to encode logic information. Experiments on explanation generation benchmark—explanation-Stanford Natural Language Inference (e-SNLI) demonstrate that the proposed VariationalEG achieves significant improvement compared to previous studies and yields a state-of-the-art result. Furthermore, we perform the analysis of generated explanations to demonstrate the effect of the latent variable. Keywords—Natural Language Inference, explanation generation, variational auto-encoder, generative model.

Stable local interpretable model-agnostic explanations based on a variational autoencoder

Generative Local Interpretable Model-Agnostic Explanations

Local Interpretable Model Agnostic Shap Explanations for machine learning models

An Extension of LIME with Improvement of Interpretability and Fidelity

BMB-LIME: LIME with modeling local nonlinearity and uncertainty in explainability

SEGAL time series classification - Stable explanations using a generative model and an adaptive weighting method for LIME

Locally Invariant Explanations: Towards Stable and Unidirectional Explanations through Local Invariant Learning

Are Your Explanations Reliable? Investigating the Stability of LIME in Explaining Text Classifiers by Marrying XAI and Adversarial Attack

VAE-LIME: Deep Generative Model Based Approach for Local Data-Driven Model Interpretability Applied to the Ironmaking Industry

GLIME: General, Stable and Local LIME Explanation

Interpretability and Transparency of Machine Learning in File Fragment Analysis with Explainable Artificial Intelligence

"Why Should You Trust My Explanation?" Understanding Uncertainty in LIME Explanations

Explaining machine learning models using entropic variable projection

ViCE: Visual Counterfactual Explanations for Machine Learning Models

G-LIME: Statistical Learning for Local Interpretations of Deep Neural Networks Using Global Priors.

Sampling - Variational Auto Encoder - Ensemble: In the Quest of Explainable Artificial Intelligence

Exploring local explanations of nonlinear models using animated linear projections

Local Interpretable Model-agnostic Explanations of Bayesian Predictive Models via Kullback-Leibler Projections

Interpretable Deep Learning Models: Enhancing Transparency and Trustworthiness in Explainable AI

Towards Interpretable Natural Language Understanding with Explanations As Latent Variables

Variational Explanation Generator: Generating Explanation for Natural Language Inference Using Variational Auto-Encoder