Model-Agnostic Local Explanations with Genetic Algorithms for Text Classification

Qingfeng Du,Jincheng Xu
DOI: https://doi.org/10.18293/seke2021-040
2021-01-01
Abstract:The interpretability of black-box text classification models has been receiving widespread attention in recent years accompanying the growing popularity of artificial intelligence.To garner user trust on the model's decision-making process, it is imperative to provide faithful instance-wise justifications and rationalize the prediction in a human-readable way.In this paper, we address this challenge by introducing Locally Universal Rules (LURs) as model-agnostic local explanations.LURs are a subset of input words sufficient for the model to arrive at a particular prediction, even if the rest of words are perturbed slightly.We show the identification of the optimal LUR is NP-complete.Consequently, we propose a population-based algorithm LUR-Locator to perform the constrained optimization efficiently.We conduct extensive experiments to evaluate our algorithm on a cross product of well-established text classification datasets and models.The empirical results demonstrate that LURLocator can efficiently generate high-quality local explanations, as compared to existing explanatory methods.
What problem does this paper attempt to address?