Improving the Quality of Explanations with Local Embedding Perturbations

Yunzhe Jia,James Bailey,Kotagiri Ramamohanarao,Christopher Leckie,Michael E. Houle
DOI: https://doi.org/10.1145/3292500.3330930
2019-01-01
Abstract:Classifier explanations have been identified as a crucial component of knowledge discovery. Local explanations evaluate the behavior of a classifier in the vicinity of a given instance. A key step in this approach is to generate synthetic neighbors of the given instance. This neighbor generation process is challenging and it has considerable impact on the quality of explanations. To assess quality of generated neighborhoods, we propose a local intrinsic dimensionality (LID) based locality constraint. Based on this, we then propose a new neighborhood generation method. Our method first fits a local embedding/subspace around a given instance using the LID of the test instance as the target dimensionality, then generates neighbors in the local embedding and projects them back to the original space. Experimental results show that our method generates more realistic neighborhoods and consequently better explanations. It can be used in combination with existing local explanation algorithms.
What problem does this paper attempt to address?