Refutation of Shapley Values for XAI -- Additional Evidence

Xuanxiang Huang,Joao Marques-Silva
2023-09-30
Abstract:Recent work demonstrated the inadequacy of Shapley values for explainable artificial intelligence (XAI). Although to disprove a theory a single counterexample suffices, a possible criticism of earlier work is that the focus was solely on Boolean classifiers. To address such possible criticism, this paper demonstrates the inadequacy of Shapley values for families of classifiers where features are not boolean, but also for families of classifiers for which multiple classes can be picked. Furthermore, the paper shows that the features changed in any minimal $l_0$ distance adversarial examples do not include irrelevant features, thus offering further arguments regarding the inadequacy of Shapley values for XAI.
Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The paper aims to address the following core issues: ### Main Issues Addressed by the Paper 1. **Questioning the Application of Shapley Values in Explainable Artificial Intelligence (XAI)**: - The paper presents a series of evidences to further demonstrate the shortcomings of Shapley values as a method for measuring feature importance in the field of explainable artificial intelligence (XAI). - Specifically, for certain classifiers, there is an inconsistency between the feature importance obtained by Shapley values and the feature relevance measures based on logical reasoning. 2. **Extending Previous Research Results**: - The authors' previous research has already demonstrated the limitations of Shapley values on Boolean classifiers. - This paper extends these findings by considering non-Boolean classifiers and multi-class classifiers, addressing potential criticisms that the previous analysis was limited to Boolean classifiers. 3. **Exploring the Role of Irrelevant Features in Minimum Distance Adversarial Examples**: - The study shows that irrelevant features do not need to be changed in the process of finding minimum distance adversarial examples. - This observation supports the paper's viewpoint that Shapley values provide misleading information, as they may incorrectly assign importance to irrelevant features. ### Main Contributions - **Theoretical Contribution**: The paper provides theoretical arguments illustrating the inconsistency between Shapley values and methods based on logical reasoning (such as inductive explanations and contrastive explanations). - **Empirical Analysis**: By analyzing different types of classifiers (including tabular representations, decision trees, and ordered multi-valued decision diagrams), the paper demonstrates the limitations of Shapley values in these scenarios. - **Case Studies**: Through specific classifier examples, such as multi-valued classifiers and discrete feature classifiers, the paper explains in detail how to calculate Shapley values and compares their results with feature importance derived from logical reasoning. ### Conclusion In summary, through theoretical analysis and empirical research, this paper delves into the applicability of Shapley values in the XAI field and presents a series of evidences to question their effectiveness as a tool for explaining machine learning models. In particular, it emphasizes that Shapley values may provide misleading feature importance rankings, especially when dealing with non-Boolean classifiers. These findings are significant for understanding and improving explainable artificial intelligence technologies.