Abstract: Recent work demonstrated the inadequacy of Shapley values for explainable artificial intelligence (XAI). Although a single counterexample suffices to disprove a theory, a possible criticism of earlier work is that the focus was solely on Boolean classifiers. To address such possible criticism, this paper demonstrates the inadequacy of Shapley values for families of classifiers where features are not Boolean, as well as for families of classifiers for which multiple classes can be picked. Furthermore, the paper shows that the features changed in any minimal $l_0$ distance adversarial examples do not include irrelevant features, thus offering further arguments regarding the inadequacy of Shapley values for XAI.

What problem does this paper attempt to address?

The paper aims to address the following core issues:

### Main Issues Addressed by the Paper

1. **Questioning the Application of Shapley Values in Explainable Artificial Intelligence (XAI)**:
   - The paper presents further evidence of the shortcomings of Shapley values as a measure of feature importance in XAI.
   - Specifically, for certain classifiers, the feature importance given by Shapley values is inconsistent with measures of feature relevance based on logical reasoning.
2. **Extending Previous Research Results**:
   - The authors' previous research demonstrated the limitations of Shapley values on Boolean classifiers.
   - This paper extends those findings to non-Boolean classifiers and multi-class classifiers, addressing the potential criticism that the previous analysis was limited to Boolean classifiers.
3. **Exploring the Role of Irrelevant Features in Minimum Distance Adversarial Examples**:
   - The study shows that the features changed in any minimal $l_0$ distance adversarial example do not include irrelevant features.
   - This observation supports the paper's view that Shapley values provide misleading information, since they may assign importance to irrelevant features.

### Main Contributions

- **Theoretical Contribution**: The paper provides theoretical arguments illustrating the inconsistency between Shapley values and explanations based on logical reasoning (such as abductive explanations and contrastive explanations).
- **Empirical Analysis**: By analyzing different types of classifiers (including tabular representations, decision trees, and ordered multi-valued decision diagrams), the paper demonstrates the limitations of Shapley values in these scenarios.
- **Case Studies**: Through specific classifier examples, such as multi-valued classifiers and discrete feature classifiers, the paper explains in detail how to calculate Shapley values and compares the results with feature importance derived from logical reasoning.

### Conclusion

In summary, through theoretical analysis and empirical research, this paper examines the applicability of Shapley values in XAI and presents evidence that questions their effectiveness as a tool for explaining machine learning models. In particular, it emphasizes that Shapley values may provide misleading feature importance rankings, including for non-Boolean and multi-class classifiers. These findings are significant for understanding and improving explainable artificial intelligence technologies.
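The mismatch described above can be illustrated on a very small example. The following is a minimal sketch, not the paper's own construction: the two-feature, three-class classifier `kappa`, the feature domains, and the instance are all hypothetical and chosen only for illustration. The sketch assumes the usual setup in this line of work, namely that the Shapley characteristic function is the expected classifier output (class labels treated as numbers) under a uniform input distribution with a subset of features fixed to the instance's values, and that a feature is relevant exactly when it occurs in some subset-minimal sufficient set (abductive explanation).

```python
from fractions import Fraction
from itertools import combinations, product
from math import factorial

# Hypothetical toy setup (an assumption of this sketch, not from the paper).
DOMAINS = [(0, 1), (0, 1, 2)]  # feature 0 is binary, feature 1 is ternary
INSTANCE = (1, 1)              # the instance being explained


def kappa(x):
    """Hypothetical classifier with classes {0, 1, 2}."""
    x1, x2 = x
    if x1 == 1:
        return 1
    return 2 if x2 == 1 else 0


def consistent_points(fixed):
    """All points that agree with INSTANCE on the features in `fixed`."""
    choices = [(INSTANCE[i],) if i in fixed else DOMAINS[i]
               for i in range(len(DOMAINS))]
    return list(product(*choices))


def expected_value(fixed):
    """Characteristic function: mean of kappa with `fixed` features pinned."""
    pts = consistent_points(fixed)
    return Fraction(sum(kappa(p) for p in pts), len(pts))


def shapley_values():
    """Exact Shapley value of each feature, by enumerating all coalitions."""
    n = len(DOMAINS)
    values = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = Fraction(0)
        for size in range(n):
            weight = Fraction(factorial(size) * factorial(n - size - 1),
                              factorial(n))
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (expected_value(s | {i}) - expected_value(s))
        values.append(total)
    return values


def is_sufficient(fixed):
    """True if pinning `fixed` to INSTANCE's values forces the prediction."""
    target = kappa(INSTANCE)
    return all(kappa(p) == target for p in consistent_points(fixed))


def abductive_explanations():
    """Subset-minimal sufficient feature sets, found by increasing size."""
    n = len(DOMAINS)
    axps = []
    for size in range(n + 1):
        for subset in combinations(range(n), size):
            s = set(subset)
            if is_sufficient(s) and not any(a <= s for a in axps):
                axps.append(s)
    return axps


def relevant_features():
    """Features that occur in at least one abductive explanation."""
    return set().union(*abductive_explanations())


def minimal_adversarial_changes():
    """Feature sets changed by the l0-minimal adversarial examples."""
    target = kappa(INSTANCE)
    diffs = [{i for i, (a, b) in enumerate(zip(p, INSTANCE)) if a != b}
             for p in product(*DOMAINS) if kappa(p) != target]
    best = min(len(d) for d in diffs)
    return [d for d in diffs if len(d) == best]


if __name__ == "__main__":
    for i, sv in enumerate(shapley_values()):
        print(f"feature {i}: Shapley value {sv}")
    print("relevant features:", relevant_features())
    print("features changed by minimal adversarial examples:",
          minimal_adversarial_changes())
```

In this toy case, feature 1 occurs in no abductive explanation and is not changed by any minimal $l_0$ adversarial example, yet its Shapley value is 1/3, which is larger in absolute value than the -1/6 assigned to feature 0, the only relevant feature. Exhaustive enumeration is used here only because the example is tiny; it does not scale to realistic feature counts.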

Refutation of Shapley Values for XAI -- Additional Evidence
