Abstract:Shapley values originated in cooperative game theory but are extensively used today as a model-agnostic explanation framework to explain predictions made by complex machine learning models in the industry and academia. There are several algorithmic approaches for computing different versions of Shapley value explanations. Here, we consider Shapley values incorporating feature dependencies, referred to as conditional Shapley values, for predictive models fitted to tabular data. Estimating precise conditional Shapley values is difficult as they require the estimation of non-trivial conditional expectations. In this article, we develop new methods, extend earlier proposed approaches, and systematize the new refined and existing methods into different method classes for comparison and evaluation. The method classes use either Monte Carlo integration or regression to model the conditional expectations. We conduct extensive simulation studies to evaluate how precisely the different method classes estimate the conditional expectations, and thereby the conditional Shapley values, for different setups. We also apply the methods to several real-world data experiments and provide recommendations for when to use the different method classes and approaches. Roughly speaking, we recommend using parametric methods when we can specify the data distribution almost correctly, as they generally produce the most accurate Shapley value explanations. When the distribution is unknown, both generative methods and regression models with a similar form as the underlying predictive model are good and stable options. Regression-based methods are often slow to train but quickly produce the Shapley value explanations once trained. The vice versa is true for Monte Carlo-based methods, making the different methods appropriate in different practical situations.

Shapley Values for Explaining the Black Box Nature of Machine Learning Model Clustering

Explaining the Model and Feature Dependencies by Decomposition of the Shapley Value

Variational Shapley Network: A Probabilistic Approach to Self-Explaining Shapley values with Uncertainty Quantification

Shapley Marginal Surplus for Strong Models

Coalitional Strategies for Efficient Individual Prediction Explanation

Shapley variable importance clouds for interpretable machine learning

Shapley variable importance cloud for interpretable machine learning

Explanation of Machine Learning Models Using Shapley Additive Explanation and Application for Real Data in Hospital

Explaining the data or explaining a model? Shapley values that uncover non-linear dependencies

Calculation of exact Shapley values for explaining support vector machine models using the radial basis function kernel

CHG Shapley: Efficient Data Valuation and Selection towards Trustworthy Machine Learning

Explainable AI: Using Shapley Value to Explain Complex Anomaly Detection ML-Based Systems

Local Interpretable Model Agnostic Shap Explanations for machine learning models

Extracting spatial effects from machine learning model using local interpretation method: An example of SHAP and XGBoost

A comparative study of methods for estimating model-agnostic Shapley value explanations

Shapley-based Explainable AI for Clustering Applications in Fault Diagnosis and Prognosis

Deep Descriptive Clustering

Using Shapley Values and Variational Autoencoders to Explain Predictive Models with Dependent Mixed Features

Why Groups Matter: Necessity of Group Structures in Attributions

Causal Analysis of Shapley Values: Conditional vs. Marginal

A $k$-additive Choquet integral-based approach to approximate the SHAP values for local interpretability in machine learning