ShapG: new feature importance method based on the Shapley value

Chi Zhao,Jing Liu,Elena Parilina
2024-06-30
Abstract:With wide application of Artificial Intelligence (AI), it has become particularly important to make decisions of AI systems explainable and transparent. In this paper, we proposed a new Explainable Artificial Intelligence (XAI) method called ShapG (Explanations based on Shapley value for Graphs) for measuring feature importance. ShapG is a model-agnostic global explanation method. At the first stage, it defines an undirected graph based on the dataset, where nodes represent features and edges are added based on calculation of correlation coefficients between features. At the second stage, it calculates an approximated Shapley value by sampling the data taking into account this graph structure. The sampling approach of ShapG allows to calculate the importance of features efficiently, i.e. to reduce computational complexity. Comparison of ShapG with other existing XAI methods shows that it provides more accurate explanations for two examined datasets. We also compared other XAI methods developed based on cooperative game theory with ShapG in running time, and the results show that ShapG exhibits obvious advantages in its running time, which further proves efficiency of ShapG. In addition, extensive experiments demonstrate a wide range of applicability of the ShapG method for explaining complex models. We find ShapG an important tool in improving explainability and transparency of AI systems and believe it can be widely used in various fields.
Artificial Intelligence,Computer Science and Game Theory
What problem does this paper attempt to address?
The main goal of this paper is to propose a new feature importance evaluation method, which is based on Shapley values and optimized for graph structures to improve interpretability and computational efficiency in machine learning models. Specifically, the paper introduces a new method called ShapG (Graph-based Shapley value explanation method), which is a globally interpretable artificial intelligence (XAI) technique used to measure the importance of features in machine learning models. The characteristics of the ShapG method are as follows: 1. **Graph-based Shapley value computation**: ShapG defines the relationships between features by constructing an undirected graph, where nodes represent features and edges are added based on the correlation coefficients between features. This allows the use of graph structures to approximate the computation of Shapley values, thereby reducing computational complexity. 2. **Efficiency**: Compared to traditional Shapley value computation, ShapG only considers the coalition between each node and its neighbors, rather than all possible coalitions, which significantly improves computation speed. 3. **Generality**: ShapG can explain any type of model, including complex neural networks or hybrid models, making it a model-agnostic method. 4. **Experimental validation**: The paper demonstrates the effectiveness and accuracy of ShapG through experimental results on two datasets (the "Housing Prices" regression task and the "H1N1 Flu Vaccine" classification task). Compared to existing XAI methods (such as LIME, SHAP, etc.), ShapG provides more accurate explanations and shows significant advantages in runtime. 5. **Contribution**: The main contribution of the paper is the proposal of a new graph-based Shapley value computation method, proving its effectiveness and efficiency in explaining complex models, especially when dealing with large-scale data. In summary, ShapG aims to provide a more computationally efficient and interpretable method for evaluating feature importance in machine learning models by combining concepts from graph theory and cooperative game theory. This is crucial for enhancing the transparency and trustworthiness of artificial intelligence systems.