Interpretable Graph Neural Networks for Tabular Data

Amr Alkhatib,Sofiane Ennadir,Henrik Boström,Michalis Vazirgiannis
2024-08-13
Abstract:Data in tabular format is frequently occurring in real-world applications. Graph Neural Networks (GNNs) have recently been extended to effectively handle such data, allowing feature interactions to be captured through representation learning. However, these approaches essentially produce black-box models, in the form of deep neural networks, precluding users from following the logic behind the model predictions. We propose an approach, called IGNNet (Interpretable Graph Neural Network for tabular data), which constrains the learning algorithm to produce an interpretable model, where the model shows how the predictions are exactly computed from the original input features. A large-scale empirical investigation is presented, showing that IGNNet is performing on par with state-of-the-art machine-learning algorithms that target tabular data, including XGBoost, Random Forests, and TabNet. At the same time, the results show that the explanations obtained from IGNNet are aligned with the true Shapley values of the features without incurring any additional computational overhead.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the **interpretability problem** when using Graph Neural Networks (GNNs) in tabular data. Specifically, the paper proposes a method named IGNNet (Interpretable Graph Neural Network for tabular data) to ensure that the model not only has strong predictive performance but also can provide transparent and easy - to - understand explanations. #### Main problem background: 1. **Importance of tabular data**: Tabular data is very common in real - world applications, such as in the medical, financial and other fields. 2. **Application of GNN in tabular data**: In recent years, GNN has been extended to process tabular data and can capture the interactions between features through representation learning. 3. **Problems of black - box models**: Existing GNN - based methods usually produce black - box models, and users cannot understand the logic behind the model predictions, which is unacceptable in fields requiring legal and ethical considerations (such as medicine and law). 4. **Limitations of existing explanation methods**: Posteriori explanation techniques (such as SHAP, LIME, etc.) can explain black - box models, but these methods are computationally expensive and lack fidelity guarantees. #### Specific objectives of the paper: - **Propose IGNNet**: A new GNN method that can generate an interpretable model while maintaining high predictive performance. - **Verify the effectiveness of IGNNet**: Through large - scale empirical research, prove that the predictive performance of IGNNet is comparable to existing state - of - the - art machine learning algorithms (such as XGBoost, Random Forests, TabNet, etc.), and its explanations are consistent with the true Shapley values without additional computational overhead. ### Core ideas of IGNNet: - **Interpretability constraints**: By restricting the learning algorithm, IGNNet can show how the prediction is accurately calculated from the original input features. - **Graph representation of tabular data**: Represent each data instance as a graph, where features are nodes and the linear correlations between features are the weights of the edges. - **Message passing and graph pooling**: Update the node representations through the message - passing mechanism and map the entire graph representation to a single value through graph pooling operations, thereby achieving the tracking of the contribution of each feature. ### Summary: The main contribution of the paper is to propose a GNN method that can maintain high performance and provide transparent explanations, solving the black - box problem when existing GNNs are applied to tabular data, and is especially significant in application scenarios requiring trust and interpretability.