Abstract:Data in tabular format is frequently occurring in real-world applications. Graph Neural Networks (GNNs) have recently been extended to effectively handle such data, allowing feature interactions to be captured through representation learning. However, these approaches essentially produce black-box models, in the form of deep neural networks, precluding users from following the logic behind the model predictions. We propose an approach, called IGNNet (Interpretable Graph Neural Network for tabular data), which constrains the learning algorithm to produce an interpretable model, where the model shows how the predictions are exactly computed from the original input features. A large-scale empirical investigation is presented, showing that IGNNet is performing on par with state-of-the-art machine-learning algorithms that target tabular data, including XGBoost, Random Forests, and TabNet. At the same time, the results show that the explanations obtained from IGNNet are aligned with the true Shapley values of the features without incurring any additional computational overhead.

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? This paper aims to solve the **interpretability problem** when using Graph Neural Networks (GNNs) in tabular data. Specifically, the paper proposes a method named IGNNet (Interpretable Graph Neural Network for tabular data) to ensure that the model not only has strong predictive performance but also can provide transparent and easy - to - understand explanations. #### Main problem background: 1. **Importance of tabular data**: Tabular data is very common in real - world applications, such as in the medical, financial and other fields. 2. **Application of GNN in tabular data**: In recent years, GNN has been extended to process tabular data and can capture the interactions between features through representation learning. 3. **Problems of black - box models**: Existing GNN - based methods usually produce black - box models, and users cannot understand the logic behind the model predictions, which is unacceptable in fields requiring legal and ethical considerations (such as medicine and law). 4. **Limitations of existing explanation methods**: Posteriori explanation techniques (such as SHAP, LIME, etc.) can explain black - box models, but these methods are computationally expensive and lack fidelity guarantees. #### Specific objectives of the paper: - **Propose IGNNet**: A new GNN method that can generate an interpretable model while maintaining high predictive performance. - **Verify the effectiveness of IGNNet**: Through large - scale empirical research, prove that the predictive performance of IGNNet is comparable to existing state - of - the - art machine learning algorithms (such as XGBoost, Random Forests, TabNet, etc.), and its explanations are consistent with the true Shapley values without additional computational overhead. ### Core ideas of IGNNet: - **Interpretability constraints**: By restricting the learning algorithm, IGNNet can show how the prediction is accurately calculated from the original input features. - **Graph representation of tabular data**: Represent each data instance as a graph, where features are nodes and the linear correlations between features are the weights of the edges. - **Message passing and graph pooling**: Update the node representations through the message - passing mechanism and map the entire graph representation to a single value through graph pooling operations, thereby achieving the tracking of the contribution of each feature. ### Summary: The main contribution of the paper is to propose a GNN method that can maintain high performance and provide transparent explanations, solving the black - box problem when existing GNNs are applied to tabular data, and is especially significant in application scenarios requiring trust and interpretability.

Interpretable Graph Neural Networks for Tabular Data

Interpretable Graph Neural Networks for Heterogeneous Tabular Data

The Intelligible and Effective Graph Neural Additive Networks

How Interpretable Are Interpretable Graph Neural Networks?

InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation

Interpretability in Graph Neural Networks

Graph Neural Machine: A New Model for Learning with Tabular Data

Stable and Interpretable Deep Learning for Tabular Data: Introducing InterpreTabNet with the Novel InterpreStability Metric

Interpretable Mesomorphic Networks for Tabular Data

Neural Reasoning Networks: Efficient Interpretable Neural Networks With Automatic Textual Explanations

Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and Directions

Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks

Escaping the Forest: Sparse Interpretable Neural Networks for Tabular Data

TabularNet: A Neural Network Architecture for Understanding Semantic Structures of Tabular Data

EGNN: Constructing explainable graph neural networks via knowledge distillation

TabGraphs: A Benchmark and Strong Baselines for Learning on Graphs with Tabular Node Features

Interpretable A-posteriori Error Indication for Graph Neural Network Surrogate Models

TabGSL: Graph Structure Learning for Tabular Data Prediction

Towards Efficient Learning of GNNs on High-Dimensional Multilayered Representations of Tabular Data

Improving the interpretability of GNN predictions through conformal-based graph sparsification

Globally Interpretable Graph Learning via Distribution Matching