Abstract: Interpretability of intelligent algorithms represented by deep learning remains an open problem. We discuss the shortcomings of existing explanation methods in terms of two attributes of an explanation: completeness and explicitness. Furthermore, we point out that a model that relies entirely on a feed-forward mapping is prone to being inexplicable, because it is hard to quantify the relationship between this mapping and the final model. From the perspective of data space division, this paper proposes the principle of complete local interpretable model-agnostic explanations (CLIMEP). For classification problems, we further discuss the equivalence between CLIMEP and the decision boundary. In practice, however, CLIMEP is difficult to implement. To tackle this challenge, and motivated by the fact that a fully-connected neural network (FCNN) with piece-wise linear activation functions (PWLs) partitions the input space into several linear regions, we extend this result to arbitrary FCNNs by linearizing the activation functions. Applied to classification problems, this technique makes it possible, for the first time, to obtain the complete decision boundary of FCNNs. Finally, we propose the DecisionNet (DNet), which divides the input space by the hyper-planes of the decision boundary, so that each linear interval of the DNet contains only samples of the same label. Experiments show that the DNet achieves surprising model compression efficiency with arbitrarily controlled precision.
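The observation that an FCNN with piece-wise linear activations partitions the input space into linear regions can be illustrated with a small sketch. The code below is not the paper's CLIMEP or DNet procedure; it is a minimal NumPy example, assuming a ReLU network with randomly drawn weights, and the helper names (`make_network`, `forward`, `local_affine_map`) are hypothetical. For a given input it recovers the affine map that the network computes on the linear region containing that input.

```python
import numpy as np


def make_network(sizes, seed=0):
    """Random weights and biases for an FCNN (illustrative assumption, not a trained model)."""
    rng = np.random.default_rng(seed)
    weights = [rng.standard_normal((m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
    biases = [rng.standard_normal(m) for m in sizes[1:]]
    return weights, biases


def forward(weights, biases, x):
    """Plain forward pass: ReLU on hidden layers, linear output layer."""
    h = x
    for W, b in zip(weights[:-1], biases[:-1]):
        h = np.maximum(W @ h + b, 0.0)
    return weights[-1] @ h + biases[-1]


def local_affine_map(weights, biases, x):
    """Return (A, c, pattern) such that f(z) = A @ z + c for every z
    that shares the ReLU activation pattern of x."""
    A = np.eye(x.shape[0])
    c = np.zeros(x.shape[0])
    pattern = []
    for W, b in zip(weights[:-1], biases[:-1]):
        # Pre-activation of this layer as an affine function of the input.
        A, c = W @ A, W @ c + b
        active = (A @ x + c) > 0
        pattern.append(active)
        # ReLU zeroes the rows of inactive units inside this region.
        A = A * active[:, None]
        c = c * active
    # Final linear layer.
    A, c = weights[-1] @ A, weights[-1] @ c + biases[-1]
    return A, c, pattern


if __name__ == "__main__":
    weights, biases = make_network([2, 8, 8, 3])
    x = np.array([0.3, -0.7])
    A, c, pattern = local_affine_map(weights, biases, x)
    # The affine map reproduces the network output at x.
    print(np.allclose(forward(weights, biases, x), A @ x + c))
```

Within one such region, the boundary between classes i and j is the set of points z satisfying (A[i] - A[j]) @ z + (c[i] - c[j]) = 0, which is a piece of a hyperplane. This is the sense in which a decision boundary of a piece-wise linear network is assembled from hyperplane pieces.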

Interpretability of Neural Networks Based on Game-theoretic Interactions

Which Neural Network Makes More Explainable Decisions? an Approach Towards Measuring Explainability

Discovering the Representation Bottleneck of Graph Neural Networks from Multi-order Interactions

Explaining Generalization Power of a DNN Using Interactive Concepts

GraphGI: A GNN Explanation Method using Game Interaction

The two-way knowledge interaction interface between humans and neural networks

A game method for improving the interpretability of convolution neural network

Discovering and Explaining the Representation Bottleneck of Graph Neural Networks from Multi-order Interactions

Distributing Synergy Functions: Unifying Game-Theoretic Interaction Methods for Machine-Learning Explainability

Discovering and Explaining the Representation Bottleneck of DNNs

Interpretability for Reliable, Efficient, and Self-Cognitive DNNs: from Theories to Applications.

Defining and Extracting generalizable interaction primitives from DNNs

How to Explain Neural Networks: an Approximation Perspective

Interpreting Multivariate Shapley Interactions in DNNs

How to Explain Neural Networks: A perspective of data space division

Explaining How a Neural Network Play the Go Game and Let People Learn

GAMI-Net: An Explainable Neural Network based on Generalized Additive Models with Structured Interactions

A Survey of the Interpretability Aspect of Deep Learning Models

Towards the Dynamics of a DNN Learning Symbolic Interactions

Technical Note: Defining and Quantifying AND-OR Interactions for Faithful and Concise Explanation of DNNs

Interpretability in Graph Neural Networks