Abstract:Although neural networks have achieved great successes in various machine learning tasks, people can hardly know what neural networks learn from data due to their black-box nature. The lack of such explainability is one of the limitations of neural networks when applied in domains, e.g., healthcare and finance, that demand transparency and accountability. Moreover, explainability is beneficial for guiding a neural network to learn the causal patterns that can extrapolate out-of-distribution (OOD) data, which is critical in real-world applications and has surged as a hot research topic. In order to improve the explainability of neural networks, we propose a novel method—Explainable Neural Rule Learning (denoted as ENRL), with the aim to integrate the expressiveness of neural networks and the explainability of rule-based systems. Specifically, we first design several operator modules and guide them to behave as certain relational operators via self-supervised learning. With input feature fields and learnable context values serving as arguments, these operator modules are used as predicates to constitute the atomic propositions. Then we employ neural logical operations to combine atomic propositions into a collection of rules. Finally, we design a voting mechanism for these rules so that they collaboratively make up our predictive model. Thus, rule learning is transformed to neural architecture search, that is, to choose the appropriate arrangements of feature fields and operator modules. After searching for a specific architecture and learning the involved modules, the resulting neural network explicitly expresses some rules and thus possesses explainability. Therefore, we can predict for each input instance according to rules it satisfies, which at the same time explains how the neural network makes that decision. We conduct a series of experiments on both synthetic and real-world datasets to evaluate ENRL. Compared with conventional neural networks, ENRL achieves competitive in-distribution performance while providing the extra benefits of explainability. Meanwhile, ENRL significantly alleviates performance drop on OOD test data, implying the effectiveness of rule learning. Codes are provided at https://github.com/Shuriken13/ENRL.

Novel approach for explaining the behavior of trained artificial neural networks with distributed representations

Using learning and searching approach to explain neural network with distributed representations

Which Neural Network Makes More Explainable Decisions? an Approach Towards Measuring Explainability

Using Two-Phase Approach to Extract Knowledge from Artificial Neural Networks

Towards Interpreting Recurrent Neural Networks Through Probabilistic Abstraction

Extract Interpretability-Accuracy balanced Rules from Artificial Neural Networks: A Review

Rule Extraction using Artificial Neural Networks

Extracting Symbolic Rules from Trained Neural Network Ensembles

Rule Extraction Algorithm for Deep Neural Networks: A Review

AN APPROACH TO RULE EXTRACTION OF NEURAL NETWORKS

NN2Rules: Extracting Rule List from Neural Networks

Learning distributed representations of knowledge that preserve deductive reasoning

ReNN: Rule-embedded Neural Networks

Functional Rule Extraction Method for Artificial Neural Networks

Enabling Regional Explainability by Automatic and Model-agnostic Rule Extraction

Interpretable Disentanglement of Neural Networks by Extracting Class-Specific Subnetwork

Interpret Neural Networks by Extracting Critical Subnetworks

Explainable Neural Rule Learning

Explainable Neural Networks: Achieving Interpretability in Neural Models

RMNA: A Neighbor Aggregation-Based Knowledge Graph Representation Learning Model Using Rule Mining

A Statistics Based Approach for Extracting Priority Rules from Trained Neural Networks.