Abstract: Recent advances in machine learning have led to a surge in adoption of neural networks for various tasks, but lack of interpretability remains an issue for many others in which an understanding of the features influencing the prediction is necessary to ensure fairness, safety, and legal compliance. In this paper we consider one class of such tasks, tabular dataset classification, and propose a novel neuro-symbolic architecture, Neural Reasoning Networks (NRN), that is scalable and generates logically sound textual explanations for its predictions. NRNs are connected layers of logical neurons which implement a form of real-valued logic. A training algorithm (R-NRN) learns the weights of the network as usual using gradient descent optimization with backpropagation, but also learns the network structure itself using a bandit-based optimization. Both are implemented in an extension to PyTorch (https://github.com/IBM/torchlogic) that takes full advantage of GPU scaling and batched training. Evaluation on a diverse set of 22 open-source datasets for tabular classification demonstrates performance (measured by ROC AUC) which improves over a multi-layer perceptron (MLP) and is statistically similar to other state-of-the-art approaches such as Random Forest, XGBoost and Gradient Boosted Trees, while offering 43% faster training and a more than 2 orders of magnitude reduction in the number of parameters required, on average. Furthermore, R-NRN explanations are shorter than those of the compared approaches while producing more accurate feature importance scores.

NeuroView: Explainable Deep Network Decision Making

Which Neural Network Makes More Explainable Decisions? An Approach Towards Measuring Explainability

NeuroView-RNN: It's About Time

A Peek Into the Reasoning of Neural Networks: Interpreting with Structural Visual Concepts

Neural Networks Decoded: Targeted and Robust Analysis of Neural Network Decisions via Causal Explanations and Reasoning

How to Explain Neural Networks: A perspective of data space division

NeuralVis: Visualizing and Interpreting Deep Learning Models

Neural network interpretability with layer-wise relevance propagation: novel techniques for neuron selection and visualization

Deeper Interpretability of Deep Networks

BrainNNExplainer: An Interpretable Graph Neural Network Framework for Brain Network based Disease Analysis

Editorial: Deep neural network based decision-making interpretability.

Neural Reasoning Networks: Efficient Interpretable Neural Networks With Automatic Textual Explanations

Visual Interpretability for Deep Learning

A Deep Network for Explainable Prediction of Non-Imaging Phenotypes using Anatomical Multi-View Data

Understanding Neural Networks Through Deep Visualization

Library network, a possible path to explainable neural networks

Novel Interpretable Mechanism of Neural Networks Based on Network Decoupling Method

Interpreting Deep Neural Networks Through Variable Importance

Interpretability of Neural Networks Based on Game-theoretic Interactions

Interpretability for Reliable, Efficient, and Self-Cognitive DNNs: from Theories to Applications.

Embedding deep networks into visual explanations