Abstract:With the widespread application of Deep Learning (DL), the black-box characteristics of DL raise questions, especially in high-stake decision-making fields like autonomous driving. Consequently, there is a growing demand for research on the interpretability of DL, leading to the emergence of eXplainable Artificial Intelligence as a current research hotspot. Current research on DL interpretability primarily focuses on transparency and post-hoc interpretability. Enhancing interpretability in transparency often requires targeted modifications to the model structure, potentially compromising the model's accuracy. Conversely, improving the interpretability of DL models based on post-hoc interpretability usually does not necessitate adjustments to the model itself. To provide a fast and accurate counterfactual explanation of DL without compromising its performance, this paper proposes a post-hoc interpretation method called relevance inference based on direct contribution to employ counterfactual reasoning in DL. In this method, direct contribution is first designed by improving Layer-wise Relevance Propagation to measure the relevance between the outputs and the inputs. Subsequently, we produce counterfactual examples based on direct contribution. Ultimately, counterfactual results for the DL model are obtained with these counterfactual examples. These counterfactual results effectively describe the behavioral boundaries of the model, facilitating a better understanding of its behavior. Additionally, direct contribution offers an easily implementable interpretable analysis method for studying model behavior. Experiments conducted on various datasets demonstrate that relevance inference can be more efficiently and accurately generate counterfactual examples compared to the state-of-the-art methods, aiding in the analysis of behavioral boundaries in intelligent decision-making models for vehicles.

FENet: A Feature Explanation Network with a Hierarchical Interpretable Architecture for Intelligent Decision-Making

Which Neural Network Makes More Explainable Decisions? an Approach Towards Measuring Explainability

Feature Analysis Network: An Interpretable Idea in Deep Learning

Relevance Inference Based on Direct Contribution: Counterfactual Explanation to Deep Networks for Intelligent Decision-making

Fenet: Feature Enhancement Network for Arbitrary Direction Text Detection

FENet: feature enhancement network for land cover classification

FENet: Roles Classification of IP Addresses Using Connection Patterns

FINER: Enhancing State-of-the-art Classifiers with Feature Attribution to Facilitate Security Analysis

An Explainable Intrusion Detection System Based on Feature Importance

FE-RNN: A fuzzy embedded recurrent neural network for improving interpretability of underlying neural network

Explainable Deep Learning-Based Feature Selection and Intrusion Detection Method on the Internet of Things

Feature Enhancement Network for Object Detection in Optical Remote Sensing Images

Fuzzy-clustering and fuzzy network based interpretable fuzzy model for prediction

Deep Neural-Fuzzy System Algorithms with Improved Interpretability for Classification Problems

FAPI-Net: A Lightweight Interpretable Network Based on Feature Augmentation and Prototype Interpretation.

A Lightweight Feature Distillation and Enhancement Network for Super-Resolution Remote Sensing Images

Feature-Enhanced Neural Collaborative Reasoning for Explainable Recommendation

Construction of a Feature Enhancement Network for Small Object Detection

Xnids: Explaining Deep Learning-based Network Intrusion Detection Systems for Active Intrusion Responses.

HEN: a Novel Hybrid Explainable Neural Network Based Framework for Robust Network Intrusion Detection

WRNFS: Width Residual Neuro Fuzzy System, a Fast-Learning Algorithm with High Interpretability