Abstract:Data mining is the most widely used method for discovering knowledge. There are numerous data mining tasks, with classification being the most frequently encountered task in various application domains such as fraud detection, disease diagnosis, text classification, and so on. Many classification techniques, such as Bayesian classifiers, decision trees, genetic algorithms, neural networks (NNs), and so on, are available to help researchers solve problems in a variety of domains. However, NNs are the most frequently used classification approach because they are effective at solving classification problems that cannot be divided into linear and non-linear categories, have high classification accuracy on large datasets, and require minimal processing effort. Despite having good classification performances, NNs have a pitfall associated with them which hinders their applicability in some real-world applications. NNs are black boxes in nature, which means they cannot make transparent decisions that humans can interpret. Because of this limitation, NNs are unsuitable for many applications that require transparency in decision-making as well as high accuracy, such as audit mining or medical diagnosis. The well-known solution to this inherent disadvantage of NNs is to extract explainable decision rules from them. The extracted rules provide a detailed understanding of how NNs work in a human-readable format. Rule extraction is a well-established technique with a plethora of literature on the subject. However, there are very few papers whose primary goal is to survey the existing literature. As a result, the goal of this work is to provide a detailed analysis of the existing literature and to create a framework for existing and new researchers to conduct research in this field. The paper examines the state-of art from the perspective of designing framework of the algorithms, evaluation criteria, and applications.

Which Neural Network Makes More Explainable Decisions? an Approach Towards Measuring Explainability

A Pixel-Level Explainable Approach of Convolutional Neural Networks and Its Application

Do Explanations Reflect Decisions? A Machine-centric Strategy to Quantify the Performance of Explainability Algorithms

Can I Trust the Explainer? Verifying Post-hoc Explanatory Methods

Explainable Neural Networks: Achieving Interpretability in Neural Models

Improving Network Interpretability via Explanation Consistency Evaluation

On the Robustness of Explanations of Deep Neural Network Models: A Survey

Neuro-Symbolic AI: Explainability, Challenges, and Future Trends

Neural Networks Decoded: Targeted and Robust Analysis of Neural Network Decisions via Causal Explanations and Reasoning

Solving the enigma: Deriving optimal explanations of deep networks

How Well Do Feature-Additive Explainers Explain Feature-Additive Predictors?

Disentangled Explanations of Neural Network Predictions by Finding Relevant Subspaces

Evaluating The Explainability of State-of-the-Art Machine Learning-based Online Network Intrusion Detection Systems

A Survey on the Explainability of Supervised Machine Learning

Stop ordering machine learning algorithms by their explainability! A user-centered investigation of performance and explainability

Trustworthy Conceptual Explanations for Neural Networks in Robot Decision-Making

GraphFramEx: Towards Systematic Evaluation of Explainability Methods for Graph Neural Networks

Sensitivity based Neural Networks Explanations

Model Explainability in Deep Learning Based Natural Language Processing

Explaining Network Intrusion Detection System Using Explainable AI Framework

Explainability through uncertainty: Trustworthy decision-making with neural networks