Abstract:The non-transparency of artificial intelligence (AI) systems, particularly in deep learning (DL), poses significant challenges to their comprehensibility and trustworthiness. This study aims to enhance the explainability of DL models through visual analytics (VA) and human-in-the-loop (HITL) principles, making these systems more transparent and understandable to end users. In this work, we propose a novel approach that utilizes a transition matrix to interpret results from DL models through more comprehensible machine learning (ML) models. The methodology involves constructing a transition matrix between the feature spaces of DL and ML models as formal and mental models, respectively, improving the explainability for classification tasks. We validated our approach with computational experiments on the MNIST, FNC-1, and Iris datasets using a qualitative and quantitative comparison criterion, that is, how different the results obtained by our approach are from the ground truth of the training and testing samples. The proposed approach significantly enhanced model clarity and understanding in the MNIST dataset, with SSIM and PSNR values of 0.697 and 17.94, respectively, showcasing high-fidelity reconstructions. Moreover, achieving an F1m score of 77.76% and a weighted accuracy of 89.38%, our approach proved its effectiveness in stance detection with the FNC-1 dataset, complemented by its ability to explain key textual nuances. For the Iris dataset, the separating hyperplane constructed based on the proposed approach allowed for enhancing classification accuracy. Overall, using VA, HITL principles, and a transition matrix, our approach significantly improves the explainability of DL models without compromising their performance, marking a step forward in developing more transparent and trustworthy AI systems.

Explainable Deep Learning: A Visual Analytics Approach with Transition Matrices

Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

Interpretable Deep Learning Models: Enhancing Transparency and Trustworthiness in Explainable AI

Visual Analytics for Explainable Deep Learning

Understanding the black-box: towards interpretable and reliable deep learning models

Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability

A Survey of the Interpretability Aspect of Deep Learning Models

Transparency and Explanation in Deep Reinforcement Learning Neural Networks

Reason induced visual attention for explainable autonomous driving

Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification

Explainable Deep Classification Models for Domain Generalization

Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Representing visual classification as a linear combination of words

Improving Explainability of Disentangled Representations using Multipath-Attribution Mappings

Disentangled Explanations of Neural Network Predictions by Finding Relevant Subspaces

Explainability in AI Based Applications: A Framework for Comparing Different Techniques

Improving Network Interpretability via Explanation Consistency Evaluation

Explaining Network Intrusion Detection System Using Explainable AI Framework

Achievements and Challenges in Explaining Deep Learning based Computer-Aided Diagnosis Systems

Solving the enigma: Deriving optimal explanations of deep networks

Visual Interpretability forDeepLearning