Black Box Model Explanations and the Human Interpretability Expectations -- An Analysis in the Context of Homicide Prediction

José Ribeiro,Níkolas Carneiro,Ronnie Alves

2024-07-04

Abstract:Strategies based on Explainable Artificial Intelligence (XAI) have promoted better human interpretability of the results of black box models. This opens up the possibility of questioning whether explanations created by XAI methods meet human expectations. The XAI methods being currently used (Ciu, Dalex, Eli5, Lofo, Shap, and Skater) provide various forms of explanations, including global rankings of relevance of features, which allow for an overview of how the model is explained as a result of its inputs and outputs. These methods provide for an increase in the explainability of the model and a greater interpretability grounded on the context of the problem. Intending to shed light on the explanations generated by XAI methods and their interpretations, this research addresses a real-world classification problem related to homicide prediction, already peer-validated, replicated its proposed black box model and used 6 different XAI methods to generate explanations and 6 different human experts. The results were generated through calculations of correlations, comparative analysis and identification of relationships between all ranks of features produced. It was found that even though it is a model that is difficult to explain, 75\% of the expectations of human experts were met, with approximately 48\% agreement between results from XAI methods and human experts. The results allow for answering questions such as: "Are the Expectation of Interpretation generated among different human experts similar?", "Do the different XAI methods generate similar explanations for the proposed problem?", "Can explanations generated by XAI methods meet human expectation of Interpretations?", and "Can Explanations and Expectations of Interpretation work together?".

Machine Learning,Artificial Intelligence

What problem does this paper attempt to address?

The main problem this paper attempts to address is evaluating whether the explanations generated by Explainable Artificial Intelligence (XAI) methods meet the expectations of human experts, and exploring the reliability and effectiveness of these explanations in practical applications. Specifically, the paper addresses this issue through the following aspects: 1. **Evaluating the consistency of XAI methods**: The study examines whether different XAI methods (such as CIU, Dalex, Eli5, Lofo, SHAP, and Skater) can generate similar explanations when addressing the same problem. This helps to understand the reliability and consistency of different XAI methods in explaining black-box models. 2. **Comparing human experts' expectations**: By inviting 6 experts from different fields to interpret the model, the study evaluates whether their expectations for explanations are consistent. This helps to understand the degree of consensus among different experts when explaining the same model. 3. **Validating the consistency between XAI methods and human experts' expectations**: By comparing the explanations generated by XAI methods with the expectations of human experts, the study evaluates whether XAI methods can meet the explanatory needs of human experts. This helps to understand the effectiveness and reliability of XAI methods in practical applications. 4. **Combining XAI methods and human experts' explanations**: The paper proposes a technique called ConeXi, which combines the explanations generated by XAI methods with the expectations of human experts to construct a comprehensive model explanation. This helps to improve the overall quality and credibility of model explanations. The paper selects a specific real-world classification problem—homicide prediction—as the research subject. Through this problem, the paper explores the application of XAI methods in sensitive areas and how to improve the interpretability and credibility of models by combining technical means and human expertise.

Black Box Model Explanations and the Human Interpretability Expectations -- An Analysis in the Context of Homicide Prediction

Interpreting Black-Box Models: A Review on Explainable Artificial Intelligence

How Reliable and Stable are Explanations of XAI Methods?

Explainable AI without Interpretable Model

"How do I fool you?": Manipulating User Trust via Misleading Black Box Explanations

Critical Empirical Study on Black-box Explanations in AI

Explain To Decide: A Human-Centric Review on the Role of Explainable Artificial Intelligence in AI-assisted Decision Making

EXplainable Artificial Intelligence (XAI)—From Theory to Methods and Applications

Human-in-the-Loop Model Explanation via Verbatim Boundary Identification in Generated Neighborhoods

The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations

Unraveling the Dilemma of AI Errors: Exploring the Effectiveness of Human and Machine Explanations for Large Language Models

Model Interpretation and Explainability: Towards Creating Transparency in Prediction Models

Harnessing Explanations to Bridge AI and Humans

Machine Explanations and Human Understanding

Quantifying Explainability in Outcome-Oriented Predictive Process Monitoring

Explaining Black-Box Algorithms Using Probabilistic Contrastive Counterfactuals

Explainability of Artificial Intelligence Methods, Applications and Challenges: A Comprehensive Survey

Shedding Light on Black Box Machine Learning Algorithms: Development of an Axiomatic Framework to Assess the Quality of Methods that Explain Individual Predictions

Fool Me Once? Contrasting Textual and Visual Explanations in a Clinical Decision-Support Setting

Unveiling the Black Box: A Comprehensive Review of Explainable AI Techniques

Explainable artificial intelligence (XAI) post-hoc explainability methods: risks and limitations in non-discrimination law