Abstract: Strategies based on Explainable Artificial Intelligence (XAI) have improved the human interpretability of results produced by black box models. This raises the question of whether the explanations created by XAI methods meet human expectations. The XAI methods currently in use (Ciu, Dalex, Eli5, Lofo, Shap, and Skater) provide various forms of explanation, including global rankings of feature relevance, which give an overview of how the model is explained in terms of its inputs and outputs. These methods increase the explainability of the model and support an interpretation grounded in the context of the problem. To shed light on the explanations generated by XAI methods and on their interpretation, this research addresses a real-world, already peer-validated classification problem related to homicide prediction. It replicates the proposed black box model, uses 6 different XAI methods to generate explanations, and draws on 6 different human experts. The results were obtained through correlation calculations, comparative analysis, and identification of relationships among all the feature rankings produced. Although the model is difficult to explain, 75% of the expectations of human experts were met, with approximately 48% agreement between the results from XAI methods and human experts. The results make it possible to answer questions such as: "Are the Expectations of Interpretation generated by different human experts similar?", "Do the different XAI methods generate similar explanations for the proposed problem?", "Can explanations generated by XAI methods meet human Expectations of Interpretation?", and "Can Explanations and Expectations of Interpretation work together?".
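As an illustration of the kind of rank comparison the abstract describes, the following minimal sketch computes a rank correlation between two global feature rankings. The abstract does not say which correlation coefficient was used, so Spearman's rho is an assumption here, and the function name, feature names, and rankings are hypothetical placeholders rather than data from the study.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman's rho for two tie-free rankings of the same features.

    Each ranking is a list of feature names ordered from most to least
    relevant. Assumption: Spearman is used for illustration only; the
    source does not name the coefficient.
    """
    n = len(rank_a)
    if n < 2:
        raise ValueError("need at least two features")
    if len(set(rank_a)) != n or len(rank_b) != n or set(rank_a) != set(rank_b):
        raise ValueError("rankings must order the same distinct features")
    # Position of every feature in the second ranking.
    pos_b = {feature: i for i, feature in enumerate(rank_b)}
    # Sum of squared rank differences across features.
    d2 = sum((i - pos_b[feature]) ** 2 for i, feature in enumerate(rank_a))
    return 1 - 6 * d2 / (n * (n * n - 1))


# Hypothetical rankings: one from an XAI method, one from a human expert.
xai_ranking = ["feature_1", "feature_2", "feature_3", "feature_4"]
expert_ranking = ["feature_2", "feature_1", "feature_3", "feature_4"]

rho = spearman_rho(xai_ranking, expert_ranking)
print(f"Spearman rho between the two rankings: {rho:.2f}")
```

A value near 1 indicates that the two rankings order the features similarly, a value near -1 indicates reversed orderings. Applying the same function to every pair of rankings would give a pairwise agreement table across methods and experts.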

A Study on Trust in Black Box Models and Post-hoc Explanations

"How do I fool you?": Manipulating User Trust via Misleading Black Box Explanations

Which Neural Network Makes More Explainable Decisions? An Approach Towards Measuring Explainability

Critical Empirical Study on Black-box Explanations in AI

A Survey on the Explainability of Supervised Machine Learning

Benchmarking and survey of explanation methods for black box models

Is Ignorance Bliss? The Role of Post Hoc Explanation Faithfulness and Alignment in Model Trust in Laypeople and Domain Experts

A Survey of Methods for Explaining Black Box Models

Shedding Light on Black Box Machine Learning Algorithms: Development of an Axiomatic Framework to Assess the Quality of Methods that Explain Individual Predictions

Trust Regions for Explanations via Black-Box Probabilistic Certification

A Grey-Box Ensemble Model Exploiting Black-Box Accuracy and White-Box Intrinsic Interpretability

Open the Black Box Data-Driven Explanation of Black Box Decision Systems

Explaining black boxes with a SMILE: Statistical Model-agnostic Interpretability with Local Explanations

Can I Trust the Explainer? Verifying Post-hoc Explanatory Methods

Stop ordering machine learning algorithms by their explainability! A user-centered investigation of performance and explainability

When Post Hoc Explanation Knocks: Consumer Responses to Explainable AI Recommendations

Model Interpretation and Explainability: Towards Creating Transparency in Prediction Models

"Why Should You Trust My Explanation?" Understanding Uncertainty in LIME Explanations

Trusting AI made decisions in healthcare by making them explainable

Interpreting Black-Box Models: A Review on Explainable Artificial Intelligence

Black Box Model Explanations and the Human Interpretability Expectations -- An Analysis in the Context of Homicide Prediction