Exploring the Relationship Between Feature Attribution Methods and Model Performance

Priscylla Silva,Claudio T. Silva,Luis Gustavo Nonato

2024-05-23

Abstract:Machine learning and deep learning models are pivotal in educational contexts, particularly in predicting student success. Despite their widespread application, a significant gap persists in comprehending the factors influencing these models' predictions, especially in explainability within education. This work addresses this gap by employing nine distinct explanation methods and conducting a comprehensive analysis to explore the correlation between the agreement among these methods in generating explanations and the predictive model's performance. Applying Spearman's correlation, our findings reveal a very strong correlation between the model's performance and the agreement level observed among the explanation methods.

Machine Learning

What problem does this paper attempt to address?

The paper discusses the relationship between feature attribution methods and the performance of machine learning and deep learning models, particularly in the field of education. The research reveals that these models are widely used for predicting student success, but there is a significant gap in understanding the predictive factors, particularly in terms of interpretability. The paper conducts a comprehensive analysis of the models using nine different explanation methods to explore the relationship between the consensus level in generating explanations and the predictive performance of the models. The author calculates the (in)consistency measures between different explanation methods, such as feature consistency, sign consistency, rank consistency, and sign rank consistency, and uses Spearman correlation to study the relationship between these measures and the model performance (measured by AUC). Experimental results on two real-world datasets, one about an introductory programming course and another from a study by Amrieh et al., show that there is a strong positive correlation between model performance and the consistency of the explanation methods in most cases. When the model performance improves, the consensus among the explanation methods also increases, indicating that higher quality models are easier to interpret. This finding is of great significance to practitioners in the field of education because choosing well-performing models can enhance the consistency and reliability among explanation methods, leading to a better understanding of the factors that influence student success. The research highlights the close connection between model performance and interpretability, suggesting that the model's performance should be taken into account before using explanation methods.

Exploring the Relationship Between Feature Attribution Methods and Model Performance

A Study of Academic Achievement Attribution Analysis Based on Explainable Machine Learning Techniques.

Toward Understanding the Disagreement Problem in Neural Network Feature Attribution

Evaluating Explainability in Machine Learning Predictions through Explainer-Agnostic Metrics

A Unified Framework for Input Feature Attribution Analysis

Provably Better Explanations with Optimized Aggregation of Feature Attributions

Selective Explanations

Cross-model consensus of explanations and beyond for image classification models: an empirical study

A Dual-Perspective Approach to Evaluating Feature Attribution Methods

Multi-objective Feature Attribution Explanation For Explainable Machine Learning

Assessing the Reliability of Machine Learning Explanations in ECG Analysis Through Feature Attribution

Comparison of feature importance measures as explanations for classification models

Explaining the Unexplained: Revealing Hidden Correlations for Better Interpretability

A Song of (Dis)agreement: Evaluating the Evaluation of Explainable Artificial Intelligence in Natural Language Processing

Learning to Scaffold: Optimizing Model Explanations for Teaching

Explaining Predictions from Machine Learning Models: Algorithms, Users, and Pedagogy

Model Explainability in Deep Learning Based Natural Language Processing

Model Selection and Evaluation for Learning Analytics Via Interpretable Machine Learning

The Exploration of Modelling for The Student Achievement Predictor

Explaining Language Models' Predictions with High-Impact Concepts

Explainable Recommendation via Interpretable Feature Mapping and Evaluation of Explainability