Abstract:Algorithmic scoring methods are widely used in the finance industry for several decades in order to prevent risk and to automate and optimize decisions. Regulatory requirements as given by the Basel Committee on Banking Supervision (BCBS) or the EU data protection regulations have led to an increasing interest and research activity on understanding black box machine learning models by means of explainable machine learning. Even though this is a step into a right direction, such methods are not able to guarantee for a fair scoring as machine learning models are not necessarily unbiased and may discriminate with respect to certain subpopulations such as a particular race, gender, or sexual orientation-even if the variable itself is not used for modeling. This is also true for white box methods like logistic regression. In this study, a framework is presented that allows analyzing and developing models with regard to fairness. The proposed methodology is based on techniques of causal inference and some of the methods can be linked to methods from explainable machine learning. A definition of counterfactual fairness is given together with an algorithm that results in a fair scoring model. The concepts are illustrated by means of a transparent simulation and a popular real-world example, the German Credit data using traditional scorecard models based on logistic regression and weight of evidence variable pre-transform. In contrast to previous studies in the field for our study, a corrected version of the data is presented and used. With the help of the simulation, the trade-off between fairness and predictive accuracy is analyzed. The results indicate that it is possible to remove unfairness without a strong performance decrease unless the correlation of the discriminative attributes on the other predictor variables in the model is not too strong. In addition, the challenge in explaining the resulting scoring model and the associated fairness implications to users is discussed.

A novel framework for enhancing transparency in credit scoring: Leveraging Shapley values for interpretable credit scorecards

Transparency, Auditability and eXplainability of Machine Learning Models in Credit Scoring

A federated interpretable scorecard and its application in credit scoring

Explainable AI for Interpretable Credit Scoring

Enhancing transparency and fairness in automated credit decisions: an explainable novel hybrid machine learning approach

A Vertical Federated Learning Method for Interpretable Scorecard and Its Application in Credit Scoring

Linear Discriminant Analysis in Credit Scoring: A Transparent Hybrid Model Approach

Interpretable machine learning for imbalanced credit scoring datasets

Prediction of bank credit worthiness through credit risk analysis: an explainable machine learning study

Enabling Machine Learning Algorithms for Credit Scoring -- Explainable Artificial Intelligence (XAI) methods for clear understanding complex predictive models

Feature Enhanced Ensemble Modeling with Voting Optimization for Credit Risk Assessment

Explaining Credit Risk Scoring through Feature Contribution Alignment with Expert Risk Analysts

Explainable AI in Credit Risk Management

Evolving Transparent Credit Risk Models: A Symbolic Regression Approach Using Genetic Programming

Less Discriminatory Alternative and Interpretable XGBoost Framework for Binary Classification

Analyzing Machine Learning Models for Credit Scoring with Explainable AI and Optimizing Investment Decisions

PSD2 Explainable AI Model for Credit Scoring

Facing the Challenges of Developing Fair Risk Scoring Models

Investigating the beneficial impact of segmentation-based modelling for credit scoring

Application of Machine Learning in Credit Risk Scorecard

SHAP and LIME: An Evaluation of Discriminative Power in Credit Risk