Abstract:While machine learning models have achieved unprecedented success in real-world applications, they might make biased/unfair decisions for specific demographic groups and hence result in discriminative outcomes. Although research efforts have been devoted to measuring and mitigating bias, they mainly study bias from the result-oriented perspective while neglecting the bias encoded in the decision-making procedure. This results in their inability to capture procedure-oriented bias, which therefore limits the ability to have a fully debiasing method. Fortunately, with the rapid development of explainable machine learning, explanations for predictions are now available to gain insights into the procedure. In this work, we bridge the gap between fairness and explainability by presenting a novel perspective of procedure-oriented fairness based on explanations. We identify the procedure-based bias by measuring the gap of explanation quality between different groups with Ratio-based and Value-based Explanation Fairness. The new metrics further motivate us to design an optimization objective to mitigate the procedure-based bias where we observe that it will also mitigate bias from the prediction. Based on our designed optimization objective, we propose a Comprehensive Fairness Algorithm (CFA), which simultaneously fulfills multiple objectives - improving traditional fairness, satisfying explanation fairness, and maintaining the utility performance. Extensive experiments on real-world datasets demonstrate the effectiveness of our proposed CFA and highlight the importance of considering fairness from the explainability perspective. Our code is publicly available at <a class="link-external link-https" href="https://github.com/YuyingZhao/FairExplanations-CFA" rel="external noopener nofollow">this https URL</a> .

An Explainable Feature Selection Approach for Fair Machine Learning

Fairness and Explainability: Bridging the Gap Towards Fair Model Explanations

Procedural Fairness in Machine Learning

Fair Feature Selection: A Causal Perspective

Fair Causal Feature Selection

Explainable Fairness in Recommendation

An ExplainableFair Framework for Prediction of Substance Use Disorder Treatment Completion

Explaining Algorithmic Fairness Through Fairness-Aware Causal Path Decomposition

Making Fair ML Software using Trustworthy Explanation

Fair Streaming Feature Selection

Explaining contributions of features towards unfairness in classifiers: A novel threshold-dependent Shapley value-based approach

How Biased are Your Features?: Computing Fairness Influence Functions with Global Sensitivity Analysis

Evaluating Fair Feature Selection in Machine Learning for Healthcare

Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking

Explainability for fair machine learning

Modeling Techniques for Machine Learning Fairness: A Survey

Designing a feature selection method based on explainable artificial intelligence

Model Debiasing via Gradient-based Explanation on Representation

FairerML: An Extensible Platform for Analysing, Visualising, and Mitigating Biases in Machine Learning

Explainable AI for Fair Sepsis Mortality Predictive Model

Beyond Distributive Fairness in Algorithmic Decision Making: Feature Selection for Procedurally Fair Learning