FairXGBoost: Fairness-aware Classification in XGBoost

Srinivasan Ravichandran,Drona Khurana,Bharath Venkatesh,Narayanan Unny Edakunni

DOI: https://doi.org/10.48550/arXiv.2009.01442

2020-10-07

Abstract:Highly regulated domains such as finance have long favoured the use of machine learning algorithms that are scalable, transparent, robust and yield better performance. One of the most prominent examples of such an algorithm is XGBoost. Meanwhile, there is also a growing interest in building fair and unbiased models in these regulated domains and numerous bias-mitigation algorithms have been proposed to this end. However, most of these bias-mitigation methods are restricted to specific model families such as logistic regression or support vector machine models, thus leaving modelers with a difficult decision of choosing between fairness from the bias-mitigation algorithms and scalability, transparency, performance from algorithms such as XGBoost. We aim to leverage the best of both worlds by proposing a fair variant of XGBoost that enjoys all the advantages of XGBoost, while also matching the levels of fairness from the state-of-the-art bias-mitigation algorithms. Furthermore, the proposed solution requires very little in terms of changes to the original XGBoost library, thus making it easy for adoption. We provide an empirical analysis of our proposed method on standard benchmark datasets used in the fairness community.

Artificial Intelligence

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to achieve model fairness while maintaining the advantages of the XGBoost algorithm (such as scalability, transparency, performance) in highly regulated fields (such as finance). Specifically, most of the existing bias - mitigation methods are applicable to specific model families, such as logistic regression or support vector machine models, and these methods often sacrifice the performance of the model while improving fairness. Therefore, the author proposes a new bias - mitigation technique, aiming to provide a fair variant for XGBoost, so that it can not only maintain its original advantages but also reach a fairness level comparable to the state - of - the - art bias - mitigation algorithms. The main contributions of the paper include: 1. Proposing a regularization - based bias - mitigation technique that can be seamlessly integrated into the greedy tree - building algorithm of XGBoost. 2. Conducting an empirical analysis of the proposed bias - mitigation framework, using benchmark datasets commonly used in fairness research, and comparing it with existing bias - mitigation strategies. In this way, the paper attempts to bridge the gap between high - performance machine - learning algorithms and bias - mitigation techniques in practical applications, especially in strictly regulated industries such as finance.

FairXGBoost: Fairness-aware Classification in XGBoost

How Biased are Your Features?: Computing Fairness Influence Functions with Global Sensitivity Analysis

Less Discriminatory Alternative and Interpretable XGBoost Framework for Binary Classification

Adaptive Boosting with Fairness-aware Reweighting Technique for Fair Classification

Fair Adversarial Gradient Tree Boosting

FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainability

Fair-AdaBoost: Extending AdaBoost method to achieve fair classification

FairMask: Better Fairness via Model-based Rebalancing of Protected Attributes

Understanding Fairness Surrogate Functions in Algorithmic Fairness

Individually Fair Gradient Boosting

AdaFair: Cumulative Fairness Adaptive Boosting

OxonFair: A Flexible Toolkit for Algorithmic Fairness

Towards A Holistic View of Bias in Machine Learning: Bridging Algorithmic Fairness and Imbalanced Learning

Fairness with Adaptive Weights.

Unfairness Despite Awareness: Group-Fair Classification with Strategic Agents

Rethinking Fair Graph Neural Networks from Re-balancing

Explainable and Fair AI: Balancing Performance in Financial and Real Estate Machine Learning Models

Boosting Fair Classifier Generalization through Adaptive Priority Reweighing

Auditing Fairness by Betting

Bayes-Optimal Classifiers under Group Fairness