FairXGBoost: Fairness-aware Classification in XGBoost

Srinivasan Ravichandran,Drona Khurana,Bharath Venkatesh,Narayanan Unny Edakunni
DOI: https://doi.org/10.48550/arXiv.2009.01442
2020-10-07
Abstract:Highly regulated domains such as finance have long favoured the use of machine learning algorithms that are scalable, transparent, robust and yield better performance. One of the most prominent examples of such an algorithm is XGBoost. Meanwhile, there is also a growing interest in building fair and unbiased models in these regulated domains and numerous bias-mitigation algorithms have been proposed to this end. However, most of these bias-mitigation methods are restricted to specific model families such as logistic regression or support vector machine models, thus leaving modelers with a difficult decision of choosing between fairness from the bias-mitigation algorithms and scalability, transparency, performance from algorithms such as XGBoost. We aim to leverage the best of both worlds by proposing a fair variant of XGBoost that enjoys all the advantages of XGBoost, while also matching the levels of fairness from the state-of-the-art bias-mitigation algorithms. Furthermore, the proposed solution requires very little in terms of changes to the original XGBoost library, thus making it easy for adoption. We provide an empirical analysis of our proposed method on standard benchmark datasets used in the fairness community.
Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to achieve model fairness while maintaining the advantages of the XGBoost algorithm (such as scalability, transparency, performance) in highly regulated fields (such as finance). Specifically, most of the existing bias - mitigation methods are applicable to specific model families, such as logistic regression or support vector machine models, and these methods often sacrifice the performance of the model while improving fairness. Therefore, the author proposes a new bias - mitigation technique, aiming to provide a fair variant for XGBoost, so that it can not only maintain its original advantages but also reach a fairness level comparable to the state - of - the - art bias - mitigation algorithms. The main contributions of the paper include: 1. Proposing a regularization - based bias - mitigation technique that can be seamlessly integrated into the greedy tree - building algorithm of XGBoost. 2. Conducting an empirical analysis of the proposed bias - mitigation framework, using benchmark datasets commonly used in fairness research, and comparing it with existing bias - mitigation strategies. In this way, the paper attempts to bridge the gap between high - performance machine - learning algorithms and bias - mitigation techniques in practical applications, especially in strictly regulated industries such as finance.