FairML: A Julia Package for Fair Classification

Jan Pablo Burgard,João Vitor Pamplona
2024-12-02
Abstract:In this paper, we propose <a class="link-external link-http" href="http://FairML.jl" rel="external noopener nofollow">this http URL</a>, a Julia package providing a framework for fair classification in machine learning. In this framework, the fair learning process is divided into three stages. Each stage aims to reduce unfairness, such as disparate impact and disparate mistreatment, in the final prediction. For the preprocessing stage, we present a resampling method that addresses unfairness coming from data imbalances. The in-processing phase consist of a classification method. This can be either one coming from the <a class="link-external link-http" href="http://MLJ.jl" rel="external noopener nofollow">this http URL</a> package, or a user defined one. For this phase, we incorporate fair ML methods that can handle unfairness to a certain degree through their optimization process. In the post-processing, we discuss the choice of the cut-off value for fair prediction. With simulations, we show the performance of the single phases and their combinations.
Machine Learning,Optimization and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to reduce unfairness in machine - learning classification, such as disparate impact and disparate mistreatment. Specifically, the paper proposes a Julia language package named FairML.jl, aiming to provide a framework for fair classification. This framework reduces unfairness in the final prediction through three stages: 1. **Pre - processing stage**: A resampling method is proposed to address the unfairness caused by data imbalance. 2. **Processing stage**: It includes a classification method, which can be from the MLJ.jl package or a user - defined method. In this stage, fair machine - learning methods are introduced, and these methods can handle unfairness to a certain extent through the optimization process. 3. **Post - processing stage**: The selection of cut - off values for fair prediction is discussed. The paper demonstrates the performance of each stage and their combinations through simulation, aiming to ensure that the classification results can be more fair and just under different sensitive features. In particular, the paper focuses on the following fairness metrics: - Disparate Impact (DI) - Disparate Mistreatment (DM) - False Positive Rate Equality (FPR) - False Negative Rate Equality (FNR) These metrics are used as constraints in the optimization problem to ensure that potential unfairness is reduced while improving classification accuracy.