Abstract:Motivated by settings in which predictive models may be required to be non-discriminatory with respect to certain attributes (such as race), but even collecting the sensitive attribute may be forbidden or restricted, we initiate the study of fair learning under the constraint of differential privacy. We design two learning algorithms that simultaneously promise differential privacy and equalized odds, a 'fairness' condition that corresponds to equalizing false positive and negative rates across protected groups. Our first algorithm is a private implementation of the equalized odds post-processing approach of [Hardt et al., 2016]. This algorithm is appealingly simple, but must be able to use protected group membership explicitly at test time, which can be viewed as a form of 'disparate treatment'. Our second algorithm is a differentially private version of the oracle-efficient in-processing approach of [Agarwal et al., 2018] that can be used to find the optimal fair classifier, given access to a subroutine that can solve the original (not necessarily fair) learning problem. This algorithm is more complex but need not have access to protected group membership at test time. We identify new tradeoffs between fairness, accuracy, and privacy that emerge only when requiring all three properties, and show that these tradeoffs can be milder if group membership may be used at test time. We conclude with a brief experimental evaluation.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to train non - discriminatory prediction models while protecting sensitive attributes (such as race). Specifically, researchers are concerned with how to design learning algorithms that satisfy both differential privacy (DP) and fairness conditions (such as Equalized Odds) during the data collection process, even when collecting sensitive attributes is not allowed or restricted. This involves ensuring that the model does not discriminate against specific groups without directly using protected attributes (such as race). The paper proposes two learning algorithms to achieve differential privacy and fairness simultaneously: 1. **Differential Privacy Post - processing Method** (DP - postprocessing): This is a privatized implementation of the post - processing method proposed in [Hardt et al., 2016]. This method adjusts the model's prediction results by explicitly using the protected group membership at test time to meet the fairness condition of Equalized Odds. Although this method is simple and easy to implement, it requires access to protected group membership information at test time, which may be infeasible or illegal in some applications. 2. **Differential Privacy Pre - processing Method** (DP - oracle - learner): This is a differential privacy version of the pre - processing method based on [Agarwal et al., 2018], which can find the optimal fair classifier without accessing protected group membership information. This method is more complex, but does not require access to protected group membership information at test time. The paper also explores the new trade - offs that emerge when differential privacy, accuracy, and fairness are required simultaneously, and shows that these trade - offs may become more moderate when group membership information is allowed to be used at test time. Finally, the paper experimentally evaluates the performance of the proposed algorithms.

Differentially Private Fair Learning

Stochastic Differentially Private and Fair Learning

Mitigating Disparate Impact on Model Accuracy in Differentially Private Learning.

Differentially Private Fair Binary Classifications

Differentially Private Post-Processing for Fair Regression

Differentially Private Algorithms for Empirical Machine Learning

Oracle-Efficient Differentially Private Learning with Public Data

Fair Differential Privacy Can Mitigate the Disparate Impact on Model Accuracy

Fair Differentially Private Federated Learning Framework

Towards Understanding the Fairness of Differentially Private Margin Classifiers

An Empirical Analysis of Fairness Notions under Differential Privacy

Differentially Private Learning with Small Public Data.

A Stochastic Optimization Framework for Private and Fair Learning From Decentralized Data

Differential Privacy Under Class Imbalance: Methods and Empirical Insights

Fairness-aware Differentially Private Collaborative Filtering

Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness

A Distributed Fair Machine Learning Framework with Private Demographic Data Protection

Differential Fairness: An Intersectional Framework for Fair AI

Differentially Private Active Learning: Balancing Effective Data Selection and Privacy

Differentially Private and Fair Classification Via Calibrated Functional Mechanism.