Retiring $Δ$DP: New Distribution-Level Metrics for Demographic Parity

Xiaotian Han, Zhimeng Jiang, Hongye Jin, Zirui Liu, Na Zou, Qifan Wang, Xia Hu

2023-06-10

Abstract: Demographic parity is the most widely recognized measure of group fairness in machine learning, which ensures equal treatment of different demographic groups. Numerous works aim to achieve demographic parity by pursuing the commonly used metric $\Delta DP$. Unfortunately, in this paper, we reveal that the fairness metric $\Delta DP$ cannot precisely measure the violation of demographic parity, because it inherently has the following drawbacks: i) zero-value $\Delta DP$ does not guarantee zero violation of demographic parity, ii) $\Delta DP$ values can vary with different classification thresholds. To this end, we propose two new fairness metrics, Area Between Probability density function Curves (ABPC) and Area Between Cumulative density function Curves (ABCC), to precisely measure the violation of demographic parity at the distribution level. The new fairness metrics directly measure the difference between the distributions of the prediction probability for different demographic groups. Thus our proposed new metrics enjoy: i) zero-value ABCC/ABPC guarantees zero violation of demographic parity; ii) ABCC/ABPC guarantees demographic parity while the classification thresholds are adjusted. We further re-evaluate the existing fair models with our proposed fairness metrics and observe different fairness behaviors of those models under the new metrics. The code is available at https://github.com/ahxt/new_metric_for_demographic_parity

Machine Learning, Artificial Intelligence, Computers and Society

What problem does this paper attempt to address?

This paper addresses fundamental flaws in ΔDP, the metric commonly used to measure violations of demographic parity. Specifically, it points out:

1. **Zero ΔDP does not guarantee zero bias**: ΔDP equal to zero is a necessary condition for demographic parity, not a sufficient one, so a model with zero ΔDP can still violate demographic parity.
2. **ΔDP does not accurately quantify the degree of violation**: A difference in ΔDP between two models does not reliably reflect a difference in their unfairness, so models cannot be compared accurately on ΔDP alone.
3. **ΔDP depends on the classification threshold**: ΔDP changes as the threshold changes, so adjusting the threshold in practice can change the fairness evaluation.

To overcome these flaws, the paper proposes two new distribution-level fairness metrics: **Area Between Probability density function Curves (ABPC)** and **Area Between Cumulative density function Curves (ABCC)**. Both directly measure the difference between the distributions of predicted probabilities for different demographic groups, and they have the following advantages:

- **Zero ABPC/ABCC guarantees zero bias**: When ABPC or ABCC is zero, demographic parity holds exactly.
- **ABPC/ABCC remains valid when the classification threshold is adjusted**: Neither metric depends on the choice of threshold, so both are more robust in practical applications.

Using these metrics, the paper re-evaluates existing fair models and finds that they exhibit different fairness behaviors than they do under ΔDP. The experimental results show that the new metrics reflect model fairness more accurately, especially in dynamic-threshold scenarios.
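The contrast between the threshold-based and distribution-level metrics can be illustrated with a minimal sketch. It is not the authors' implementation (their code is in the repository linked in the abstract). The function names `delta_dp`, `abcc`, and `abpc` are hypothetical, the metrics are taken to be the area between the two groups' empirical CDFs (ABCC) and estimated PDFs (ABPC) of the predicted probability over [0, 1], and the PDFs are approximated with a simple histogram, which is an assumption made here to keep the example dependency-free.

```python
import numpy as np


def delta_dp(scores, group, threshold=0.5):
    """Gap in positive-prediction rates between the two groups at one threshold."""
    pred = scores >= threshold
    return float(abs(pred[group == 0].mean() - pred[group == 1].mean()))


def _trapezoid(y, x):
    """Trapezoidal rule, written out to avoid depending on a NumPy version."""
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)


def abcc(scores, group, n_grid=10001):
    """Area between the empirical CDFs of the two groups' scores on [0, 1]."""
    grid = np.linspace(0.0, 1.0, n_grid)
    s0 = np.sort(scores[group == 0])
    s1 = np.sort(scores[group == 1])
    # Empirical CDF: fraction of scores less than or equal to each grid point.
    cdf0 = np.searchsorted(s0, grid, side="right") / s0.size
    cdf1 = np.searchsorted(s1, grid, side="right") / s1.size
    return _trapezoid(np.abs(cdf0 - cdf1), grid)


def abpc(scores, group, bins=50):
    """Area between histogram density estimates of the two groups' scores.

    Assumption: a histogram stands in for whatever density estimator
    a real evaluation would use.
    """
    edges = np.linspace(0.0, 1.0, bins + 1)
    pdf0, _ = np.histogram(scores[group == 0], bins=edges, density=True)
    pdf1, _ = np.histogram(scores[group == 1], bins=edges, density=True)
    return float(np.sum(np.abs(pdf0 - pdf1) * np.diff(edges)))


# Toy data (made up for illustration): both groups have a 50% positive rate
# at threshold 0.5, but their score distributions are clearly different.
scores = np.array([0.1, 0.9] * 50 + [0.4, 0.6] * 50)
group = np.array([0] * 100 + [1] * 100)

print("dDP @ 0.5:", delta_dp(scores, group, 0.5))
print("dDP @ 0.3:", delta_dp(scores, group, 0.3))
print("ABCC:", abcc(scores, group))
print("ABPC:", abpc(scores, group))
```

On this toy data, ΔDP is zero at a threshold of 0.5 but 0.5 at a threshold of 0.3, while ABCC (about 0.3) and ABPC (2.0 with these bins) are nonzero without reference to any threshold. This matches the first and third drawbacks listed above.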


On the Maximal Local Disparity of Fairness-Aware Classifiers

Parametric Fairness with Statistical Guarantees

Minimax Optimal Fair Classification with Bounded Demographic Disparity

Auditing and Enforcing Conditional Fairness via Optimal Transport

On the Inductive Biases of Demographic Parity-based Fair Learning Algorithms

Compatibility of Fairness Metrics with EU Non-Discrimination Laws: Demographic Parity & Conditional Demographic Disparity

DPMLBench: Holistic Evaluation of Differentially Private Machine Learning

Fair Bayes-Optimal Classifiers Under Predictive Parity

Estimating and Implementing Conventional Fairness Metrics With Probabilistic Protected Features

Beyond Demographic Parity: Redefining Equal Treatment

Counterfactual Fairness Is Basically Demographic Parity

Fairness Index Measures to Evaluate Bias in Biometric Recognition

A Systematic and Formal Study of the Impact of Local Differential Privacy on Fairness: Preliminary Results

Fairness Under Demographic Scarce Regime

Chasing Fairness Under Distribution Shift: A Model Weight Perturbation Approach

Approximating Discrimination Within Models When Faced With Several Non-Binary Sensitive Attributes

Optimisation Strategies for Ensuring Fairness in Machine Learning: With and Without Demographics

Bursting the Burden Bubble? An Assessment of Sharma et al.'s Counterfactual-based Fairness Metric

Does Machine Bring in Extra Bias in Learning? Approximating Fairness in Models Promptly

Reducing Population-level Inequality Can Improve Demographic Group Fairness: a Twitter Case Study