Abstract:As machine learning algorithms have been widely deployed across applications, many concerns have been raised over the fairness of their predictions, especially in high stakes settings (such as facial recognition and medical imaging). To respond to these concerns, the community has proposed and formalized various notions of fairness as well as methods for rectifying unfair behavior. While fairness constraints have been studied extensively for classical models, the effectiveness of methods for imposing fairness on deep neural networks is unclear. In this paper, we observe that these large models overfit to fairness objectives, and produce a range of unintended and undesirable consequences. We conduct our experiments on both facial recognition and automated medical diagnosis datasets using state-of-the-art architectures.

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is: **the challenges and problems faced by deep neural networks (DNNs) when fairness constraints are imposed during the training process**. Specifically, the author explores the effectiveness and limitations of existing fairness methods when applied to deep neural networks, especially their performance in face recognition and medical image classification tasks. ### Main Problems 1. **Over - fitting Problem**: - Due to the highly overparameterized nature of deep neural networks, they can fit perfectly on the training data, resulting in the inability to obtain meaningful bias measures in the training data. - This over - fitting phenomenon makes it difficult for the fairness constraints imposed on the training data to be generalized to unseen test data. 2. **Ineffective Fairness Intervention**: - The paper finds that many fairness interventions that are effective in classical machine - learning models (such as SVM, logistic regression, etc.) do not work in deep neural networks. - Even if these fairness interventions seem effective, they may lead to unfair phenomena, such as "fairness gerrymandering", that is, moving the decision boundary to meet certain fairness constraints, thus having an adverse impact on important subgroups that are not considered. 3. **Domain - specific Problems**: - In face recognition tasks, even if the loss is equalized between genders on the training data, this fairness is not maintained on the validation and test sets, and may instead lead to greater differences in other sensitive attributes (such as age). - In medical image classification tasks, imposing fairness constraints may reduce the performance of the model on the test set, especially in multi - label classification tasks. ### Formula Summary - **Accuracy Parity**: \[ P(\hat{y} \neq y|a = 0)\approx P(\hat{y} \neq y|a = 1) \] To achieve this goal, loss parity is used as a substitute: \[ \min_{\theta}[\hat{L}(f_{\theta})+\alpha|\hat{L}_{a}^{+}(f_{\theta})-\hat{L}_{a}^{-}(f_{\theta})|] \] - **Equal Opportunity**: \[ P(\hat{y} = 1|a = 1,y = y)\approx P(\hat{y} = 1|a = 0,y = y) \] - **Disparate Impact**: \[ P(\hat{y} = 1|a = 1)\approx P(\hat{y} = 1|a = 0) \] The corresponding objective function is: \[ \min_{\theta}[\hat{L}(f_{\theta})+\alpha\cdot di] \] where, \[ di = -\min\left(\frac{\sum_{i}a_{i}p_{i}}{\sum_{i}a_{i}},\frac{\sum_{i}(1 - a_{i})p_{i}}{\sum_{i}(1 - a_{i})}\right) \] ### Conclusion The paper shows through experiments that existing fairness methods have significant limitations in deep neural networks. In particular, due to the over - fitting problem, these methods are difficult to effectively control bias in practical applications. Therefore, researchers need to explore new methods to address these challenges to ensure the fairness of deep - learning models in high - risk application scenarios.

Technical Challenges for Training Fair Neural Networks

Fairness Testing of Deep Image Classification with Adequacy Metrics

FaiR-N: Fair and Robust Neural Networks for Structured Data

Enhancing Fairness in Neural Networks Using FairVIC

Fairness in Deep Learning: A Survey on Vision and Language Research

Evaluating Proposed Fairness Models for Face Recognition Algorithms

Evaluating the Fairness of Neural Collapse in Medical Image Classification

NeuronFair

Rethinking Bias Mitigation: Fairer Architectures Make for Fairer Face Recognition

Perfectly Parallel Fairness Certification of Neural Networks

FairFix: Enhancing Fairness of Pre-Trained Deep Neural Networks with Scarce Data Resources

Fairness-Aware Graph Neural Networks: A Survey

Fairness in Machine Learning: Definition, Testing, Debugging, and Application

Last-Layer Fairness Fine-tuning is Simple and Effective for Neural Networks

Automatic Fairness Testing of Neural Classifiers through Adversarial Sampling

FairQuant: Certifying and Quantifying Fairness of Deep Neural Networks

NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification

Data-Algorithm-Architecture Co-Optimization for Fair Neural Networks on Skin Lesion Dataset

Toward Fairer Face Recognition Datasets