Technical Challenges for Training Fair Neural Networks

Valeriia Cherepanova,Vedant Nanda,Micah Goldblum,John P. Dickerson,Tom Goldstein
DOI: https://doi.org/10.48550/arXiv.2102.06764
2021-02-13
Abstract:As machine learning algorithms have been widely deployed across applications, many concerns have been raised over the fairness of their predictions, especially in high stakes settings (such as facial recognition and medical imaging). To respond to these concerns, the community has proposed and formalized various notions of fairness as well as methods for rectifying unfair behavior. While fairness constraints have been studied extensively for classical models, the effectiveness of methods for imposing fairness on deep neural networks is unclear. In this paper, we observe that these large models overfit to fairness objectives, and produce a range of unintended and undesirable consequences. We conduct our experiments on both facial recognition and automated medical diagnosis datasets using state-of-the-art architectures.
Machine Learning,Artificial Intelligence,Computers and Society
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is: **the challenges and problems faced by deep neural networks (DNNs) when fairness constraints are imposed during the training process**. Specifically, the author explores the effectiveness and limitations of existing fairness methods when applied to deep neural networks, especially their performance in face recognition and medical image classification tasks. ### Main Problems 1. **Over - fitting Problem**: - Due to the highly overparameterized nature of deep neural networks, they can fit perfectly on the training data, resulting in the inability to obtain meaningful bias measures in the training data. - This over - fitting phenomenon makes it difficult for the fairness constraints imposed on the training data to be generalized to unseen test data. 2. **Ineffective Fairness Intervention**: - The paper finds that many fairness interventions that are effective in classical machine - learning models (such as SVM, logistic regression, etc.) do not work in deep neural networks. - Even if these fairness interventions seem effective, they may lead to unfair phenomena, such as "fairness gerrymandering", that is, moving the decision boundary to meet certain fairness constraints, thus having an adverse impact on important subgroups that are not considered. 3. **Domain - specific Problems**: - In face recognition tasks, even if the loss is equalized between genders on the training data, this fairness is not maintained on the validation and test sets, and may instead lead to greater differences in other sensitive attributes (such as age). - In medical image classification tasks, imposing fairness constraints may reduce the performance of the model on the test set, especially in multi - label classification tasks. ### Formula Summary - **Accuracy Parity**: \[ P(\hat{y} \neq y|a = 0)\approx P(\hat{y} \neq y|a = 1) \] To achieve this goal, loss parity is used as a substitute: \[ \min_{\theta}[\hat{L}(f_{\theta})+\alpha|\hat{L}_{a}^{+}(f_{\theta})-\hat{L}_{a}^{-}(f_{\theta})|] \] - **Equal Opportunity**: \[ P(\hat{y} = 1|a = 1,y = y)\approx P(\hat{y} = 1|a = 0,y = y) \] - **Disparate Impact**: \[ P(\hat{y} = 1|a = 1)\approx P(\hat{y} = 1|a = 0) \] The corresponding objective function is: \[ \min_{\theta}[\hat{L}(f_{\theta})+\alpha\cdot di] \] where, \[ di = -\min\left(\frac{\sum_{i}a_{i}p_{i}}{\sum_{i}a_{i}},\frac{\sum_{i}(1 - a_{i})p_{i}}{\sum_{i}(1 - a_{i})}\right) \] ### Conclusion The paper shows through experiments that existing fairness methods have significant limitations in deep neural networks. In particular, due to the over - fitting problem, these methods are difficult to effectively control bias in practical applications. Therefore, researchers need to explore new methods to address these challenges to ensure the fairness of deep - learning models in high - risk application scenarios.