Fair Meta-Learning: Learning How to Learn Fairly

Dylan Slack,Sorelle Friedler,Emile Givental
DOI: https://doi.org/10.48550/arXiv.1911.04336
2019-11-07
Abstract:Data sets for fairness relevant tasks can lack examples or be biased according to a specific label in a sensitive attribute. We demonstrate the usefulness of weight based meta-learning approaches in such situations. For models that can be trained through gradient descent, we demonstrate that there are some parameter configurations that allow models to be optimized from a few number of gradient steps and with minimal data which are both fair and accurate. To learn such weight sets, we adapt the popular MAML algorithm to Fair-MAML by the inclusion of a fairness regularization term. In practice, Fair-MAML allows practitioners to train fair machine learning models from only a few examples when data from related tasks is available. We empirically exhibit the value of this technique by comparing to relevant baselines.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to train a machine - learning model that is both accurate and fair when the data set is biased or there are insufficient samples of sensitive attributes for specific labels. Specifically, the author focuses on how to use a small amount of task - specific training data to train a fair machine - learning model when relevant task data is available. This problem is particularly important in fairness - related tasks because biases may be introduced during the data collection process, resulting in an unrepresentative or unbalanced data set. ### Core Contributions of the Paper 1. **Fair - MAML Algorithm**: The author proposes a weight - based meta - learning method - Fair - MAML (Fair Model - Agnostic Meta - Learning), which optimizes the parameter configuration of the model by introducing a fairness regularization term. This enables the model to achieve high accuracy and fairness simultaneously with only a few gradient updates and using a small amount of data. 2. **Fairness Regularization Terms**: To ensure the fairness of the model, the author designs two regularization terms that can be calculated quickly: - **demographic parity** (demographic equal rights): \[ R_{dp}(f_\theta, D)=1 - P(\hat{Y} = 1|A = 0)\approx1-\frac{1}{|D_0|}\sum_{x\in D_0}P(f_\theta(x)=1) \] where \(D_0\) represents the instances of the protected group. - **equal opportunity**: \[ R_{eop}(f_\theta, D)=1 - P(\hat{Y} = 1|A = 0, Y = 1)\approx1-\frac{1}{|D_1^0|}\sum_{x\in D_1^0}P(f_\theta(x)=1) \] where \(D_1^0\) represents the instances with positive results in the protected group. 3. **Experimental Verification**: The author conducts experiments using synthetic data sets and real - world data sets (such as the "Communities and Crime" data set) to verify the effectiveness of Fair - MAML. The results show that Fair - MAML can better balance accuracy and fairness when dealing with new tasks, especially when there is only a small amount of training data. ### Experimental Results - **Synthetic Data Experiment**: On the synthetic data set, Fair - MAML demonstrates its advantage in dealing with the unbalanced distribution of data points, especially when there are only 5 positive - class protected - group samples, it can still maintain high accuracy and fairness. - **Communities and Crime Data Set Experiment**: On the real - world data set, when predicting the violent crime rate, Fair - MAML not only achieves high accuracy but also performs excellently in terms of fairness, especially in the generalization ability between different states. In conclusion, this paper proposes a new meta - learning framework, Fair - MAML, which aims to solve the problem of how to train a machine - learning model that is both accurate and fair when the data amount is limited and there are biases.