Abstract:Fairness has become increasingly pivotal in machine learning for high-risk
applications such as machine learning in healthcare and facial recognition.
However, we see the deficiency in the previous logits space constraint methods.
Therefore, we propose a novel framework, Logits-MMD, that achieves the fairness
condition by imposing constraints on output logits with Maximum Mean
Discrepancy. Moreover, quantitative analysis and experimental results show that
our framework has a better property that outperforms previous methods and
achieves state-of-the-art on two facial recognition datasets and one animal
dataset. Finally, we show experimental results and demonstrate that our debias
approach achieves the fairness condition effectively.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve fairness in machine - learning models, especially in high - risk applications such as medical diagnosis and facial recognition. Specifically, the paper points out that the existing logits - space constraint methods are insufficient and cannot effectively eliminate biases in the model. Therefore, the authors propose a new framework - Logits - MMD, which achieves fairness conditions by imposing the Maximum Mean Discrepancy (MMD) constraint on the output logits. This method aims to minimize the difference in logits distribution between different sensitive groups, thereby reducing biases in model predictions.
### Main contributions of the paper:
1. **Proposing the Logits - MMD framework**: By using MMD constraints in the logits space, this framework can more effectively achieve fairness, especially in multi - sensitive - attribute settings.
2. **Theoretical analysis**: The authors theoretically prove the consistency between the MMD constraint and the fairness objective, and explain why MMD is more effective than other methods (such as Gaussian assumptions and histogram approximations).
3. **Experimental verification**: Experimental results on multiple datasets show that Logits - MMD achieves a better balance between fairness and accuracy, outperforming existing methods.
### Specific problem description:
- **Background**: In high - risk applications, model predictions should be independent of users' sensitive attributes (such as race, gender, and age), but existing methods are flawed and cannot effectively eliminate data biases.
- **Objective**: Achieve model fairness by imposing MMD constraints on the output logits of the classifier during the training process.
- **Method**: Use MMD to measure the difference in logits distribution between different sensitive groups, and optimize the model by minimizing these differences.
### Key technical points:
- **Definition of MMD**:
\[
\text{MMD}(P, Q)=\sup_{f\in\mathcal{H}}\left(\mathbb{E}_{X_s\sim P}[f(X_s)]-\mathbb{E}_{Y_t\sim Q}[f(Y_t)]\right)
\]
where $\mathcal{H}$ is the Reproducing Kernel Hilbert Space (RKHS).
- **Logits - MMD regularization term**:
\[
L_{\text{MMD}}=\sum_{a_i, a_j\in A}d(l_{a_i, 0}, l_{a_j, 0})+d(l_{a_i, 1}, l_{a_j, 1})
\]
where $d$ is a function that measures the distance between two sample distributions, and in this paper, the squared form of MMD is used.
- **Training objective**:
\[
\min_{\Theta}L_{\text{CE}}(\Theta)+\lambda L_{\text{MMD}}(\Theta)
\]
where $L_{\text{CE}}$ is the cross - entropy loss, and $\lambda$ is an adjustable hyperparameter used to control the trade - off between accuracy and fairness.
### Experimental results:
- **CelebA dataset**: Logits - MMD improves the Equal Opportunity (EO) by an average of 40.6% and the accuracy by 3.9%.
- **UTK Face dataset**: Logits - MMD improves fairness by 14.3%, 8.5%, and 16.4% respectively, although the accuracy decreases slightly.
- **Dogs and Cats dataset**: Logits - MMD performs well on datasets with different degrees of bias, demonstrating its generalization ability in different scenarios.
In conclusion, this paper effectively solves the deficiencies of existing methods in achieving fairness in machine - learning models by proposing the Logits - MMD framework, providing a more reliable solution for high - risk applications.