Abstract:Fairness has become increasingly pivotal in machine learning for high-risk applications such as machine learning in healthcare and facial recognition. However, we see the deficiency in the previous logits space constraint methods. Therefore, we propose a novel framework, Logits-MMD, that achieves the fairness condition by imposing constraints on output logits with Maximum Mean Discrepancy. Moreover, quantitative analysis and experimental results show that our framework has a better property that outperforms previous methods and achieves state-of-the-art on two facial recognition datasets and one animal dataset. Finally, we show experimental results and demonstrate that our debias approach achieves the fairness condition effectively.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to achieve fairness in machine - learning models, especially in high - risk applications such as medical diagnosis and facial recognition. Specifically, the paper points out that the existing logits - space constraint methods are insufficient and cannot effectively eliminate biases in the model. Therefore, the authors propose a new framework - Logits - MMD, which achieves fairness conditions by imposing the Maximum Mean Discrepancy (MMD) constraint on the output logits. This method aims to minimize the difference in logits distribution between different sensitive groups, thereby reducing biases in model predictions. ### Main contributions of the paper: 1. **Proposing the Logits - MMD framework**: By using MMD constraints in the logits space, this framework can more effectively achieve fairness, especially in multi - sensitive - attribute settings. 2. **Theoretical analysis**: The authors theoretically prove the consistency between the MMD constraint and the fairness objective, and explain why MMD is more effective than other methods (such as Gaussian assumptions and histogram approximations). 3. **Experimental verification**: Experimental results on multiple datasets show that Logits - MMD achieves a better balance between fairness and accuracy, outperforming existing methods. ### Specific problem description: - **Background**: In high - risk applications, model predictions should be independent of users' sensitive attributes (such as race, gender, and age), but existing methods are flawed and cannot effectively eliminate data biases. - **Objective**: Achieve model fairness by imposing MMD constraints on the output logits of the classifier during the training process. - **Method**: Use MMD to measure the difference in logits distribution between different sensitive groups, and optimize the model by minimizing these differences. ### Key technical points: - **Definition of MMD**: \[ \text{MMD}(P, Q)=\sup_{f\in\mathcal{H}}\left(\mathbb{E}_{X_s\sim P}[f(X_s)]-\mathbb{E}_{Y_t\sim Q}[f(Y_t)]\right) \] where $\mathcal{H}$ is the Reproducing Kernel Hilbert Space (RKHS). - **Logits - MMD regularization term**: \[ L_{\text{MMD}}=\sum_{a_i, a_j\in A}d(l_{a_i, 0}, l_{a_j, 0})+d(l_{a_i, 1}, l_{a_j, 1}) \] where $d$ is a function that measures the distance between two sample distributions, and in this paper, the squared form of MMD is used. - **Training objective**: \[ \min_{\Theta}L_{\text{CE}}(\Theta)+\lambda L_{\text{MMD}}(\Theta) \] where $L_{\text{CE}}$ is the cross - entropy loss, and $\lambda$ is an adjustable hyperparameter used to control the trade - off between accuracy and fairness. ### Experimental results: - **CelebA dataset**: Logits - MMD improves the Equal Opportunity (EO) by an average of 40.6% and the accuracy by 3.9%. - **UTK Face dataset**: Logits - MMD improves fairness by 14.3%, 8.5%, and 16.4% respectively, although the accuracy decreases slightly. - **Dogs and Cats dataset**: Logits - MMD performs well on datasets with different degrees of bias, demonstrating its generalization ability in different scenarios. In conclusion, this paper effectively solves the deficiencies of existing methods in achieving fairness in machine - learning models by proposing the Logits - MMD framework, providing a more reliable solution for high - risk applications.

Toward Fairness via Maximum Mean Discrepancy Regularization on Logits Space

Learning Fair Classifiers via Min-Max F-divergence Regularization

On the Maximal Local Disparity of Fairness-Aware Classifiers

Fairness for Robust Log Loss Classification

Minimax Optimal Fair Classification with Bounded Demographic Disparity

Does Machine Bring in Extra Bias in Learning? Approximating Fairness in Models Promptly

Maximum Weighted Loss Discrepancy

Wasserstein Robust Classification with Fairness Constraints

Neural Collapse Inspired Debiased Representation Learning for Min-max Fairness

Normalise for Fairness: A Simple Normalisation Technique for Fairness in Regression Machine Learning Problems

AIM: Attributing, Interpreting, Mitigating Data Unfairness

Multi-fairness under class-imbalance

Balancing Fairness and Accuracy in Data-Restricted Binary Classification

Intrinsic Fairness-Accuracy Tradeoffs under Equalized Odds

Fairness-enhancing mixed effects deep learning improves fairness on in- and out-of-distribution clustered (non-iid) data

Optimal Transport of Classifiers to Fairness

Toward a Fairness-Aware Scoring System for Algorithmic Decision-Making

Bridging Fairness Gaps: A (Conditional) Distance Covariance Perspective in Fairness Learning

Metrizing Fairness

Chasing Fairness Under Distribution Shift: A Model Weight Perturbation Approach

Is Fairness Only Metric Deep? Evaluating and Addressing Subgroup Gaps in Deep Metric Learning