Bridging Fairness Gaps: A (Conditional) Distance Covariance Perspective in Fairness Learning

Ruifan Huang,Haixia Liu
2024-12-01
Abstract:We bridge fairness gaps from a statistical perspective by selectively utilizing either conditional distance covariance or distance covariance statistics as measures to assess the independence between predictions and sensitive attributes. We enhance fairness by incorporating sample (conditional) distance covariance as a manageable penalty term into the machine learning process. Additionally, we present the matrix form of empirical (conditional) distance covariance for parallel calculations to enhance computational efficiency. Theoretically, we provide a proof for the convergence between empirical and population (conditional) distance covariance, establishing necessary guarantees for batch computations. Through experiments conducted on a range of real-world datasets, we have demonstrated that our method effectively bridges the fairness gap in machine learning.
Machine Learning,Computers and Society
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively improve the fairness of models in machine learning. Specifically, the author proposes a new method. By using Conditional Distance Covariance (CDC) or Distance Covariance (DC) as a metric, it evaluates the independence between prediction results and sensitive attributes, thereby reducing unfairness in the model. ### Background and Problem Description While modern deep neural networks (DNNs) are successfully applied to various tasks, many ethical and legal issues may be triggered during the model training process. Especially in real - world classification and decision - making tasks, biased datasets can affect machine - learning models, leading to unfair prediction results. These unfair prediction results may lead users to make biased choices, which in turn generate more biased data, forming a bias cycle. ### Definition of Fairness Fairness in machine learning can be divided into two levels: 1. **Group - level fairness**: Emphasizes that different groups should be treated equally. 2. **Individual - level fairness**: Aims to provide similar predictions for similar individuals. This paper mainly focuses on group - level fairness. A fair machine - learning model should avoid generating biased outputs based on sensitive attributes (such as race, gender, and age). Although these sensitive attributes may not explicitly appear in the features of the training data, high - dimensional and complex data contains a large amount of information, and some of this information may be inadvertently associated with sensitive attributes, resulting in biased results. ### Solution To enhance fairness, a natural and intuitive idea is to promote the statistical independence between model predictions (\(\hat{Y}\)) and sensitive attributes (\(Z\)). When model predictions and sensitive attributes are regarded as random variables, if \(\hat{Y}\) and \(Z\) are independent, the fairness criterion "Demographic Parity (DP)" is satisfied. Similarly, if \(\hat{Y}\) and \(Z\) are conditionally independent given the true label (\(Y\)), the fairness criterion "Equalized Odds (EO)" is satisfied. ### Method Introduction This paper introduces the Distance Covariance (DC) and Conditional Distance Covariance (CDC) methods to achieve fair classification. DC and CDC are robust methods for measuring linear and nonlinear correlations between two or three random variables (vectors). A smaller (conditional) distance covariance value indicates a weaker relationship between random variables, and its value is zero if and only if two random variables are (conditionally) independent. However, directly calculating DC or CDC requires the analytic form of the known distribution function and involves integration, which is often difficult to achieve in practical applications. Therefore, this paper uses empirical (conditional) distance covariance as an alternative loss and incorporates it as a regularization term in the model training process. ### Main Contributions 1. **Introduce empirical (conditional) distance covariance as a feasible penalty term in the machine - learning process** to promote independence. To the best of our knowledge, this is the first time that conditional distance covariance has been combined with machine learning. 2. **Propose the matrix form of empirical (conditional) distance covariance** for parallel computing to improve computational efficiency. 3. **Provide a theoretical proof of the probability convergence of empirical (conditional) distance covariance and population (conditional) distance covariance in terms of sample size**. These results provide an important theoretical basis for mini - batch computing. 4. **Numerical experiments show that the proposed method has wide applicability** and can achieve competitive performance in multiple datasets and tasks. This method does not depend on the model or prior knowledge of existing biases, and is not limited to binary sensitive attributes and can be extended to any number of sensitive attributes or sub - groups. ### Numerical Experiments The author conducted numerical experiments on four real - world datasets, including the UCI Adult dataset, the ACSIncome dataset, and two image datasets. The experimental results show that the proposed method performs excellently in improving model fairness while maintaining high accuracy. ### Summary This paper provides an effective method to reduce unfairness in machine - learning models by introducing empirical (conditional) distance covariance as a metric for fairness constraints.