GLOCALFAIR: Jointly Improving Global and Local Group Fairness in Federated Learning

Syed Irfan Ali Meerza,Luyang Liu,Jiaxin Zhang,Jian Liu
2024-10-03
Abstract:Federated learning (FL) has emerged as a prospective solution for collaboratively learning a shared model across clients without sacrificing their data privacy. However, the federated learned model tends to be biased against certain demographic groups (e.g., racial and gender groups) due to the inherent FL properties, such as data heterogeneity and party selection. Unlike centralized learning, mitigating bias in FL is particularly challenging as private training datasets and their sensitive attributes are typically not directly accessible. Most prior research in this field only focuses on global fairness while overlooking the local fairness of individual clients. Moreover, existing methods often require sensitive information about the client's local datasets to be shared, which is not desirable. To address these issues, we propose GLOCALFAIR, a client-server co-design fairness framework that can jointly improve global and local group fairness in FL without the need for sensitive statistics about the client's private datasets. Specifically, we utilize constrained optimization to enforce local fairness on the client side and adopt a fairness-aware clustering-based aggregation on the server to further ensure the global model fairness across different sensitive groups while maintaining high utility. Experiments on two image datasets and one tabular dataset with various state-of-the-art fairness baselines show that GLOCALFAIR can achieve enhanced fairness under both global and local data distributions while maintaining a good level of utility and client fairness.
Machine Learning,Artificial Intelligence,Computers and Society
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper titled "GL OCAL FAIR: Jointly Improving Global and Local Group Fairness in Federated Learning" aims to address fairness issues in Federated Learning (FL). Specifically, the paper focuses on the following points: 1. **Improvement of Global and Local Group Fairness**: - **Global Fairness**: Ensuring that the model performance is fair among different sensitive groups (e.g., race, gender, etc.) under the overall data distribution. - **Local Fairness**: Ensuring that the model performance is fair among different sensitive groups within the local data distribution of each client. 2. **Limitations of Existing Methods**: - Existing research mainly focuses on global fairness while neglecting local fairness. - Existing methods usually require sharing sensitive statistical data from clients, which raises privacy concerns. - Existing methods may not achieve optimal fairness on highly heterogeneous datasets. 3. **Proposed Method**: - The paper proposes a client-server collaborative design framework named **GLOCALFAIR**, which can improve both global and local group fairness without sharing sensitive statistical data. - **Client**: Ensures local model performance fairness among different sensitive groups through constrained optimization methods. - **Server**: Uses a fairness-based clustering aggregation method to further ensure the fairness of the global model. ### Key Points of the Solution 1. **Constrained Optimization**: - On the client side, ensures local model performance fairness among different sensitive groups through constrained optimization methods. Specifically, it minimizes the false positive rate (FPR) and false negative rate (FNR) to meet preset thresholds. - Transforms the constrained optimization problem into a two-player zero-sum game, alternating between gradient descent and gradient ascent to optimize model parameters and dual variables. 2. **Fairness-Based Clustering Aggregation**: - On the server side, groups client updates using the K-means clustering method, with the Gini coefficient as a proxy indicator of fairness. - Aggregates updates within each cluster weighted by data volume, then re-aggregates based on the cumulative Gini coefficient of each cluster, prioritizing updates with higher fairness. 3. **Experimental Validation**: - Conducts experiments on multiple federated datasets (including image and tabular data) to validate that GLOCALFAIR improves both global and local fairness while maintaining high model utility and client fairness. ### Summary By proposing the GLOCALFAIR framework, the paper addresses the improvement of both global and local group fairness in federated learning while avoiding the privacy risks associated with sharing sensitive statistical data. Experimental results show that this method effectively enhances fairness across multiple datasets while maintaining high model utility.