A Survey on Group Fairness in Federated Learning: Challenges, Taxonomy of Solutions and Directions for Future Research

Teresa Salazar,Helder Araújo,Alberto Cano,Pedro Henriques Abreu
2024-10-05
Abstract:Group fairness in machine learning is a critical area of research focused on achieving equitable outcomes across different groups defined by sensitive attributes such as race or gender. Federated learning, a decentralized approach to training machine learning models across multiple devices or organizations without sharing raw data, amplifies the need for fairness due to the heterogeneous data distributions across clients, which can exacerbate biases. The intersection of federated learning and group fairness has attracted significant interest, with 47 research works specifically dedicated to addressing this issue. However, no dedicated survey has focused comprehensively on group fairness in federated learning. In this work, we present an in-depth survey on this topic, addressing the critical challenges and reviewing related works in the field. We create a novel taxonomy of these approaches based on key criteria such as data partitioning, location, and applied strategies. Additionally, we explore broader concerns related to this problem and investigate how different approaches handle the complexities of various sensitive groups and their intersections. Finally, we review the datasets and applications commonly used in current research. We conclude by highlighting key areas for future research, emphasizing the need for more methods to address the complexities of achieving group fairness in federated systems.
Machine Learning,Artificial Intelligence,Computers and Society
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the group fairness problem in Federated Learning (FL). Specifically, the paper focuses on how to ensure fair results among groups defined by different sensitive attributes (such as race, gender, etc.) in the Federated Learning environment. ### Problem Background 1. **Importance of Group Fairness**: - In machine learning, group fairness means ensuring that the prediction results of the model do not create prejudice or discrimination against certain specific groups, especially when it involves sensitive attributes (such as race, gender, age, etc.). - Group fairness is crucial for preventing discrimination in the automated decision - making process. 2. **Characteristics of Federated Learning**: - Federated Learning is a distributed machine - learning method that allows multiple clients (such as devices or organizations) to collaboratively train a model without sharing the original data. - This method helps protect privacy because the data remains on local devices rather than being centralized in one location. 3. **Challenges**: - **Data Heterogeneity**: The data distribution in Federated Learning is usually non - independent and identically distributed (non - IID), which will lead to large data differences among different clients and thus exacerbate the bias in the model. - **Privacy Protection**: Achieving fairness while protecting sensitive attributes is a complex problem because these attributes need to be processed and analyzed without revealing privacy. - **Limited Visibility**: Since it is impossible to fully understand the data distribution of all clients, especially in terms of sensitive attributes, this makes it more difficult to implement fairness algorithms. ### Main Contributions of the Paper 1. **Outlining Challenges**: - The paper elaborately describes the unique challenges in achieving group fairness in Federated Learning, such as how to manage sensitive attributes while protecting client privacy and how to deal with the heterogeneity of data distribution. 2. **Establishing a Taxonomy**: - The paper proposes the first taxonomy on group fairness in Federated Learning, which is structured based on six key dimensions: data partitioning, location, strategy, focus, sensitive attributes, data set, and application domain. 3. **Identifying Research Gaps**: - The paper points out the key gaps in the existing literature, analyzes the areas that need further research, such as dealing with intersectionality, developing frameworks for researching group fairness, and addressing the challenges in less - explored areas. ### Conclusion The paper aims to fill the gap in the existing literature regarding a detailed review of group fairness in Federated Learning. By systematically classifying and analyzing existing methods, highlighting key challenges, and providing guidance for future research directions.