Abstract:Group fairness in machine learning is a critical area of research focused on achieving equitable outcomes across different groups defined by sensitive attributes such as race or gender. Federated learning, a decentralized approach to training machine learning models across multiple devices or organizations without sharing raw data, amplifies the need for fairness due to the heterogeneous data distributions across clients, which can exacerbate biases. The intersection of federated learning and group fairness has attracted significant interest, with 47 research works specifically dedicated to addressing this issue. However, no dedicated survey has focused comprehensively on group fairness in federated learning. In this work, we present an in-depth survey on this topic, addressing the critical challenges and reviewing related works in the field. We create a novel taxonomy of these approaches based on key criteria such as data partitioning, location, and applied strategies. Additionally, we explore broader concerns related to this problem and investigate how different approaches handle the complexities of various sensitive groups and their intersections. Finally, we review the datasets and applications commonly used in current research. We conclude by highlighting key areas for future research, emphasizing the need for more methods to address the complexities of achieving group fairness in federated systems.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the group fairness problem in Federated Learning (FL). Specifically, the paper focuses on how to ensure fair results among groups defined by different sensitive attributes (such as race, gender, etc.) in the Federated Learning environment. ### Problem Background 1. **Importance of Group Fairness**: - In machine learning, group fairness means ensuring that the prediction results of the model do not create prejudice or discrimination against certain specific groups, especially when it involves sensitive attributes (such as race, gender, age, etc.). - Group fairness is crucial for preventing discrimination in the automated decision - making process. 2. **Characteristics of Federated Learning**: - Federated Learning is a distributed machine - learning method that allows multiple clients (such as devices or organizations) to collaboratively train a model without sharing the original data. - This method helps protect privacy because the data remains on local devices rather than being centralized in one location. 3. **Challenges**: - **Data Heterogeneity**: The data distribution in Federated Learning is usually non - independent and identically distributed (non - IID), which will lead to large data differences among different clients and thus exacerbate the bias in the model. - **Privacy Protection**: Achieving fairness while protecting sensitive attributes is a complex problem because these attributes need to be processed and analyzed without revealing privacy. - **Limited Visibility**: Since it is impossible to fully understand the data distribution of all clients, especially in terms of sensitive attributes, this makes it more difficult to implement fairness algorithms. ### Main Contributions of the Paper 1. **Outlining Challenges**: - The paper elaborately describes the unique challenges in achieving group fairness in Federated Learning, such as how to manage sensitive attributes while protecting client privacy and how to deal with the heterogeneity of data distribution. 2. **Establishing a Taxonomy**: - The paper proposes the first taxonomy on group fairness in Federated Learning, which is structured based on six key dimensions: data partitioning, location, strategy, focus, sensitive attributes, data set, and application domain. 3. **Identifying Research Gaps**: - The paper points out the key gaps in the existing literature, analyzes the areas that need further research, such as dealing with intersectionality, developing frameworks for researching group fairness, and addressing the challenges in less - explored areas. ### Conclusion The paper aims to fill the gap in the existing literature regarding a detailed review of group fairness in Federated Learning. By systematically classifying and analyzing existing methods, highlighting key challenges, and providing guidance for future research directions.

A Survey on Group Fairness in Federated Learning: Challenges, Taxonomy of Solutions and Directions for Future Research

Fairness and Privacy in Federated Learning and Their Implications in Healthcare

Federated Fairness without Access to Sensitive Groups

A Generalized Look at Federated Learning: Survey and Perspectives

Linkage on Security, Privacy and Fairness in Federated Learning: New Balances and New Perspectives

A Multivocal Literature Review on Privacy and Fairness in Federated Learning

Federated Fairness Analytics: Quantifying Fairness in Federated Learning

Privacy and Fairness in Federated Learning: on the Perspective of Trade-off

Unified Group Fairness on Federated Learning

Unveiling Group-Specific Distributed Concept Drift: A Fairness Imperative in Federated Learning

AI Fairness–From Machine Learning to Federated Learning

Mitigating Group Bias in Federated Learning: Beyond Local Fairness

Federating for Learning Group Fair Models

Recent Advances on Federated Learning: A Systematic Survey

A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection

Addressing Bias and Fairness Using Fair Federated Learning: A Synthetic Review

Federated learning design and functional models: survey

Federated Learning: Challenges, Methods, and Future Directions

Minimax Demographic Group Fairness in Federated Learning

Federated Learning Meets Fairness and Differential Privacy