Abstract:As federated learning gains increasing importance in real-world applications due to its capacity for decentralized data training, addressing fairness concerns across demographic groups becomes critically important. However, most existing machine learning algorithms for ensuring fairness are designed for centralized data environments and generally require large-sample and distributional assumptions, underscoring the urgent need for fairness techniques adapted for decentralized and heterogeneous systems with finite-sample and distribution-free guarantees. To address this issue, this paper introduces FedFaiREE, a post-processing algorithm developed specifically for distribution-free fair learning in decentralized settings with small samples. Our approach accounts for unique challenges in decentralized environments, such as client heterogeneity, communication costs, and small sample sizes. We provide rigorous theoretical guarantees for both fairness and accuracy, and our experimental results further provide robust empirical validation for our proposed method.

What problem does this paper attempt to address?

### Problems the paper attempts to solve This paper aims to solve the problem of achieving fairness in Federated Learning (FL), especially in the case of small samples and without the need to assume data distribution (distribution - free). Specifically, the paper proposes a post - processing algorithm named FedFaiREE to address the following challenges: 1. **Client heterogeneity**: In federated learning, the data distributions of different clients may vary, which leads to the complexity of model training and challenges in fairness. 2. **Communication cost**: In federated learning, frequent data exchanges will bring high communication costs, especially in large - scale distributed systems. 3. **Small sample size**: The amount of data for each client is usually small, which limits the effectiveness of existing fairness methods. 4. **No distribution assumption**: Many existing fairness methods rely on assumptions about data distribution, but in practical applications, these assumptions are often not valid. ### Background and motivation With the increasing importance of federated learning in practical applications, how to achieve fairness among different groups has become particularly crucial. However, most existing machine - learning algorithms are designed for centralized data environments and usually require a large number of samples and distribution assumptions. This makes it more difficult to achieve fairness in decentralized and heterogeneous systems. Therefore, there is an urgent need for a fairness technique suitable for small samples and without distribution assumptions. ### Methods and contributions The paper proposes a post - processing algorithm named FedFaiREE, which has the following characteristics: 1. **Small samples and no distribution assumption**: FedFaiREE can achieve fairness in the case of small samples without making any assumptions about the data distribution. 2. **Theoretical guarantee**: The paper provides strict theoretical guarantees, proving that this method can achieve approximately optimal accuracy while satisfying fairness constraints. 3. **Experimental verification**: The experimental results further verify the effectiveness of FedFaiREE, especially in practical application scenarios where existing methods cannot effectively control fairness due to small sample sizes, while FedFaiREE performs well. ### Specific methods The core idea of FedFaiREE is to use order statistics to meet fairness constraints and select the classifier with the highest accuracy among those that meet the fairness constraints. The specific steps are as follows: 1. **Candidate set construction**: Establish a candidate set that meets fairness constraints through a distributed algorithm. 2. **Optimal threshold selection**: Select the threshold pair that minimizes the estimated misclassification error from the candidate set. ### Theoretical analysis The paper provides the following theoretical guarantees: 1. **Fairness guarantee**: At a given confidence level, the finally output classifier can meet the predefined fairness requirements. 2. **Accuracy guarantee**: When the input classifier is close to the Bayes optimal classifier, FedFaiREE can achieve an almost optimal misclassification error. ### Extensions and applications The paper also explores the applications of FedFaiREE in different scenarios, including fairness indicators such as label shift and Equalized Odds. These extensions further verify the robustness and effectiveness of FedFaiREE in practical applications. ### Summary In summary, this paper successfully solves the key problem of achieving fairness in federated learning by proposing the FedFaiREE algorithm, especially in the case of small samples and without distribution assumptions. This method is not only strictly proven theoretically but also shows good performance in practical applications.

Distribution-Free Fair Federated Learning with Small Samples

Fairness-aware Agnostic Federated Learning

Dynamic Fair Federated Learning Based on Reinforcement Learning

Fairness and Accuracy in Federated Learning

FedFair^3: Unlocking Threefold Fairness in Federated Learning

FedFDP: Fairness-Aware Federated Learning with Differential Privacy

Fairness and Privacy in Federated Learning and Their Implications in Healthcare

WassFFed: Wasserstein Fair Federated Learning

FedSampling: A Better Sampling Strategy for Federated Learning

Fairness-Aware Client Selection for Federated Learning

Federated Fairness Analytics: Quantifying Fairness in Federated Learning

FedHF: A High Fairness Federated Learning Algorithm Based on Deconfliction in Heterogeneous Networks.

Privacy, accuracy, and model fairness trade-offs in federated learning

Unified Group Fairness on Federated Learning

Post-Fair Federated Learning: Achieving Group and Community Fairness in Federated Learning via Post-processing

Multi-Dimensional Fair Federated Learning

FedAA: A Reinforcement Learning Perspective on Adaptive Aggregation for Fair and Robust Federated Learning

Proportional Fairness in Federated Learning

A Stochastic Optimization Framework for Private and Fair Learning From Decentralized Data

A Fair Federated Learning Framework with Reinforcement Learning.

Federated Learning Meets Fairness and Differential Privacy