QI-DPFL: Quality-Aware and Incentive-Boosted Federated Learning with Differential Privacy

Wenhao Yuan,Xuehe Wang
2024-04-12
Abstract:Federated Learning (FL) has increasingly been recognized as an innovative and secure distributed model training paradigm, aiming to coordinate multiple edge clients to collaboratively train a shared model without uploading their private datasets. The challenge of encouraging mobile edge devices to participate zealously in FL model training procedures, while mitigating the privacy leakage risks during wireless transmission, remains comparatively unexplored so far. In this paper, we propose a novel approach, named QI-DPFL (Quality-Aware and Incentive-Boosted Federated Learning with Differential Privacy), to address the aforementioned intractable issue. To select clients with high-quality datasets, we first propose a quality-aware client selection mechanism based on the Earth Mover's Distance (EMD) metric. Furthermore, to attract high-quality data contributors, we design an incentive-boosted mechanism that constructs the interactions between the central server and the selected clients as a two-stage Stackelberg game, where the central server designs the time-dependent reward to minimize its cost by considering the trade-off between accuracy loss and total reward allocated, and each selected client decides the privacy budget to maximize its utility. The Nash Equilibrium of the Stackelberg game is derived to find the optimal solution in each global iteration. The extensive experimental results on different real-world datasets demonstrate the effectiveness of our proposed FL framework, by realizing the goal of privacy protection and incentive compatibility.
Computer Science and Game Theory
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the problem of how to motivate mobile edge devices to actively participate in model training in Federated Learning (FL), while reducing the risk of privacy leakage during wireless transmission. Specifically, the article proposes a new method named QI - DPFL (Quality - Aware and Incentive - Boosted Federated Learning with Differential Privacy) to address the following challenges: 1. **Insufficient incentive mechanism**: Most studies assume that mobile devices will unconditionally participate in the training of the federated learning model once invited. However, in fact, participating in model training consumes a large amount of computing and communication resources, and participants also need to be vigilant against the risk of information leakage. Therefore, the lack of an effective economic incentive mechanism will lead to devices being unwilling to participate. 2. **Insufficient privacy protection**: Although federated learning itself provides a certain degree of privacy protection, there are still significant privacy leakage risks during the gradient propagation process. Existing Differential Privacy (DP) techniques usually rely on the standard (ϵ, δ)-DP mechanism and are vulnerable to the "disaster mechanism", resulting in poor privacy protection effects. 3. **Uneven data quality**: The data quality of clients participating in federated learning varies greatly. Directly using the data of all clients may lead to a decline in model performance. Therefore, an effective method is needed to select clients with high - quality data. ### Solutions To solve the above problems, QI - DPFL proposes the following innovations: 1. **Client selection mechanism based on EMD**: - Use the Earth Mover’s Distance (EMD) metric to evaluate the quality of client data, thereby selecting clients with high - quality data to participate in training. - EMD can quantify the difference between the local data distribution and the global data distribution. Selecting clients with smaller differences can improve model performance. 2. **Incentive mechanism based on Stackelberg game**: - Model the interaction between the central server and the selected clients as a two - stage Stackelberg game. - In the first stage, the central server designs a time - dependent reward mechanism to minimize its costs (including accuracy loss and total reward distribution). - In the second stage, each selected client decides the optimal privacy budget according to the assigned reward to maximize its own utility. 3. **Application of differential privacy technology**: - Introduce ρ - zero - concentrated differential privacy (ρ - zCDP) technology and add Gaussian noise when uploading local model parameters to protect privacy. - Adjust the privacy budget to balance the relationship between model performance and privacy protection. ### Experimental verification Through extensive experiments on different real - world datasets, the results show that the QI - DPFL framework not only achieves the goals of privacy protection and incentive compatibility but also improves the training performance of the model. ### Formula summary - **EMD calculation formula**: \[ \theta_h=\sum_{j \in Y}\|P_h(y = j)-P_a(y = j)\| \] where \(P_h\) is the data distribution of client \(h\), and \(P_a\) is the global actual distribution. - **Gaussian noise variance calculation**: \[ \sigma^2_i(t)=\frac{2C^2}{\rho_t^i|D_i|^2} \] - **Central server cost function**: \[ U_T(R, \rho)=\gamma\frac{2\beta G^2}{\lambda^2 T}+(1 - \gamma)\sum_{k = 1}^T\pi^{k - 1}R_k \] - **Client utility function**: \[ U_t^i(\rho_t^