A Crowdsourcing Framework for On-Device Federated Learning

Shashi Raj Pandey,Nguyen H. Tran,Mehdi Bennis,Yan Kyaw Tun,Aunas Manzoor,Choong Seon Hong
DOI: https://doi.org/10.1109/TWC.2020.2971981
2020-02-03
Abstract:Federated learning (FL) rests on the notion of training a global model in a decentralized manner. Under this setting, mobile devices perform computations on their local data before uploading the required updates to improve the global model. However, when the participating clients implement an uncoordinated computation strategy, the difficulty is to handle the communication efficiency (i.e., the number of communications per iteration) while exchanging the model parameters during aggregation. Therefore, a key challenge in FL is how users participate to build a high-quality global model with communication efficiency. We tackle this issue by formulating a utility maximization problem, and propose a novel crowdsourcing framework to leverage FL that considers the communication efficiency during parameters exchange. First, we show an incentive-based interaction between the crowdsourcing platform and the participating client's independent strategies for training a global learning model, where each side maximizes its own benefit. We formulate a two-stage Stackelberg game to analyze such scenario and find the game's equilibria. Second, we formalize an admission control scheme for participating clients to ensure a level of local accuracy. Simulated results demonstrate the efficacy of our proposed solution with up to 22% gain in the offered reward.
Machine Learning,Computer Science and Game Theory,Networking and Internet Architecture
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on two aspects: 1. **How to motivate a large number of mobile users to participate in Federated Learning (FL) to ensure the joint construction of a high - quality global model without sharing private data**. Since the amount of data on each device is limited, it is difficult to train a high - quality model. Therefore, a large number of users need to be motivated to participate in order to provide sufficient data samples. 2. **How to maintain communication efficiency during the Federated Learning process**. Specifically, when exchanging model parameters, how to reduce the number of communications while maintaining the accuracy of the global model. Because in Federated Learning, mobile devices need to communicate frequently with the central server to upload and download model parameters, which will lead to high communication costs. ### Detailed Explanation - **Design of Incentive Mechanisms**: In order to encourage more mobile users to participate in Federated Learning, the author proposes a market - based compensation mechanism (such as rewards, data discounts, etc.) and designs a game - theoretic framework (Stackelberg game) to analyze the interaction between the server and participating clients. In this way, the server can effectively motivate clients to participate in training, thereby improving the quality of the global model. - **Optimization of Communication Efficiency**: The author introduces the concept of communication efficiency, that is, reducing the number of communications in each iteration while maintaining the accuracy of the model. To this end, they design a communication - efficient cost model and find the optimal solution through optimization algorithms (such as linear complexity algorithms), thereby reducing communication costs while ensuring the quality of the model. ### Formula Representation The key formulas involved in the paper include: - **Objective Function of Local Sub - problems**: \[ J_k(w)=\frac{1}{D_k}\sum_{i = 1}^{D_k}f_i(w)+\lambda g(w) \] where \(w\in\mathbb{R}^d\) is the local model parameter, \(g(\cdot)\) is a regularization function, usually expressed as \(g(\cdot)=\frac{1}{2}\|\cdot\|^2\), and \(\lambda\in[0, 1]\) is a regularization coefficient. - **Objective Function of Global Problems**: \[ \min_{w\in\mathbb{R}^d}J(w)\quad\text{where}\quad J(w)\equiv\frac{\sum_{k = 1}^K D_k J_k(w)}{D} \] - **Upper Bound of the Number of Iterations**: \[ I_g(\epsilon,\theta)=\zeta\cdot\frac{\log(1/\epsilon)}{1-\theta} \] where \(\theta\) represents the local relative precision, \(\epsilon\) represents the precision of the global model, and \(\zeta>0\) is a constant. Through these methods, the author aims to solve the key challenges in Federated Learning, that is, how to protect user privacy while motivating users to participate and improving communication efficiency.