Joint Device Participation, Dataset Management, and Resource Allocation in Wireless Federated Learning Via Deep Reinforcement Learning

Jinlian Chen,Jun Zhang,Nan Zhao,Yiyang Pei,Ying-Chang Liang,Dusit Niyato
DOI: https://doi.org/10.1109/tvt.2023.3325843
IF: 6.8
2024-01-01
IEEE Transactions on Vehicular Technology
Abstract:Federated Learning (FL) enables large-scale machine learning without uploading the private data of wireless devices. Due to the heterogeneity and limitation of the devices' resources, the FL accuracy and latency substantially depend on the device participation and training dataset size. In this letter, to strike a trade-off between the FL accuracy and FL latency, a joint device participation, dataset management and resource allocation (DPDMRA) optimization problem is investigated. To solve the non-convex optimization problem, a Markov decision process is formulated for the resource-limited wireless FL. Moreover, due to the high dimensional continuous action space, a multi-agent softmax deep double deterministic policy gradients (MASD3) method is employed to obtain the optimal DPDMRA strategies. The double actor networks and softmax operator are designed to alleviate the underestimation bias. Simulation results demonstrate that the proposed DRL method can obtain the global optimal policy without complete information in the dynamic environment. Compared with the other baseline schemes, the proposed MASD3 approach can achieve the larger system utility with the better convergence performance.
What problem does this paper attempt to address?