FedMC: Federated Reinforcement Learning on the Edge with Meta-Critic Networks

Derun Zou,Xusheng Liu,Lintan Sun,Jianhui Duan,Ruichen Li,Yeting Xu,Wenzhong Li,Sanglu Lu
DOI: https://doi.org/10.1109/ipccc55026.2022.9894336
2022-01-01
Abstract:Federated learning (FL) has been proposed as a novel paradigm to enable distributed learning on the edge with privacy protection. However, existing federated learning approaches mainly focus on training deep classification and clustering models, and no enough attention has been paid to solve the federated reinforcement learning task on the edge, a challenging task where multiple learning agents observe local state and take local actions to train a global learning model without revealing their local dataset. In this paper, we propose a generalised federated reinforcement learning framework called FedMC that integrates reinforcement learning models trained by multiple edge devices into a general model based on a meta-learning approach. In the proposed framework, each participant adopts a meta-value network (MVN) and task-actor encoder network (TAEN) locally to perform meta-learning training based on local task samples, and periodically uploads the weights of local MVN and TAEN to the server, which aggregate them to a global model with rapid adaptability and cross-task applicability. Extensive experiments based on a number of reinforcement learning tasks show that FedMC outperforms various federated learning baseline algorithms, and it is competitive with the methods that centrally train reinforcement learning task with global dataset.
What problem does this paper attempt to address?