Reward Shaping Based Federated Reinforcement Learning

Yiqiu Hu,Yun Hua,Wenyan Liu,Jun Zhu
DOI: https://doi.org/10.1109/access.2021.3074221
IF: 3.9
2021-01-01
IEEE Access
Abstract:Federated reinforcement learning aims to promote training efficiency or improve policy quality through information interaction with privacy protection. Existing federated reinforcement learning methods rarely utilize the structure of reinforcement learning algorithms while are limiting to specific scenarios or algorithms. We propose a general federated reinforcement learning framework FRS, which employs reward shaping as the federated information shared among different clients with different tasks to promote each client’s training speed and policy quality. The federated reward shaping is implicitly learned by average state value information of all clients to protect each client’s task privacy as the real trajectory is anonymous. Experiments on the GridWorld environment show that FRS can algorithm-independently improve the policy quality and promote training speed with protecting each client’s privacy.
What problem does this paper attempt to address?