Deep Reinforcement Learning for Over-the-Air Federated Learning in SWIPT-Enabled IoT Networks

Xinran Zhang,Hui Tian,Wanli Ni,Mengying Sun
DOI: https://doi.org/10.1109/vtc2022-fall57202.2022.10012702
2022-01-01
Abstract:As a distributed machine learning paradigm, federated learning (FL) has been regarded as a promising candidate to preserve user privacy in Internet of Things (IoT) networks. Leveraging the waveform superposition property of wireless channels, over-the-air FL (AirFL) achieves fast model aggregation by integrating communication and computation via concurrent analog transmissions. To support sustainable AirFL among energy-constrained IoT devices, we consider that the base station (BS) adopts simultaneous wireless information and power transfer (SWIPT) to distribute global model and charge local devices in each communication round. To maximize the long-term energy efficiency (EE) of AirFL, we investigate a resource allocation problem by jointly optimizing the time division, transceiver beamforming, and power splitting in SWIPT-enabled IoT networks. Considering such multiple closely-coupled continuous valuables, we propose a deep reinforcement learning (DRL) algorithm based on twin delayed deep deterministic (TD3) policy to smartly make downlink and uplink communication strategies with the coordination between the BS and devices. Simulation results show that the proposed TD3 algorithm obtains about 41% EE improvement compared to traditional optimization method and other DRL algorithms.
What problem does this paper attempt to address?