Long-Term Throughput Maximization in Wireless Powered Communication Networks: A Multi-Task DRL Approach
Arman Ahmadian,Wonjae Shin,Hyuncheol Park
DOI: https://doi.org/10.1109/jiot.2024.3370192
IF: 10.6
2024-01-01
IEEE Internet of Things Journal
Abstract:In this paper, we study a wirelessly-powered communication network (WPCN) composed of several wireless devices (WDs) which rely on an energy access point (EAP) for their supply of energy which is used to transfer data to a data access point (DAP). While the dominant focus of the literature in this area has been on frame-based data rate optimization, it has been known that this approach is greedy and only optimal in additive white Gaussian noise (AWGN) channels. Hence, in this work, we propose an online algorithm that, based on the current state of the batteries and channels, adaptively calculates the transmission time and power allocations to maximize the long-term performance of this system in fading channels. In order to accomplish such a goal, we employ the Twin Delayed DDPG (TD3) approach, which is a deep reinforcement learning (DRL) technique designed for continuous state and action spaces. Additionally, we use the multi-task learning (MTL) techniques to train a DRL agent that creates a composite policy capable of dynamically optimizing not just one specific WPCN configuration, but a continuum of WPCN problems, which we call a composite environment. Each of these environments has different parameters such as WDs’ distances to the DAP and EAP, batteries’ size, rectifying antenna (rectenna) energy harvesting (EH) model linearity, as well as different uplink (UL) and downlink (DL) channel specular components. Simulation results confirm that the TD3 algorithm can easily learn to achieve a much higher common throughput compared to the traditional optimization schemes.
computer science, information systems,telecommunications,engineering, electrical & electronic