Distributed Reinforcement Learning for Optimizing Age of Information and Energy Consumption in Wireless Powered IoT Systems.

Xianzhe Xu,Nan Liu,Zhiwen Pan
DOI: https://doi.org/10.1145/3603781.3603924
2023-01-01
Abstract:In this paper, we study a real-time monitoring system in which distributed Internet of Things (IoT) devices are responsible for the sampling of an underlying physical process and sending update packets to a common base station (BS) in order to maintain the freshness of information at the BS. In the considered model, the IoT devices are powered through wireless energy transfer (WET) by the BS. Due to limited wireless resources, only a subset of devices can transmit update packets at any given time. To minimize the weighted sum of average Age of information (AoI) and energy consumption of the BS, we model this problem as a Markov Decision Process (MDP) with finite state and action spaces, and propose an IPPO-based algorithm. Once trained, the proposed algorithm enables the devices to operate distributedly, but still achieve good performance. Simulation results show that compared with random sampling and transmitting strategy and Implicit Q-learning (IQL) algorithm, the proposed IPPO-based algorithm can optimize both AoI and energy more effectively.
What problem does this paper attempt to address?