Dynamic Content Caching Based on Actor-Critic Reinforcement Learning for IoT Systems.

Lifeng Lai,Fu-Chun Zheng,Wanli Wen,Jingjing Luo,Ge Li
DOI: https://doi.org/10.1109/vtc2022-fall57202.2022.10013053
2022-01-01
Abstract:In this paper, we consider the dynamic content caching issue in the cache-enabled Internet of Things (IoT) systems. For real-time applications in cache-enabled IoT systems, it is imperative to design dynamic content caching schemes to reduce the energy consumption of sensors and improve the freshness of information at users. We first design a dynamic content caching procedure for a cache-enabled IoT system with limited cache capacity and express the evolution of the Age of Information (AoI) at both the edge caching node and each user. Then, we formulate the dynamic content caching problem as a Markov Decision Process to minimize the expectation of a long-term accumulative cost, which jointly considers the average AoI of users and the energy consumption of sensors. To solve this problem, we propose an actor-critic based caching algorithm without prior knowledge of users' content demands. The numerical results show that the proposed algorithm can achieve lower average AoI and energy consumption than other baselines.
What problem does this paper attempt to address?