Abstract:With the popularity of smart devices and the growth of high-bandwidth applications, the wireless industry is facing an increased surge in data traffic. This challenge highlights the limitations of traditional edge-caching solutions, especially in terms of content-caching effectiveness and network-communication latency. To address this problem, we investigated efficient caching strategies in heterogeneous network environments. The caching decision process becomes more complex due to the heterogeneity of the network environment, as well as due to the diversity of user behaviors and content requests. To address the problem of increased system latency due to the dynamically changing nature of content popularity and limited cache capacity, we propose a novel content placement strategy, the long-short-term-memory–content-population-prediction model, to capture the correlation of request patterns between different contents and the periodicity in the time domain, in order to improve the accuracy of the prediction of content popularity. Then, to address the heterogeneity of heterogeneous network environments, we propose an efficient content delivery strategy: the multi-intelligent critical collaborative caching policy. This strategy models the edge-caching problem in heterogeneous scenarios as a Markov decision process using multi-base-station-environment information. In order to fully utilize the multi-intelligence information, we have improved the actor–critic approach by integrating the attention mechanism into a neural network. Whereas the actor network is responsible for making decisions based on local information, the critic network evaluates and enhances the actor's performance. We conducted extensive simulations, and the results showed that the Long Short Term Memory content population prediction model was more advantageous, in terms of content-popularity-prediction accuracy, with a 28.61% improvement in prediction error, compared to several other existing methods. The proposed multi-intelligence actor–critic collaborative caching policy algorithm improved the cache-hit-rate metric by up to 32.3% and reduced the system latency by 1.6%, demonstrating the feasibility and effectiveness of the algorithm.

Dynamic Content Caching Based on Actor-Critic Reinforcement Learning for IoT Systems.

Caching Transient Content for IoT Sensing: Multi-Agent Soft Actor-Critic

AoI and Energy Consumption Oriented Dynamic Status Updating in Caching Enabled IoT Networks

Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks

Reinforcement learning for cost-effective IoT service caching at the edge

Deep Reinforcement Learning Approaches for Content Caching in Cache-Enabled D2D Networks

Energy Efficient Cache Update and Content Delivery for Optimizing Information Freshness of Industrial Applications

Delay-Optimal Edge Cache Replacement with Non-Markovian Content Fetching

Delay-Optimal Edge Caching with Imperfect Content Fetching Via Stochastic Learning

A Deep Reinforcement Learning-Based Caching Strategy for IoT Networks With Transient Data

Optimal Status Update for Caching Enabled IoT Networks: A Dueling Deep R-Network Approach

A Deep Reinforcement Learning Approach for Dynamic Contents Caching in HetNets

AoI-Aware Markov Decision Policies for Caching

Online Digital Twin-Empowered Content Resale Mechanism in Age of Information-Aware Edge Caching Networks

Enhancing Heterogeneous Network Performance: Advanced Content Popularity Prediction and Efficient Caching

Transient Data Caching Based on Maximum Entropy Actor–Critic in Internet-of-Things Networks

Novel Edge Caching Approach Based on Multi-Agent Deep Reinforcement Learning for Internet of Vehicles

Caching in Dynamic Environments: A Near-Optimal Online Learning Approach

Cognitive-Caching: Cognitive Wireless Mobile Caching by Learning Fine-Grained Caching-Aware Indicators

Dynamic Content Update for Wireless Edge Caching via Deep Reinforcement Learning

On-Demand AoI Minimization in Resource-Constrained Cache-Enabled IoT Networks With Energy Harvesting Sensors