Abstract:Nowadays video content has contributed to the majority of Internet traffic, which brings great challenge to the network infrastructure. Fortunately, the emergence of edge computing has provided a promising way to reduce the video load on the network by caching contents closer to users.But caching replacement algorithm is essential for the cache efficiency considering the limited cache space under existing edge-assisted network architecture. To investigate the challenges and opportunities inside, we first measure the performance of five state-of-the-art caching algorithms based on three real-world datasets. Our observation shows that state-of-the-art caching replacement algorithms suffer from following weaknesses: 1) the rule-based replacement approachs (e.g., LFU,LRU) cannot adapt under different scenarios; 2) data-driven forecast approaches only work efficiently on specific scenarios or datasets, as the extracted features working on one dataset may not work on another one. Motivated by these observations and edge-assisted computation capacity, we then propose an edge-assisted intelligent caching replacement framework LSTM-C based on deep Long Short-Term Memory network, which contains two types of modules: 1) four basic modules manage the coordination among content requests, content replace, cache space, service management; 2) three learning-based modules enable the online deep learning to provide intelligent caching strategy. Supported by this design, LSTM-C learns the pattern of content popularity at long and short time scales as well as determines the cache replacement policy. Most important, LSTM-C represents the request pattern with built-in memory cells, thus requires no data pre-processing, pre-programmed model or additional information. Our experiment results show that LSTM-C outperforms state-of-the-art methods in cache hit rate on three real-traces of video requests. When the cache size is limited, LSTM-C outperfor-s baselines by 20%~32% in cache hit rate. We also show that the training and predicting time of one iteration are $8.6~ms$ and $300~mu s$ on average respectively, which are fast enough for online operations.

Applying Deep Learning to the Cache Replacement Problem

Dynamic Access Distance Driven Cache Replacement

A Novel Cache Replacement Policy Via Dynamic Adaptive Insertion And Re-Reference Prediction

A Cache Replacement Policy Using Adaptive Insertion and Re-reference Prediction

Fleche: an efficient GPU embedding cache for personalized recommendations

Toward Edge-Assisted Video Content Intelligent Caching With Long Short-Term Memory Learning

Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning

A Learned Cache Eviction Framework with Minimal Overhead

Learning Memory Access Patterns

Genetic Cache: A Machine Learning Approach to Designing DRAM Cache Controllers in HBM Systems

Fast Modeling L2 Cache Reuse Distance Histograms Using Combined Locality Information from Software Traces

A Two Level Neural Approach Combining Off-Chip Prediction with Adaptive Prefetch Filtering

Phoebe: Reuse-Aware Online Caching with Reinforcement Learning for Emerging Storage Models

Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching

Adaptive Cache Management for Complex Storage Systems Using CNN-LSTM-Based Spatiotemporal Prediction

Caching in Dynamic Environments: A Near-Optimal Online Learning Approach

Designing a Deep Neural Network engine for LLC block reuse prediction to mitigate Soft Error in Multicore

Cache Promotion Policy Using Re-reference Interval Prediction

Cooperatively Managing Dynamic Writeback and Insertion Policies in a Last-Level DRAM Cache.

Enhancing LRU Replacement Via Phantom Associativity