Abstract:Nowadays video content has contributed to the majority of Internet traffic, which brings great challenge to the network infrastructure. Fortunately, the emergence of edge computing has provided a promising way to reduce the video load on the network by caching contents closer to users.But caching replacement algorithm is essential for the cache efficiency considering the limited cache space under existing edge-assisted network architecture. To investigate the challenges and opportunities inside, we first measure the performance of five state-of-the-art caching algorithms based on three real-world datasets. Our observation shows that state-of-the-art caching replacement algorithms suffer from following weaknesses: 1) the rule-based replacement approachs (e.g., LFU,LRU) cannot adapt under different scenarios; 2) data-driven forecast approaches only work efficiently on specific scenarios or datasets, as the extracted features working on one dataset may not work on another one. Motivated by these observations and edge-assisted computation capacity, we then propose an edge-assisted intelligent caching replacement framework LSTM-C based on deep Long Short-Term Memory network, which contains two types of modules: 1) four basic modules manage the coordination among content requests, content replace, cache space, service management; 2) three learning-based modules enable the online deep learning to provide intelligent caching strategy. Supported by this design, LSTM-C learns the pattern of content popularity at long and short time scales as well as determines the cache replacement policy. Most important, LSTM-C represents the request pattern with built-in memory cells, thus requires no data pre-processing, pre-programmed model or additional information. Our experiment results show that LSTM-C outperforms state-of-the-art methods in cache hit rate on three real-traces of video requests. When the cache size is limited, LSTM-C outperfor-s baselines by 20%~32% in cache hit rate. We also show that the training and predicting time of one iteration are $8.6~ms$ and $300~mu s$ on average respectively, which are fast enough for online operations.

DeepCache: Principled Cache for Mobile Deep Vision.

Close the Gap Between Deep Learning and Mobile Intelligence by Incorporating Training in the Loop

Explore Training of Deep Convolutional Neural Networks on Battery-powered Mobile Devices: Design and Application

Accelerating Convolutional Neural Networks for Continuous Mobile Vision Via Cache Reuse.

DeepCache: Accelerating Diffusion Models for Free

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

CloudEye: A New Paradigm of Video Analysis System for Mobile Visual Scenarios

Intelligent Video Caching at Network Edge: A Multi-Agent Deep Reinforcement Learning Approach

A Long-Short-Term Fusion Approach for Video Cache.

Fleche: an efficient GPU embedding cache for personalized recommendations

Reactive Video Caching via long-short-term fusion approach

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Toward Edge-Assisted Video Content Intelligent Caching With Long Short-Term Memory Learning

Toward Smart and Cooperative Edge Caching for 5G Networks: A Deep Learning Based Approach

PrefCache: Edge Cache Admission with User Preference Learning for Video Content Distribution

Towards Real-Time Video Caching at Edge Servers: A Cost-Aware Deep Q-Learning Solution

Caching as an Image Characterization Problem using Deep Convolutional Neural Networks

A Survey of Deep Learning for Data Caching in Edge Network

Cloud-based or On-device: An Empirical Study of Mobile Deep Inference

Icache: an Importance-Sampling-Informed Cache for Accelerating I/O-Bound DNN Model Training.