Sampling-based Caching for Low Latency in Distributed Coded Storage Systems

Kaiyang Liu, Jingrong Wang, Heng Li, Jun Peng, Jianping Pan
DOI: https://doi.org/10.1109/tsc.2023.3318315
IF: 11.019
2023-01-01
IEEE Transactions on Services Computing
Abstract: Caching is considered a promising way to achieve low latency in distributed erasure-coded storage systems. Previous work categorizes all feasible caching decisions into a set of cache partitions and then obtains the optimal solution by applying the market clearing price to each cache partition. Although it achieves the lowest data access latency, the optimal scheme suffers from high computation overhead when applied to large-scale storage systems. This paper presents SampleX, which sparsifies the cache partitions through sampling to approximate the optimal caching scheme with substantially reduced computational complexity. Theoretical analysis guarantees the performance of SampleX. Furthermore, SampleX is implemented in a streaming fashion, capturing the characteristics of recent traffic for online cache content replacement. Trace-driven experimental results show that online SampleX is up to 95× faster than the state-of-the-art online scheme while incurring a performance loss of only 0.81%.
computer science, information systems, software engineering