Online Caching With Switching Cost and Operational Long-Term Constraints: An Online Learning Approach.

Zifan Jia,Qingsong Liu,Xiaoyan Gu,Haihui Fan,Feifei Dai,Bo Li,Weiping Wang
DOI: https://doi.org/10.1109/ICASSP48485.2024.10446753
2024-01-01
Abstract:The design of effective online caching policies is an increasingly important problem for content distribution networks, online recommender systems, and edge computing services, etc. Exiting literature usually tackles this problem through the lens of optimistic online learning and aims to achieve sublinear regret. In this paper, we focus on a non-trivial extension of classic online caching problem inspired by operational requirements of real-world systems including switching costs and long-term constraints. To tackle the challenges of switching costs and operational long-term constraints in the online caching, we introduce the Block-structured Follow-the-Regularized-Leader (B-FTRL) caching policy. Our approach incorporates a block structure that divides time into blocks to minimize caching switching costs. The theoretical analysis shows that B-FTRL achieves a utility regret bound of $O\left( {{T^{\frac{{2a - b + 1}}{{1 + a}}}} + {T^{\frac{b}{{1 + a}}}}} \right)$ and switching costs bound of $O\left( {{T^{\frac{1}{{1 + a}}}}} \right)$, where a and b are tunable algorithm parameters. By carefully selecting the values of a and b, we are able to limit the total regret to O(T <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2/3</sup> ) while satisfying the operational long-term constraints in expectation. Additionally, we provide high-probability constraint violation bounds of $O\left( {\sqrt T } \right)$. The performance of the proposed algorithm is evaluated with detailed trace-driven numerical tests.
What problem does this paper attempt to address?