CPN Meets Learning: Online Scheduling for Inference Service in Computing Power Network

Mingtao Ji,Ji Qi,Lei Jiao,Gangyi Luo,Hehan Zhao,Xin Li,Baoliu Ye,Zhuzhong Qian
DOI: https://doi.org/10.1016/j.comnet.2024.110903
IF: 5.493
2024-01-01
Computer Networks
Abstract:The advent of Computing Power Network (CPN) has opened up vast opportunities for machine learning inference, yet the challenge of reducing high operational cost due to intensive computations and the sheer volume of inference tasks cannot be overlooked. Scheduling inference tasks for mitigating operational cost involves various challenges, such as migrating tasks under unpredictable CPN status, making time-coupled decisions for resource provisioning, and selecting computing sites based on dynamic electricity prices. To address these issues, we introduce CPN-Inference, a novel and flexible inference framework built upon CPN. Specifically, we formulate a time-varying integer program problem that aims to minimize long-term cost, involving switching cost, operational cost, communication cost, queuing cost, and accuracy loss. We also propose a group of polynomial-time online algorithms for supporting the formulated problem by solving delicately constructed subproblems based on the inputs predicted via online learning. Furthermore, our algorithms are proven for their competitive ratio, showcasing the performance gap between our approach and the optimal offline optimum. A testbed is constructed to evaluate inference performance on real devices. Our comprehensive evaluations, based on datasets from real systems, demonstrate that our algorithms outperform multiple alternatives, by achieving an average cost reduction of 35%.
What problem does this paper attempt to address?