When CPN Meets AI: Resource Provisioning for Inference Query Upon Computing Power Network
Mingtao Ji,Zhuzhong Qian,Baoliu Ye
DOI: https://doi.org/10.1109/icpads60453.2023.00304
2023-01-01
Abstract:Performing machine learning inference at the network edge, named Edge Inference, showing benefits like low latency, reduced data traffic, and improved user privacy, has attracted massive attention. Computing Power Network (CPN) creates opportunities for edge inference, but also poses multiple challenges for service providers, including computing power selections from different enterprises, the time-coupled decision for resource provisioning and the unpredictable CPN network status. To overcome these challenges, this study formulates a time-varying integer program problem whose goal is to minimize long-term costs, involving switching costs, operational costs, communication costs, and queuing costs. Then we design a group of polynomial-time online algorithms to make online decisions, by taking into account stochastic inputs. Our algorithms adaptively make control decisions by solving delicately constructed subproblems based on the inputs predicted via online learning. Specifically, we first obtain fractional solutions which are transformed into integers for deployment with expectations preserving. Furthermore, we conduct a rigorous proof and establish the competitive ratio which highlights the difference between the performance of our proposed algorithms and the offline optimum. Our comprehensive evaluations, using datasets from real systems, demonstrate that our algorithms outperform multiple alternatives, up to an average of 35% cost reduction, confirming the effectiveness of our algorithms.