Online Approximation Scheme for Scheduling Heterogeneous Utility Jobs in Edge Computing

Chi Zhang, Haisheng Tan, Haoqiang Huang, Zhenhua Han, Shaofeng H-C Jiang, Guopeng Li, Xiang-Yang Li
DOI: https://doi.org/10.1109/tnet.2022.3193381
2023-01-01
Abstract: Edge computing systems typically handle a wide variety of applications that exhibit diverse degrees of sensitivity to job latency. Therefore, the underlying job dispatching and scheduling mechanism needs to consider a multitude of utility functions of the job response time. Nonetheless, previous studies in edge computing mainly focused on optimizing a single utility function across all jobs, e.g., linear, sigmoid, or hard deadline. In this paper, we design online job dispatching and scheduling strategies in which different jobs can be categorized by different non-increasing utility functions. Our goal is to maximize the total utility of all scheduled jobs. We first prove that no online deterministic algorithm can achieve a competitive ratio better than the lower bound $\Omega \left({\frac {1}{\sqrt {\epsilon }}}\right)$ under the $(1+\epsilon)$-speed augmentation model. We then propose an online algorithm, named O4A, for handling jobs with heterogeneous utilities, and prove that O4A is $O\left({\frac {1}{\epsilon ^{2}}}\right)$-competitive. We also design its distributed version, DO4A. We implement O4A and DO4A on an edge computing testbed running deep learning inference jobs. With the production trace from Google Cluster, our experimental and large-scale simulation results indicate that O4A can increase the total utility by up to 50% compared with state-of-the-art methods. In addition, DO4A loses only 2% in performance relative to O4A while incurring a small communication overhead. Moreover, both algorithms are robust to estimation errors in job processing time and transmission delay.
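To make the objective concrete, the following is a minimal sketch of the three utility shapes the abstract names (linear, sigmoid, hard deadline) and of the total utility being maximized. It is not the O4A algorithm, which the abstract does not specify. All function names, parameters, and the example values are assumptions chosen for illustration.

```python
import math
from typing import Callable, List, Tuple

# A utility function maps a job's response time to a non-negative value.
Utility = Callable[[float], float]


def linear_utility(max_value: float, slope: float) -> Utility:
    """Utility decays linearly with response time and is floored at zero."""
    return lambda t: max(0.0, max_value - slope * t)


def sigmoid_utility(max_value: float, midpoint: float, steepness: float) -> Utility:
    """Utility drops smoothly around `midpoint`; non-increasing when steepness > 0."""
    return lambda t: max_value / (1.0 + math.exp(steepness * (t - midpoint)))


def hard_deadline_utility(max_value: float, deadline: float) -> Utility:
    """Full utility if the job finishes by the deadline, zero otherwise."""
    return lambda t: max_value if t <= deadline else 0.0


def total_utility(jobs: List[Tuple[Utility, float]]) -> float:
    """Sum of per-job utilities, given (utility function, response time) pairs."""
    return sum(utility(response_time) for utility, response_time in jobs)


# Hypothetical jobs: each pairs its own utility function with an achieved response time.
example_jobs = [
    (linear_utility(max_value=10.0, slope=2.0), 3.0),
    (sigmoid_utility(max_value=8.0, midpoint=2.0, steepness=1.5), 2.0),
    (hard_deadline_utility(max_value=5.0, deadline=1.0), 1.5),
]
example_total = total_utility(example_jobs)
```

A scheduler in this setting chooses where and when each job runs, which determines the response times passed to `total_utility`. Because each job carries its own function, the same delay can cost one job nothing and another job everything.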