Online Scheduling Algorithms for Unbiased Distributed Learning over Wireless Edge Networks

Jinlong Pang,Ziyi Han,Ruiting Zhou,Haisheng Tan,Yue Cao
DOI: https://doi.org/10.1016/j.sysarc.2022.102673
IF: 5.836
2022-01-01
Journal of Systems Architecture
Abstract:To realize smart IoT services, such as intelligent video surveillance, smart city and autonomous driving, tremendous amount of distributed machine learning jobs will train unbiased models over large datasets collected by geo-distributed wireless edge network, adopting a parameter server (PS) architecture. The training of unbiased distributed learning (UDL) relies on geo-distributed data and brings high response latency and bandwidth consumption, introducing a new challenge: how to schedule UDL jobs such that the response latency (training time) is minimized, meanwhile reducing the expensive bandwidth cost among geo-distributed sites in the edge wireless network. To address it, we propose two online scheduling algorithms, Okita and O k i t a ∗, to achieve long-term overall cost minimization. Okita schedules UDL jobs at each time slot in a preemptive manner to jointly decide the execution time window, the amount of training data, the number and the location of concurrent workers and PSs in each site, whereas O k i t a ∗ schedules jobs via a non-preemptive fashion. To evaluate the practical effectiveness of the proposed algorithms, we implement both testbed experiments and large-scaled simulations. We show that our proposed algorithms can reduce up to 70% of the total training cost, compared to three classical schedulers in today’s cloud system.
What problem does this paper attempt to address?