Abstract: Because emerging applications are computation-intensive and edge computing servers have limited computational capability, a computation task must be executed on multiple edge servers in a distributed and cooperative manner. However, the large amount of information exchanged among the edge servers is a major obstacle to improving computing performance. By exploiting excess computational resources, coded MapReduce provides an effective way to reduce the communication load. In this paper, we develop a stochastic load scheduling framework that completes computation tasks with coded MapReduce while accounting for the intrinsic tradeoff between the communication and computation loads. Our goal is to minimize the communication load under time-varying excess computational resources. We first reduce this problem to a task scheduling problem by exploiting the computing-repetition property of the coded MapReduce framework. The resulting task scheduling problem is still a stochastic optimization problem and is generally difficult to solve. In the offline setting, we obtain the optimal computation load scheduling algorithm by adopting the augmented Lagrangian method. In the online setting, we derive a worst-case performance bound for the online equal task scheduling (ETS) algorithm by using competitive analysis. Furthermore, we use past state information of the computing resources for pre-planning and propose an improved, learning-based algorithm built on the ETS algorithm. Finally, simulations show that the proposed algorithms outperform the conventional algorithms and that the performance gap between the online and offline algorithms is fairly small.
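
The communication-computation tradeoff mentioned in the abstract can be illustrated with a small sketch. It assumes the tradeoff curve commonly quoted in the coded distributed computing literature, where with K servers and computation load r (each map task repeated on r servers) the normalized communication load is (1/r)(1 - r/K) with coding and 1 - r/K without. Whether the paper uses exactly this model is an assumption. The helper names `coded_load`, `uncoded_load`, `equal_split` and `schedule_load` are hypothetical, and the equal split is only a plausible reading of "equal task scheduling", not the authors' algorithm.

```python
from fractions import Fraction


def _check(r: int, k: int) -> None:
    # Computation load r must be an integer between 1 and the server count k.
    if not 1 <= r <= k:
        raise ValueError("computation load r must satisfy 1 <= r <= k")


def uncoded_load(r: int, k: int) -> Fraction:
    """Normalized communication load without coding (assumed model)."""
    _check(r, k)
    return 1 - Fraction(r, k)


def coded_load(r: int, k: int) -> Fraction:
    """Normalized communication load with coded MapReduce (assumed model)."""
    _check(r, k)
    return Fraction(1, r) * (1 - Fraction(r, k))


def equal_split(total_tasks: int, slots: int) -> list[int]:
    """Spread tasks as evenly as possible over time slots (hypothetical ETS-like split)."""
    base, extra = divmod(total_tasks, slots)
    return [base + 1 if i < extra else base for i in range(slots)]


def schedule_load(tasks: list[int], loads: list[int], k: int) -> Fraction:
    """Total communication load when slot t runs tasks[t] tasks at computation load loads[t]."""
    if len(tasks) != len(loads):
        raise ValueError("tasks and loads must have the same length")
    return sum((n * coded_load(r, k) for n, r in zip(tasks, loads)), Fraction(0))


if __name__ == "__main__":
    k = 10                      # number of edge servers (illustrative value)
    tasks = equal_split(10, 3)  # 10 tasks over 3 slots
    loads = [1, 2, 5]           # per-slot computation load allowed by excess resources
    print(tasks, schedule_load(tasks, loads, k))
```

In this sketch, slots with more excess computational resource (larger r) contribute less communication per task, which is why how tasks are distributed across time-varying slots affects the total communication load.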

Load-Aware Hybrid Scheduling in Large Compute Clusters

On-Line Scheduling of Parallel Jobs in Heterogeneous Multiple Clusters

Load scheduling for distributed edge computing: A communication-computation tradeoff

Preemptive and Low Latency Datacenter Scheduling via Lightweight Containers

Container Scheduling Algorithms for Distributed Cloud Environments

Online Approximation Scheme for Scheduling Heterogeneous Utility Jobs in Edge Computing

Agon: A Scalable Competitive Scheduler for Large Heterogeneous Systems

Co-Scheduler: A Coflow-Aware Data-Parallel Job Scheduler in Hybrid Electrical/Optical Datacenter Networks

Online Job Scheduling in Distributed Machine Learning Clusters

Hybrid Workload Scheduling on HPC Systems

CloudCoaster: Transient-aware Bursty Datacenter Workload Scheduling

Scheduling Distributed Deep Learning Jobs in Heterogeneous Cluster with Placement Awareness

Low Complexity Hierarchical Scheduling for Diverse Datacenter Jobs

Optimizing Internal Overlaps by Self-Adjusting Resource Allocation in Multi-Stage Computing Systems

When Computing Meets Heterogeneous Cluster: Workload Assignment in Graph Computation

Cluster Scheduler on Heterogeneous Cloud

Geographical Job Scheduling in Data Centers with Heterogeneous Demands and Servers

Hybrid Workflow Scheduling on Edge Cloud Computing Systems

Low-latency job scheduling with preemption for the development of deep learning

A Near Optimal Multi-Faced Job Scheduler For Datacenter Workloads

A HPC Co-Scheduler with Reinforcement Learning