Makespan Minimization for MapReduce Systems with Different Servers

Yiwei Jiang,Yuqing Zhu,Weili Wu,Deying Li
DOI: https://doi.org/10.1016/j.future.2016.07.012
IF: 7.307
2017-01-01
Future Generation Computer Systems
Abstract:In this paper we study MapReduce scheduling on n parallel machines (servers) with different speeds v1≥v2≥⋯≥vn. Each job contains two kinds of tasks: map tasks and reduce tasks. The job’s reduce tasks can only be processed after finishing all its map tasks. We assume that the map tasks are parallelized, i.e., it can be arbitrarily split and parts of the same task can be processed on different machines in parallel, while the reduce tasks are non-parallelizable. We consider both the offline and online scheduling problems. On offline version, if the reduce tasks are non-preemptive, we design an approximation algorithm whose worst case ratio is at most max{1+Δ2−1n,Δ}, where Δ=v1vn is the ratio of the fastest speed to the slowest speed. If the reduce tasks are preemptive, we provide an approximation algorithm with worst case ratio of 2. On online version where jobs arriving over time, we design two heuristics for non-preemptive and preemptive reduce tasks respectively. In experiment, we verify the advantage of our algorithms comparing with the state-of-the-art.
What problem does this paper attempt to address?