Design and Implementation of a Real-time Scheduling Algorithm for MapReduce

LIU Ji,CHEN Xiang-Lan,DAI Dong,SUN Ming-Ming,ZHOU Xue-Hai
DOI: https://doi.org/10.3969/j.issn.1003-3254.2013.08.024
2013-01-01
Abstract:MapReduce is a popular batch data processing framework in cloud computing field.Sharing MapReduce cluster and meeting the deadlines of jobs is a key problem to be solved.This paper proposes a two phase real-time scheduling algorithm which separate scheduling into job scheduling and task scheduling.It uses sampling method to estimate the task excuting time so that the scheduler can make a decision on how many slots should be assigned to the job and how to calculate the job’s priority.Using delay-scheduling scheme in task scheduling,the"computing locality"problem can be solved well.Experiments result shows that the scheduling algorithm implemented in this paper satisfies the job’s real-time requirement as well as throughput of the cluster.
What problem does this paper attempt to address?