Design and Implementation of Job Scheduling Algorithm for Multi-User MapReduce Clusters

王凯,吴泉源,杨树强
DOI: https://doi.org/10.3969/j.issn.1006-2475.2010.10.007
2010-01-01
Abstract:As more enterprises start to use data-intensive cluster computing systems such as Hadoop and Dryad for more applications,sharing MapReduce clusters among multiple users that reducing the cost of establishing an independent cluster and the demand of sharing common data sets resources for users is increasing.Based on fair scheduling algorithm,combining with slot allocation delay and priority technology,the paper proposes an improved algorithm.It can achieve better data locality,improve the performance of the system,such as throughput,response time.To meet the differentiated business services,it sets the appropriate for users to ensure special tasks.
What problem does this paper attempt to address?