Modeling the Performance of MapReduce under Resource Contentions and Task Failures

Xiaolong Cui,Xuelian Lin,Chunming Hu,Richong Zhang,Chengzhang Wang
DOI: https://doi.org/10.1109/cloudcom.2013.28
2013-01-01
Abstract:MapReduce is a widely used programming model for large scale data processing. In order to estimate the performance of MapReduce job and analyze the bottleneck of MapReduce job, a practical performance model for MapReduce is needed. Many works have been done on modeling the performance of MapReduce jobs. However, existing performance models ignore some important factors, such as I/O congestions and task failures over cluster, which may significantly change the execution costs of MapReduce job. This paper, aiming at predicting the execution time of a MapReduce job, presents an enhanced performance model that takes the resource contention and task failures into consideration. In addition, the experimental results show that the model is more accurate than those without considering the contention and failure factors.
What problem does this paper attempt to address?