Research of Hadoop Performance Optimization Based on the Scheduler

LIU Juan, DOU Yu-sheng, HE Chen, TANG Hong
DOI: https://doi.org/10.3969/j.issn.1000-7024.2013.01.036
2013-01-01
Abstract: To improve the performance of the Hadoop scheduler and reduce task response time in a Hadoop cluster, a dynamic scheduling algorithm based on the CPU occupancy rate is proposed. First, traditional performance optimization methods are compared, and their lack of dynamism and flexibility is identified as the key problem. Then the default task scheduling model is analyzed in depth, and CPU occupancy is adopted as a load index that is fed back in a cyclic calculation, so that the scheduler adapts dynamically to load changes. Experimental results show that the algorithm can effectively improve the performance of a Hadoop cluster.
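The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea of a feedback loop driven by CPU occupancy. Everything in it is an assumption made for illustration: the names `NodeState`, `adjust_slots`, and `feedback_round`, the thresholds 0.5 and 0.85, the slot limits, and the one-slot step size are hypothetical and are not taken from the paper or from Hadoop's API.

```python
from dataclasses import dataclass


@dataclass
class NodeState:
    """Hypothetical per-node record kept by the scheduler."""
    name: str
    max_slots: int     # current cap on concurrent tasks (assumed tunable)
    cpu_usage: float   # latest CPU occupancy, in the range [0.0, 1.0]


def adjust_slots(node, low=0.5, high=0.85, floor=1, ceiling=8):
    """Move the slot cap one step based on the CPU load index.

    The thresholds and limits are illustrative assumptions.
    """
    if node.cpu_usage > high:
        # Node is overloaded: accept fewer concurrent tasks.
        node.max_slots = max(floor, node.max_slots - 1)
    elif node.cpu_usage < low:
        # Node has spare capacity: accept more concurrent tasks.
        node.max_slots = min(ceiling, node.max_slots + 1)
    # Between the thresholds the cap is left unchanged.
    return node.max_slots


def feedback_round(nodes, samples):
    """One feedback cycle: record fresh CPU samples, then adjust each node."""
    for node in nodes:
        node.cpu_usage = samples[node.name]
        adjust_slots(node)
    return {node.name: node.max_slots for node in nodes}
```

Calling `feedback_round` once per reporting interval (for example, on each heartbeat, which is again an assumption) would make the slot caps track load changes over time instead of staying fixed at a configured value.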