Research and Design of Performance Monitoring Tool for Hadoop Clusters

chongyang xue,feng liu,honghui li,jun xiao,zhen liu
DOI: https://doi.org/10.1007/978-81-322-1759-6_93
2014-01-01
Abstract:Hadoop is an open source platform. Because of its open source, high fault tolerance, and scalability, it has been widely used to deal with big data in many IT industries. It becomes particularly important and difficult to monitor and analyze the performance of a Hadoop cluster as the scale of the cluster grows. Based on Hadoop source code, this paper proposed a performance monitoring tool named Hadoop Monitor. Integrated with a job scheduling framework, such as Quartz, the Hadoop Monitor grasps the performance data from Hadoop cluster periodically and then generates a performance report. This tool can provide a great help to analyze the performance of Hadoop cluster.
What problem does this paper attempt to address?