Design and Implementation of Enhanced Parallel Computing Framework System

Huake Tu, Hua Zou, Rongheng Lin
DOI: https://doi.org/10.19335/j.cnki.2095-6649.2012.12.005
2012-01-01
Abstract: MapReduce is a parallel computing framework introduced by Google for large-scale parallel data processing. Hadoop, an open-source implementation of MapReduce, has been widely used in many projects. However, users who want to use this framework must spend time configuring the Hadoop cluster, learning the programming API, and operating MapReduce jobs. This paper proposes the Enhanced Parallel Computing Framework System, which provides users with a parallel computing cluster and simplifies the configuration, programming, uploading, and operating procedures. In particular, the system defines a set of annotations with which users can quickly build their own MapReduce jobs.