Research on Optimized MapReduce Model of Hadoop Cloud Plat-Form

ZHANG Hong,WANG Xiaoming,CAO Jie,MA Yanhong,GUO Yirong,WANG Min
DOI: https://doi.org/10.3778/j.issn.1002-8331.1604-0388
2016-01-01
Abstract:Sequential control of running mechanism of MapReduce model on Hadoop platform can lead to waste of com-puting resources. From the perspective of the fine-grained parallel data processing of each node, combined with multi-threads technique of Java shared memory, this paper optimizes MapReduce model and puts forward a MapReduce+OpenMP framework. This model is a distributed and parallel computing architecture based on Hadoop cloud platform, which combines computing resources of coarse and fine granularity. After programming and realizing on the GPS trajectory data of the taxi in the Hadoop distributed cluster environment, the results show that this distributed parallel computing model can really improve the computing efficiency of processing big data set, and it is an effective optimization and improvement to the MapReduce model of big data processing.
What problem does this paper attempt to address?