An Efficient MapReduce Framework for Intel MIC Cluster.

Wenzhu Wang, Qingbo Wu, Yusong Tan, Yaoxue Zhang
DOI: https://doi.org/10.1007/978-3-319-23862-3_13
2015-01-01
Abstract: MapReduce is a distributed programming framework for processing large-scale data sets by employing clusters in a scale-out fashion. However, scaling up a single node is better than scaling out because it incurs less communication overhead. Because Intel MIC offers higher performance than an ordinary CPU, we propose an efficient MapReduce framework for Intel MIC clusters. Our framework provides several new features, such as a fault-tolerance mechanism for MIC management, efficient buffer management in MIC memory, and asynchronous task transfer between the CPU and the MIC. It can manage a large-scale MIC cluster and run applications in a MapReduce-like way. The experimental results show that our system is up to 1.35x and 6.8x faster than Hadoop on an ordinary CPU cluster.