Adapting Skyline Computation To The Mapreduce Framework: Algorithms And Experiments

Boliang Zhang, Shuigeng Zhou, Jihong Guan
DOI: https://doi.org/10.1007/978-3-642-20244-5_39
2011-01-01
Abstract: This paper addresses the problem of skyline computation under the MapReduce framework. MapReduce is a parallel programming model for data-intensive computing applications that runs on a cluster of commercial PCs; its main idea is to decompose a task into subtasks and then reduce their partial results. Based on different data partitioning strategies, three MapReduce-style skyline computation algorithms are developed: MapReduce-based BNL (MR-BNL), MapReduce-based SFS (MR-SFS), and MapReduce-based Bitmap (MR-Bitmap). Extensive experiments are conducted to evaluate and compare the three algorithms under different settings of data distribution, dimensionality, buffer size, and cluster size.
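To make the decompose-then-reduce idea concrete, the following is a minimal single-process sketch of a BNL-style skyline computed in two phases. It is an illustration under stated assumptions, not the paper's MR-BNL: smaller values are assumed to be preferred in every dimension, the data is split round-robin rather than by the paper's partitioning strategies, the BNL window is held entirely in memory (no bounded buffer), and all function names (`dominates`, `bnl_skyline`, `map_phase`, `reduce_phase`, `mapreduce_skyline`) are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(p: Point, q: Point) -> bool:
    """True if p is no worse than q in every dimension and strictly
    better in at least one (assumption: smaller is better)."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def bnl_skyline(points: Sequence[Point]) -> List[Point]:
    """Block-nested-loops skyline with an unbounded in-memory window
    (simplification: a real BNL uses a bounded buffer)."""
    window: List[Point] = []
    for p in points:
        # Discard p if some point already in the window dominates it.
        if any(dominates(w, p) for w in window):
            continue
        # Otherwise evict every window point that p dominates, then keep p.
        window = [w for w in window if not dominates(p, w)]
        window.append(p)
    return window


def map_phase(points: Sequence[Point], num_partitions: int) -> List[List[Point]]:
    """Split the data round-robin (an assumed strategy) and compute
    one local skyline per partition."""
    partitions = [points[i::num_partitions] for i in range(num_partitions)]
    return [bnl_skyline(part) for part in partitions]


def reduce_phase(local_skylines: Sequence[Sequence[Point]]) -> List[Point]:
    """Merge the local skylines and filter them once more to obtain
    the global skyline."""
    merged = [p for sky in local_skylines for p in sky]
    return bnl_skyline(merged)


def mapreduce_skyline(points: Sequence[Point], num_partitions: int = 4) -> List[Point]:
    return reduce_phase(map_phase(points, num_partitions))


if __name__ == "__main__":
    data = [(1, 5), (2, 2), (5, 1), (3, 3), (4, 4), (2, 6)]
    print(sorted(mapreduce_skyline(data)))
```

The two-phase structure is correct for any partitioning because a point on the global skyline is never dominated within its own partition, so it always survives the local step; the final merge only has to remove points that are dominated by a point from a different partition.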