Closed Cubing on PC Clusters

游进国,奚建清,张平健,刘艳霞
DOI: https://doi.org/10.3969/j.issn.1002-137x.2009.06.039
2009-01-01
Computer Science
Abstract:The closed cube is a very efficient technology for the data cube compression in OLAP,but its parallel algorithm is little studied in the literature.This paper presented a solution that is easy to be implemented and applied.It parallelizes the closed cube construction and the query answering based on the MapReduce framework over low cost share nothing PC clusters.The experiments show that our approach can rapidly process huge data sets with at least 10 million rows and get pretty good linear speedup.Further,it compresses data cubes much greater.
What problem does this paper attempt to address?