HDW: A High Performance Large Scale Data Warehouse

Jinguo You,Jianqing Xi,Chuan Zhang,Gengqi Guo
DOI: https://doi.org/10.1109/imsccs.2008.16
2008-01-01
Abstract:As data warehouses grow in size, ensuring adequate database performance will be a big challenge. This paper presents a solution, called HDW, based on Google infrastructure such as GFS, Bigtable, MapReduce to build and manage a large scale distributed data warehouse for high performance OLAP analysis. In addition, HDW provides XMLA standard interface for front end applications. The results show that HDW achieves pretty good performance and high scalability, which has been demonstrated on at least 18 nodes with 36 cores.
What problem does this paper attempt to address?