SCANCHUNK:AN EFFICIENT ALGORITHM FOR HUNTING DENSE REGIONS IN DATA CUBE

ZHOU Bo
DOI: https://doi.org/10.3321/j.issn:0254-4164.1999.06.011
1999-01-01
Chinese Journal of Computers
Abstract:MOLAP and ROLAP are two main approaches of building OLAP system. MOLAP is good for query performance but suffers from storage inefficiency when the data cube is sparse. ROLAP can be built on mature RDBMS technology but its performance is not as competitive. The paper presents a new structure of data cube based on the dense regions. Following the new structure, the MOLAP and ROLAP approaches can be integrated to obtain both high query performance and space efficiency. The core of building the new structure lies in hunting dense regions from raw data. The dense region hunting problem is defined as an optimization problem in this paper. An efficient algorithm named ScanChunk has been developed. The accuracy and complexity of algorithm ScanChunk have been analyzed and extensive performance studies have been performed. The experimental results clearly show that ScanChunk is efficient and effective in locating dense region in large database.
What problem does this paper attempt to address?