Efficient indexing and querying algorithm for large-scale XML data

Weimin Qu,Junlin Zhang,Le Sun,YuFang Sun
2003-01-01
Ruan Jian Xue Bao/Journal of Software
Abstract:To optimize the query evaluation of extensible markup language (XML) data, several indexing schemes were proposed in recent years. However, most of these schemes result in poor performance when XML data have large size or intricate structure. To overcome these limitations, the paper proposes I-index which is a novel indexing and querying scheme to evaluate partial matching path queries. I-index constructs another index on original path indexes, and achieves the target set of queries by filtering scratch label path candidates backwardly. The paper also builds a cache system to accelerate the processing of queries, which is called Qcache. This paper presents the architecture of I-index along with algorithms to construct I-index and evaluate partial matching path queries. Experimental results confirm that I-index improves query processing greatly, with the performance gap increasing with the size and structure complexity of source XML data.
What problem does this paper attempt to address?