An Optimized Filter Strategy for XML Query Processing

CHEN Hai-Kun,LI Jian-Zhong,LUO Ji-Zhou
DOI: https://doi.org/10.3969/j.issn.1002-137X.2007.04.030
2007-01-01
Computer Science
Abstract:How to retrieve information user interested in has become a hotspot in research .In this paper, we propose a filter strategy to reduce the number of candidate nodes using double level index. In the filter algorithm, all nodes are coded according to their path type, and clustered based on their codes. In course of processing, we first calculate the target codes of the query according to the structure of XML data. Then we load parts of the index which target codes point to into memory, and use the filter expression to filter those definitely useless nodes without scanning entire index. Furthermore, we also extend the index structure to process twig pattern query for XML . The analysis and the experimental results show that our filter algorithm has a better performance than filter algorithm using Bit vector, and also needs less storage.
What problem does this paper attempt to address?