An Extended Algorithm for XML Filtering System YFilter

SU Mingshi, ZHANG Shouzhi
DOI: https://doi.org/10.3969/j.issn.1000-3428.2005.21.023
2005-01-01
Abstract: YFilter, an efficient and scalable filtering system for XML documents, is particularly effective for short documents and large numbers of queries. This paper proposes a new index-based technique for answering large numbers of XPath queries. It extends the YFilter algorithm with a dual index built over the incoming XML document, so that the algorithm also handles large XML documents efficiently. The index makes the structural information of the elements in the document available in advance. Eliminating elements that are guaranteed not to be part of any match avoids redundant processing. Finally, an experiment on synthetic XML documents shows that the extended algorithm handles large XML documents efficiently.
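The abstract does not describe the layout of the dual index, so the following Python sketch is only an illustration of the general idea, not the authors' algorithm. It assumes one index from tag name to element ids and a second index from element id to a pre-order region `(tag, start, end, level)`, and it assumes queries use only `/` and `//` steps with plain tag names. The names `build_dual_index`, `query_tags`, `candidate_elements`, and `is_ancestor` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_dual_index(xml_text):
    """Build two assumed indexes over one document.

    by_name: tag name -> list of element ids (pre-order numbers).
    by_id:   element id -> (tag, start, end, level), where start is the
             element's own pre-order number and end is the largest
             pre-order number inside its subtree.
    """
    root = ET.fromstring(xml_text)
    by_name = defaultdict(list)
    by_id = {}
    counter = 0

    def visit(elem, level):
        nonlocal counter
        elem_id = counter
        counter += 1
        by_name[elem.tag].append(elem_id)
        for child in elem:
            visit(child, level + 1)
        # After the subtree is numbered, counter - 1 is its last id.
        by_id[elem_id] = (elem.tag, elem_id, counter - 1, level)

    visit(root, 0)
    return by_name, by_id


def query_tags(xpath_queries):
    """Collect the tag names used by simple queries (assumed subset:
    only / and // steps, no wildcards, no predicates)."""
    tags = set()
    for query in xpath_queries:
        if "*" in query or "[" in query:
            raise ValueError("sketch supports plain tag steps only: " + query)
        for step in query.replace("//", "/").split("/"):
            if step:
                tags.add(step)
    return tags


def candidate_elements(by_name, tags):
    """Ids of elements whose tag occurs in at least one query.
    Every other element cannot match any step and can be skipped."""
    return sorted(i for tag in tags for i in by_name.get(tag, []))


def is_ancestor(by_id, anc_id, desc_id):
    """Structural test answered from the index alone, without re-parsing."""
    _, a_start, a_end, _ = by_id[anc_id]
    _, d_start, d_end, _ = by_id[desc_id]
    return a_start < d_start and d_end <= a_end


by_name, by_id = build_dual_index("<a><b><c/></b><d/><b/></a>")
tags = query_tags(["/a/b", "//c"])
candidates = candidate_elements(by_name, tags)
```

Under these assumptions the `d` element is dropped before any query is evaluated, and ancestor tests between the remaining candidates reduce to comparing stored regions. With wildcard steps, pruning by tag name alone would no longer be safe, which is why the sketch rejects them.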