A novel XML approximate querying method with ranking

Da-xin LIU,Tong WANG
DOI: https://doi.org/10.3969/j.issn.1006-7043.2006.z1.086
2006-01-01
Abstract:In order to deal with the retrieval of large scale of heterogeneous XML documents, both information retrieval and data mining knowledge should be applied for approximate match, for traditional XML query language (such as XPath, XQuery) is no longer fit for the situation. A novel approximate querying and ranking method is presented. According to the hierarchical path extracted from XML trees, the XML Document Matrix is at first mapped into Vector Space. Then the Singular Value Decomposition is applied to delete the correlated redundancy and reduce the dimensionality. Finally, the object query vector explores in the reduced retrieval space to match the similar documents thereby returning ranked query results. The DBLP dataset is adopted for our test and the ranking query results prove the efficiency of the proposed method.
What problem does this paper attempt to address?