pq-Hash: An Efficient Method for Approximate XML Joins

Fei Li, Hongzhi Wang, Liang Hao, Jianzhong Li, Hong Gao
DOI: https://doi.org/10.1007/978-3-642-16720-1_13
2010-01-01
Abstract: Approximate matching between large tree sets is broadly used in many applications such as data integration and XML de-duplication. However, most existing methods suffer from low efficiency and thus do not scale to large tree sets.