A Data Mining Approach to XML Dissemination

Xiaoling Wang, Martin Ester, Weining Qian, Aoying Zhou
DOI: https://doi.org/10.1007/978-3-642-17616-6_40
2010-01-01
Abstract: Currently, users' interests are expressed by XPath or XQuery queries in XML dissemination applications. These queries require a good knowledge of the structure and contents of the documents that will arrive, as well as knowledge of XQuery, which few consumers will have. In some cases, where distinguishing relevant from irrelevant documents requires the consideration of a large number of features, formulating such a query may be impossible. This paper introduces a data mining approach to XML dissemination that uses a given document collection of the user to automatically learn a classifier modelling his/her information needs. Also discussed are the corresponding optimization methods that allow a dissemination server to execute a massive number of classifiers simultaneously. The experimental evaluation on several real XML document sets demonstrates the accuracy and efficiency of the proposed approach.