I/O and CPU Balanced XML Keyword Retrieval

Shan Wang
2010-01-01
Abstract:The widespread application of XML(extensive makeup language)makes it a new research focused on the area of information retrieval.Although the precision of XML information retrieval can be improved greatly because of the internal structure in XML documents,the finer retrieval granularity(i.e.elements or passages instead of documents)and more complex scoring and ranking models(e.g.the language model combined with the hierarchical inference network)convert traditional information retrieval applications which are I/O intensive into CPU-bound applications.In view of such a transformation,a new query processing framework of XML retrieval is proposed,which can dynamically balance I/O and CPU workloads to minimize average response time per query by creating two indexes for XML corpus and scheduling subtasks to use different indexes for queries evaluation according to the current state of the system.
What problem does this paper attempt to address?