Cost model and replacement policy for an XML cache in distributed environment

Jianzhong Li, Fan Li, Zhenying He
2005-01-01
Abstract: To improve query performance over XML documents stored on distributed data sources, an XML cache, which stores some of the XML documents retrieved from those data sources, is proposed. Unlike HTML documents, the XML documents we access are usually generated according to the requirements of a query, and the generation strategy can influence both the network workload and the query cost. The XML cache should therefore choose the best generation strategy for these documents, so as to reduce the total network and query cost as much as possible. This paper presents a cost model that can be used in distributed environments to analyze the overall performance of the XML cache. Based on this cost model, we also propose a greedy replacement strategy.