Web Information Integration Based on Compressed XML.

Hongzhi Wang, Jianzhong Li, Zhenying He, Jizhou Luo
DOI: https://doi.org/10.1007/978-3-540-39845-5_11
2003-01-01
Abstract: Information integration now extends to web data sources, and XML has become a favored information exchange format. New applications raise the problem that massive amounts of information are often transmitted over the network and must be processed within the limited buffer of a mediator. To process queries on massive data from web data sources effectively, we present an XML compression method based on edit distance for information transmission in information integration. By compressing XML, this method reduces both the transmission time and the buffer space required. We design two different XML compression strategies for transmission and for processing in the mediator, and we discuss how to optimize the combination of these strategies. We also propose query execution algorithms that operate on compressed XML data in the mediator's buffer. We focus on the main operators applied in the mediator to data from wrappers, namely sort, union, join, and aggregation. The paper describes the implementation of these operators on compressed data using two different methods.