A Uniform System for Publishing and Maintaining XML Data
Byron Choi,Wenfei Fan,Xibei Jia,Arek Kasprzyk
DOI: https://doi.org/10.1016/B978-012088469-8.50132-7
2004-01-01
Abstract: the Web. To exchange data currently residing in databases, one needs to publish it in XML, i.e., to extract data from the database and transform it into an XML format. In practice, data publishing is often done with a predefined "schema". A community agrees on a certain schema, and subsequently all members of the community exchange their data w.r.t. the predefined schema, by ensuring that their published (target) XML data conforms to the fixed schema. This is called schema-directed XML publishing. The need for this is particularly evident in biological data exchange and services. However, it is nontrivial to ensure that the target XML data conforms to a given schema. The difficulty arises from, among other things, recursion in a target schema, which is common in, e.g., biological ontologies [7]. With XML publishing also comes an increasing need for maintaining target XML data. The underlying source data often changes and evolves, and the source updates should be reflected in its XML target accurately and efficiently. A naive approach would be to recompute the XML target from scratch in response to source data changes. This is unrealistic in many applications, where XML publishing involves voluminous data and may take hours to complete. This suggests that one needs to deal with updates incrementally: propagate the updates from the source data to its XML target with minimal recomputation. While this is reminiscent of traditional database view maintenance, incremental updates are more challenging for hierarchical and possibly recursive XML views constrained by a predefined schema.
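The contrast between recomputing the target from scratch and propagating a source update incrementally can be illustrated with a toy sketch. This is not the system described in the paper: the function names `publish` and `apply_insert`, the `term` element, and the `(child, parent)` row layout are all hypothetical, and the source is assumed to be a simple tree-shaped hierarchy with no cycles and a single parent per term.

```python
import xml.etree.ElementTree as ET


def publish(rows, root_id):
    """Naive approach: rebuild the whole XML target from (child, parent) rows.

    Assumes the rows form a tree rooted at root_id (hypothetical toy schema).
    """
    children = {}
    for child, parent in rows:
        children.setdefault(parent, []).append(child)

    def build(term_id):
        # The recursive target schema is mirrored by a recursive builder.
        elem = ET.Element("term", id=term_id)
        for child in sorted(children.get(term_id, [])):
            elem.append(build(child))
        return elem

    return build(root_id)


def apply_insert(target, child, parent):
    """Incremental approach: propagate one inserted source row into the target."""
    for elem in target.iter("term"):
        if elem.get("id") == parent:
            ET.SubElement(elem, "term", id=child)
            # Keep siblings ordered by id so the result matches publish().
            elem[:] = sorted(elem, key=lambda e: e.get("id"))
            return target
    raise KeyError(f"parent term {parent!r} not found in target")


rows = [("b", "a"), ("c", "a")]
target = publish(rows, "a")            # initial full publication
apply_insert(target, "d", "b")         # source insert propagated in place
recomputed = publish(rows + [("d", "b")], "a")
print(ET.tostring(target) == ET.tostring(recomputed))
```

In this sketch the incremental path touches only the affected parent element, whereas `publish` revisits every source row. A real setting would also have to handle deletions, shared subtrees, and validation against the target schema, none of which is attempted here.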
In response to the need, we proposed a new approach for schema-directed publishing of relational data in XML,
Supported in part by NSF Career Award IIS-0093168, NSFC 60228006 and EPSRC GR/S63205/01.
Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.