SCHEMA DISCOVERY FOR SEMISTRUCTURED HIERARCHICAL DATA

Fang LIU,He-ping HU,Song-feng LU
DOI: https://doi.org/10.3969/j.issn.1000-1220.2001.01.022
2001-01-01
Abstract:Semistructured data arise when the source does not impose a rigid structure (such as the web) and when data is combined from several heterogeneous data sources. Semistructured data is a self-describing data whose structure is implicit or irregular. The lack of external schema information makes querying and browsing these data inefficient. This paper presents an algorithm discover schema fastly and efficiently. Using a top-down approach with efficient prune strategy, a schema tree can be constructed from an OEM graph.
What problem does this paper attempt to address?