Semi-structured data extraction and schema knowled

Chen Enhong, Wang Xufa
2001-01-01
Abstract: A semi-structured data extraction method is proposed that obtains the useful information embedded in a group of related web pages and stores it in OEM (Object Exchange Model). A data mining method is then applied to discover the schema knowledge implicit in the semi-structured data. This knowledge helps users understand the structure of information on the web more deeply and thoroughly, and it also provides an effective schema for querying web information.
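The abstract does not give the extraction or mining procedure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each extracted page becomes one OEM-style object tree and that schema knowledge can be approximated by the label paths shared by at least a minimum number of pages. The names `OEMObject`, `label_paths`, `frequent_paths`, and the sample records are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class OEMObject:
    """One OEM-style node: a label plus an atomic value or child objects."""
    label: str
    value: object = None                          # used by atomic objects
    children: list = field(default_factory=list)  # used by complex objects


def label_paths(obj, prefix=()):
    """Yield the label path of obj and of every descendant of obj."""
    path = prefix + (obj.label,)
    yield path
    for child in obj.children:
        yield from label_paths(child, path)


def frequent_paths(roots, min_support):
    """Return label paths that occur in at least min_support root objects."""
    counts = Counter()
    for root in roots:
        # A set is used so each path counts once per root object.
        counts.update(set(label_paths(root)))
    return {p: c for p, c in counts.items() if c >= min_support}


# Two hypothetical page records, as they might look after extraction.
pages = [
    OEMObject("book", children=[OEMObject("title", "A"), OEMObject("price", 10)]),
    OEMObject("book", children=[OEMObject("title", "B"), OEMObject("author", "X")]),
]
schema = frequent_paths(pages, min_support=2)
```

With these two records, only the paths `book` and `book/title` appear in both, so they form the discovered schema, while `price` and `author` are dropped as page-specific. A path-frequency threshold is one simple stand-in for the data mining step; the paper itself may use a different technique.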