Research and Implementation of Structure Extraction of Semi-structured Document

杨建武,陈晓鸥
DOI: https://doi.org/10.3969/j.issn.1000-3428.2001.10.007
2001-01-01
Abstract:A model of structure extraction was brought forward in the paper. First, an idea was given that the semantic structure information been extracted at information source through the rules of the relation between semantic structure information and style information. Then, the paper puts forward a model how to extract structure of semi-structured document. The key step and key algorithm were discussed in detail. Last, the extraction method and its application were summarized with an system, which had been constructed based on the scheme. The idea and the method had been used in an applied system with success.
What problem does this paper attempt to address?