A Stack-based HTML to XML Transformation Approach

吴相智,刘卫国,费洪晓
DOI: https://doi.org/10.3969/j.issn.1674-599X.2004.02.015
2004-01-01
Abstract: A large volume of current Web information is published in HTML format. Extracting information from the Web and then reusing it is an important goal in this research field. The authors propose a stack-based HTML to XML transformation approach in order to simplify the information extraction task.
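The abstract does not describe the algorithm itself, so the following is only a minimal sketch of one common way a stack can turn loosely structured HTML into well-formed XML, not the authors' method. It assumes Python's standard `html.parser` module; the class name `StackHtmlToXml`, the helper `html_to_xml`, and the short `VOID_TAGS` list are hypothetical choices made for illustration. Open elements are pushed onto a stack, and a closing tag pops (and closes) everything above its matching open tag, so unclosed elements are repaired along the way.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr

# Illustrative (incomplete) set of HTML elements that never take a closing tag.
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}


class StackHtmlToXml(HTMLParser):
    """Hypothetical sketch: repair tag nesting with a stack of open elements."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.stack = []  # names of elements that are currently open
        self.out = []    # pieces of the XML output, joined at the end

    def _attrs(self, attrs):
        # Valueless attributes (e.g. <input disabled>) reuse the name as value.
        return "".join(
            f" {k}={quoteattr(v if v is not None else k)}" for k, v in attrs
        )

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            # Void elements become self-closing and are never pushed.
            self.out.append(f"<{tag}{self._attrs(attrs)}/>")
            return
        self.stack.append(tag)
        self.out.append(f"<{tag}{self._attrs(attrs)}>")

    def handle_startendtag(self, tag, attrs):
        # Source already wrote <tag/>: emit it unchanged, nothing to push.
        self.out.append(f"<{tag}{self._attrs(attrs)}/>")

    def handle_endtag(self, tag):
        if tag not in self.stack:
            return  # stray closing tag with no matching open tag: drop it
        # Pop and close every element above the match, then the match itself.
        while self.stack:
            top = self.stack.pop()
            self.out.append(f"</{top}>")
            if top == tag:
                break

    def handle_data(self, data):
        self.out.append(escape(data))  # escape &, <, > for XML

    def to_xml(self, html):
        self.feed(html)
        self.close()
        # Close whatever is still open at end of input.
        while self.stack:
            self.out.append(f"</{self.stack.pop()}>")
        return "".join(self.out)


def html_to_xml(html):
    return StackHtmlToXml().to_xml(html)


print(html_to_xml('<div id="main"><p>Hello <b>world<br>again</div>'))
# → <div id="main"><p>Hello <b>world<br/>again</b></p></div>
```

This sketch only guarantees balanced tags. It does not apply HTML's implied-end-tag rules (a second `<li>` is nested inside the first rather than closing it), and it does not wrap multiple top-level elements in a single root, both of which a fuller transformation would need to handle.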