Ontology-Based Information Extraction from Web Sources

周明建,高济,李飞
DOI: https://doi.org/10.3321/j.issn:1003-9775.2004.04.027
2004-01-01
Abstract:Based on the ontology, this paper regards the hierarchy of information to be extracted as the path of information extraction, defines an information item ontology of Web page and automatic creates a construction ontology by parsing the Web page. Using these two ontologies, an approach to semi-automatically generate information extraction rules is presented for efficiently collecting information from Web.
What problem does this paper attempt to address?