Web Information Extractor Based on Extended Tag Graph

Liang WANG,Zhengyu ZHU
DOI: https://doi.org/10.3969/j.issn.1000-3428.2005.08.060
2005-01-01
Abstract:A new Web information extractor is discussed. It is based on extend tag graph (ETG), and has the ability to separate the data from the pattern data. This extractor is used in Web information retrieval, with supporting effective real-time information retrieval, extract and reform in tag level inside the Web page. Besides the design of the extractor, it also discusses its practice in experimental system.
What problem does this paper attempt to address?