Research on Web information extraction system based on intelligence and its design

LIU Ya-Dong, PENG Jian, ZHANG Da-Ping
DOI: https://doi.org/10.3969/j.issn.0490-6756.2009.04.019
2009-01-01
Abstract: With the rapid development of the Internet, a vast amount of information has become available, but this information is embedded in Web pages. To make use of it, the data must first be extracted from those pages. This paper presents EIES, a new intelligence-based Web information extraction system. By improving and applying RoadRunner, it automates information extraction so that no manual work is required during the extraction process. Experiments indicate that the system classifies similar pages and extracts Web information more accurately and more effectively.
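The abstract does not describe how EIES modifies RoadRunner, so the following is only a minimal sketch of the general idea behind RoadRunner-style extraction: two pages generated from the same template are aligned token by token, shared tokens are kept as the template, and mismatching text is treated as a data field. The function names (`tokenize`, `infer_template`, `extract`) and the sample pages are hypothetical, and the alignment uses Python's `difflib` as a stand-in rather than RoadRunner's own matching algorithm; tag mismatches (optional or repeated structures) are not handled.

```python
import re
from difflib import SequenceMatcher


def tokenize(html):
    """Split HTML into tag tokens and text tokens, dropping empty text."""
    parts = re.split(r"(<[^>]+>)", html)
    return [p.strip() for p in parts if p.strip()]


def infer_template(page_a, page_b):
    """Align two pages; keep shared tokens, mark each differing run as a field.

    Assumption: both pages come from the same template. None marks a field.
    """
    a, b = tokenize(page_a), tokenize(page_b)
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    template = []
    for op, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if op == "equal":
            template.extend(a[i1:i2])
        else:
            template.append(None)  # mismatch: treat as a data field
    return template


def extract(template, page):
    """Apply the template to a page; return field values, or None if no fit."""
    pattern = r"\s*".join(
        "(.*?)" if tok is None else re.escape(tok) for tok in template
    )
    match = re.fullmatch(r"\s*" + pattern + r"\s*", page, flags=re.DOTALL)
    return [g.strip() for g in match.groups()] if match else None


# Hypothetical sample pages sharing one layout.
page_a = "<html><b>Title:</b> Databases <i>Price:</i> 30</html>"
page_b = "<html><b>Title:</b> Compilers <i>Price:</i> 45</html>"
page_c = "<html><b>Title:</b> Networks <i>Price:</i> 25</html>"

template = infer_template(page_a, page_b)
print(extract(template, page_c))  # ['Networks', '25']
```

A page that does not fit the inferred template makes `extract` return `None`, which hints at how template fit could serve as a crude signal for grouping similar pages; whether EIES classifies pages this way is not stated in the abstract.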