WIEAS: Helping to Discover Web Information Sources and Extract Data from Them

Liyu Li,Shiwei Tang,Dongqing Yang,Tengjiao Wang,Zhi-hong Deng,Zhihua Su
DOI: https://doi.org/10.1007/978-3-540-24655-8_79
2004-01-01
Abstract:In recent years, more and more information appeared on the web. Extracting information from the web and converting them into regular format become significantly important work. After observing a number of web sites, we found that most of useful information is contained in the web sources, which have a large number of similarly structured web documents. So in this paper we present an approach for discovering the useful information sources from the web and extracting information from them. A useful web information source discovering method and a novel information extraction method are proposed. We also develop a prototype system WIEAS (Web Information Extraction, Analysis And Services) to implement our idea, and use the information extracted by WIEAS to provide plentiful services.
What problem does this paper attempt to address?