Design of Web Crawler for Deep Web Based on ID3 Algorithm

Wang Shunyan,Li Lei,Wu Binghua
DOI: https://doi.org/10.3969/j.issn.1003-3513.2008.06.008
2008-01-01
Abstract:Considering the problem of poor information coverage in Web data mining,this paper proposes a configurable Web crawling method for deep Web which can improve the results performance of a general search engine significantly.It classifies Web pages and manipulates key information of page content in order to make sensible queries.The experiment results also show it.
What problem does this paper attempt to address?