A Method to Automatically Discover and Classify Deep Web Data Source Using Multi-Classifier

李志涛,刘全,周文云
DOI: https://doi.org/10.1109/csie.2009.435
2009-01-01
Abstract:Recently, the discovery of Deep Web data source and domain-relevant issue attract more and more attentions. This paper proposed a method using multi-classifier to discover and classify the data source of Deep Web. Firstly, It used Naïve bays classifier to class the page into domain relevance or not. Secondly, improved C4.5 Decision tree algorithm was used to identify the query interface. The result of the experiment competed with single decision tree classifier proved this method is effective.
What problem does this paper attempt to address?