A Feature-Weighted Instance-Based Learner for Deep Web Search Interface Identification

Hong Wang,Qingsong Xu,Youyang Chen,Jinsong Lan
DOI: https://doi.org/10.19026/rjaset.5.4862
2013-01-01
Abstract:Determining whether a site has a search interface is a crucial priority for further research of deep web databases. This study first reviews the current approaches employed in search interface identification for deep web databases. Then, a novel identification scheme using hybrid features and a feature-weighted instance-based learner is put forward. Experiment results show that the proposed scheme is satisfactory in terms of classification accuracy and our feature-weighted instance-based learner gives better results than classical algorithms such as C4.5, random forest and KNN.
What problem does this paper attempt to address?