A Survey of Deep Web Data Integration
LIU Wei, MENG Xiao-Feng, MENG Wei-Yi
DOI: https://doi.org/10.3321/j.issn:0254-4164.2007.09.001
2007-01-01
Chinese Journal of Computers
Abstract: With the rapid development of the World Wide Web, a tremendous amount of information is "hidden" in the Deep Web, and its volume is increasing rapidly. This information can be accessed only through the query interfaces provided by Web databases, which return data as dynamically generated Web pages in response to user queries. Because Web pages are poorly structured and the Deep Web is both unstable and very large, integrating this abundant information automatically and using it effectively is a very challenging task. Deep Web data integration is still an emerging research field with a number of open problems. A great deal of research has been done in this field, but it is unevenly distributed across the issues involved. This paper proposes a framework for Deep Web data integration, and classifies and surveys the key research in the area according to this framework. Finally, the deficiencies of existing work are analyzed and suggestions for future research are put forward.