Extraction and Management of Meta Information on the Domain-Oriented Deep Web

Bo Liu,Jiang Xiang
DOI: https://doi.org/10.1109/icsess.2016.7883185
2016-01-01
Abstract:On the Internet, Deep Web information is hidden in the depths, the ordinary search engines cannot return them directly. In order to provide Deep Web information conveniently for users, a Deep Web information integration system should be set up. However, schemas of Web databases are not known in advance. Thus, it is an important task to extract information from the query interfaces and construct the meta database that is the basis for the Deep Web information system. This paper studies meta information extraction techniques from the query interfaces of Deep Web. It applies the method based on visual features and user-defined rules to get the attribute information of the source query interfaces, and stores them in the meta database of the Deep Web integration system. It solves the problem of managing or maintaining the source information. The experimental results show that the extraction methods are feasible and have good performance.
What problem does this paper attempt to address?