MSMbP: an efficient multi-source schema matching algorithm based on prime number calculation for deep web data integration
Jing Yu, Meng Zhao, Liguo Yang, Guohua Liu
2009-01-01
Journal of Information and Computational Science
Abstract: With the development of the Internet, a growing number of databases have become searchable on the web. This part of the web, which holds a great deal of hidden information, is called the "deep web", and it has brought many new problems to the field of data integration. Schema matching, as the key operation in data integration, has become a focus of data retrieval research on the deep web. In this paper, schema matching methods between the query interface attributes of existing large-scale online databases are studied and, from the perspective of deep web data integration as an entirely new application, a new efficient multi-source schema matching method based on prime number calculation, MSMbP, is proposed. Prime number theory is introduced into the process of schema matching: string matching between attributes is converted into simple calculations on prime numbers, so that simple matches and complex matches are discovered accurately and the efficiency of schema matching is greatly improved. Copyright ©2009 Binary Information Press.
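The abstract does not spell out how attributes are encoded as primes, so the following is only a minimal sketch of one plausible reading, not the authors' MSMbP algorithm. It assumes that each distinct (case-normalized) attribute name is assigned its own prime, that a query interface is encoded as the product of its attributes' primes, and that attributes shared by several sources can then be read off the greatest common divisor of their codes. All function names and the sample schemas are hypothetical, and only simple 1:1 matches on identical names are covered; complex matches are not modeled here.

```python
from functools import reduce
from math import gcd, prod


def normalize(name):
    """Lower-case and trim an attribute label (an assumed normalization)."""
    return name.strip().lower()


def first_primes(n):
    """Return the first n primes using trial division by earlier primes."""
    primes = []
    candidate = 2
    while len(primes) < n:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def assign_primes(schemas):
    """Give every distinct normalized attribute name its own prime."""
    names = sorted({normalize(a) for attrs in schemas.values() for a in attrs})
    return dict(zip(names, first_primes(len(names))))


def encode(attrs, prime_of):
    """Encode one schema as the product of its attributes' primes."""
    return prod(prime_of[n] for n in {normalize(a) for a in attrs})


def shared_attributes(codes, prime_of):
    """Attributes common to all given schema codes, via their GCD."""
    common = reduce(gcd, codes)
    return sorted(name for name, p in prime_of.items() if common % p == 0)


# Hypothetical query interfaces from three sources.
schemas = {
    "books_a": ["Title", "Author", "ISBN"],
    "books_b": ["title", "author", "Publisher"],
    "books_c": ["Title", "Price"],
}
prime_of = assign_primes(schemas)
codes = {src: encode(attrs, prime_of) for src, attrs in schemas.items()}

print(codes)
print(shared_attributes([codes["books_a"], codes["books_b"]], prime_of))
print(shared_attributes(list(codes.values()), prime_of))
```

Under these assumptions, comparing two interfaces costs one GCD plus a divisibility test per known attribute, rather than pairwise string comparisons, which is the kind of saving the abstract alludes to. The integer codes grow quickly with the number of attributes, so a practical variant would need to bound or partition the attribute vocabulary.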