A genetic search strategy based on simulated annealing for web mining

Hao Chen,Naizheng Zheng Bian,Beiji Zou, AhHwweYU
2008-01-01
Journal of Computational Information Systems
Abstract:As the web continues to increase in size, the relative coverage of web search engine is decreasing, and search tools that combine the results of multiple search engines are becoming more valuable. We present a genetic search strategy for a search engine by showing that important relation existed between web statistical studies, search engines and optimization techniques. The user query is used to mathematically define a fitness function of Web pages. The simulated annealing genetic algorithm evolves a population of pages and aims at maximizing this fitness function. We define a creation operator which uses the results given by standard search engine. The crossover and mutation operator consist in exploring links. Experimental results have shown that our method leads to pages of qualities that are significantly better than those of the standard search engines. © 2008 Binary Information Press.
What problem does this paper attempt to address?