Research on Search Results Optimization Technology with Category Features Integration

Yanxia Qin,Dequan Zheng,Tiejun Zhao
DOI: https://doi.org/10.1007/s13042-011-0037-9
2011-01-01
International Journal of Machine Learning and Cybernetics
Abstract:The optimization of search results has always been the research hot spot in the area of search engine. In previous work, various kinds of document ranking were used to optimize the search results, in which topic partition by clustering has been proved to be a good way. However, the clusters, containing a lot of documents unorganized, still directly limit the retrieval speed. To address this issue, the paper firstly integrates the two methods together to re-rank the documents in clusters. We find that the category features, which have great discernibility for categories, have good effects on the document sequencing. Thereupon we attempt to apply the category features into search results on the basis of the clusters. Related experiments show that our Top N results are more in line with the users’ needs and the retrieval speed can be implicitly improved, which proves that our approach significantly outperforms the original clustering method.
What problem does this paper attempt to address?