Log mining to improve the performance of site search

Gui-Rong Xue,Hua-Jun Zeng,Zheng Chen,Wei-Ying Ma,Chao-Jun Lu
DOI: https://doi.org/10.1109/WISEW.2002.1177868
2002-01-01
Abstract:Despite of the popularity of global search engines,people still suffer from low accuracy of site search. Theprimary reason lies in the difference of link structuresand data scale between global Web and website, whichleads to failures of traditional re-ranking methods suchas HITS, PageRank and DirectHit. This paper proposes anovel re-ranking method based on user logs withinwebsites. With the help of website taxonomy, we mine forgeneralized association rules and abstract accesspatterns of different levels. Mining results aresubsequently used to re-rank the retrieved pages. One ofthe advantages of our mining algorithm is that it resolvesthe diversity problem of user's access behavior anddiscovers general patterns. Experiment shows that theproposed method outperforms keyword-based method by15% and DirectHit by 13% respectively.
What problem does this paper attempt to address?