Machine Learning Based Query Expansion in Blog Retrieval

王秉卿,张奇,吴立德,黄萱菁
DOI: https://doi.org/10.3969/j.issn.1003-0077.2008.06.017
2008-01-01
Abstract:A novel query expansion approach is presented in this paper,which applys the machine learning technique to the query expansion.It improves the retrieval performance by training a machine learning modular to predict and select the query expansion words.With the pseudo-relevance feedback,a set of candidate expansion words are generated for a certain topic.Then a Support Vector Machine(SVM) judges on these candidate words and forms an optimized query by selecting the top candidate words.To train such a SVM for query word judgment is difficult because the training data set is unavailable.This issue is resolved by generating the training data set via the retrieval results and evaluation tools available.In the opinion retrieval task of BLOG TRACK held by the TREC conference,we use this query expansion method to improve the Mean Average Precision(MAP) by 33.1% compared with the baseline result.
What problem does this paper attempt to address?