A patent retrieval method based on automatic query expansion

Yang Shuai,Wang Feng,Lin Lanfen,Zhu Xiaowei,Xie Fei
DOI: https://doi.org/10.3969/j.issn.2095-2783.2013.10.023
2013-01-01
Abstract:Existing patent retrieval methods cannot effectively capture user's query intents due to the lack in query expansion.To solve this problem,we propose a novel patent retrieval method based on automatic query expansion.Considering the characteristics of patent documents,an improved TF-IDF scheme is first adopted to extract patent domain terms and build the domain vocabularies.At the retrieval stage,query inputs are analyzed to extract key words,and then the field of query and the difficulty of query expansion are determined based on domain vocabularies.Furthermore,according to the term distribution variation analysis on pseudo related documents,the pseudo relevance feedback(PRF)-based automatic query expansion techniques are utilized to generate and rank the candidate expansion terms.At last,the expansion terms are combined with original query conditions to compose the final query conditions for searching.The comparative experiment results show that our method achieves better recall and average precision.
What problem does this paper attempt to address?