Domain-Specific Information Retrieval Based on Improved Language Model

Kai Kang,Kunhui Lin,Changle Zhou,Feng Guo
DOI: https://doi.org/10.1109/FSKD.2007.261
2007-01-01
Abstract:There are two key ingredients in the general framework of language models used in information retrieval, one is importance weighting, the other is word relationship computing. A series of improvements are made to these ingredients of the general framework of language models which is used in domain-specific information retrieval. First, an EM algorithm is proposed to estimate the importance weights of query terms, and the Bayesian smoothing is used to compute the productive probabilities of important terms. Next, a new algorithm based on Dynamic Bayesian Network is proposed for obtaining the explanation probabilities between terms. Experiment shows that the improved model performs remarkably better for domain-specific information retrieval than some traditional retrieval techniques, and the extended framework has good expansibility.
What problem does this paper attempt to address?