Document Representation Model Based on Query and Content

阳小华,周座
DOI: https://doi.org/10.3969/j.issn.1673-0062.2010.01.011
2010-01-01
Abstract:In information retrieval,the quality of a document representation model is one of the important factors which affect retrieval performance.According to the comprehensive information theory,epistemology information is the trinity of syntactic information,semantic information and pragmatic information.The mainstream of document representation models at present primarily utilize syntactic and semantic information while are devoid of pragmatic information,which is the bottle-neck of retrieval performance improving.In this thesis,we present a document representation model based on users' query behavior and documents'content,in which the pragmatic information from users' implicit feedback and the semantic and syntactic information from documents is integrated to dynamically regulate the key-weight of index database.This model can consequently improve recall and precision rate in information retrieval.
What problem does this paper attempt to address?