Website Search Engine Design and Implementation Based on a Document Representation Model

Hui JIANG, Xiao-hua YANG, Zhi-ming LIU, Shi-yu YAN, Jia-yu MA, Xiao-yun LI, Meng LI, Zuo ZHOU
DOI: https://doi.org/10.3969/j.issn.1673-0062.2013.04.018
2013-01-01
Abstract: According to comprehensive information theory, epistemological information is the trinity of syntactic, semantic, and pragmatic information. Making better use of pragmatic information can improve the quality of information retrieval. A document representation model based on both queries and content makes better use of pragmatic information and helps improve the precision of a website search engine. Lucene is an open-source full-text search engine architecture developed in Java. We use Lucene to design and implement a website search engine based on this query-and-content document representation model. Experimental results show that the model can effectively improve precision in information retrieval.
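The abstract does not give the model's formula, so the following is only a minimal sketch of one way a document could be represented by both its content and the queries associated with it. Everything here is an assumption made for illustration: the `Document` class, the `past_queries` field (standing in for pragmatic information such as queries that previously led users to the page), the cosine scoring, and the mixing weight `alpha`. It is written in plain Python rather than against the Lucene API and is not the authors' implementation.

```python
from collections import Counter
import math


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; a real analyzer does more)."""
    return text.lower().split()


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class Document:
    """Hypothetical document with two representations: content and past queries."""

    def __init__(self, doc_id, content, past_queries=()):
        self.doc_id = doc_id
        # Content view: terms taken from the page text.
        self.content_terms = Counter(tokenize(content))
        # Query view: terms taken from queries associated with this page (assumed data).
        self.query_terms = Counter(
            t for q in past_queries for t in tokenize(q)
        )


def score(query, doc, alpha=0.5):
    """Weighted mix of content similarity and query-history similarity.

    alpha=1.0 falls back to content-only ranking; alpha is a hypothetical knob.
    """
    q = Counter(tokenize(query))
    return (alpha * cosine(q, doc.content_terms)
            + (1 - alpha) * cosine(q, doc.query_terms))


def search(query, docs, alpha=0.5):
    """Return document ids ordered by descending combined score."""
    ranked = sorted(docs, key=lambda d: score(query, d, alpha), reverse=True)
    return [d.doc_id for d in ranked]
```

In this sketch, a page whose text never mentions the query terms can still rank well if earlier queries with those terms were associated with it, which is one plausible reading of how query-side information might raise precision over content-only matching.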