Study of Data-Pretreatment for Full-Text Search System

HAN Sheng,LIU Guang-zhi
DOI: https://doi.org/10.3969/j.issn.1673-629x.2006.03.073
2006-01-01
Abstract:The application of full-text search has caused a revolution of the information retrieval field.It is the core that the file database researches and develops.In a full-text search system,the setting-up of the index database of full text is a systematic foundation.Its project organization influences the final search efficiency of searching algorithm and system of the full-text search engine directly.This paper introduces such data-pretreatment technology as index database structural design,text index technology,etc.Also introduces that in the fulltext retrieval system mainly,and the data processing procedure of index database of full-text retrieval system.Finally,studied the produce-algorithms of index database of full-text retrieval system on this basis,provided produce-algorithm of index database under two kinds of situations: individual file and batch processing.
What problem does this paper attempt to address?