The Algorithm of New Conception Discovery Based on Indexing Information

XIA Ying, LIU Gong-shen, Li Xiang
DOI: https://doi.org/10.3969/j.issn.1007-757X.2007.01.003
2007-01-01
Abstract: The discovery and recognition of new concepts on the network is a fundamental technology in the field of information processing, which helps in sensing public interest on the Internet and recognizing information of interest. Because it makes rational use of indexing information, the algorithm proposed in this paper can process large volumes of information. To improve the precision of new concept discovery, the algorithm adopts not only traditional word segmentation and string frequency statistics, but also the automatic combination of Chinese character components and string co-occurrence between websites. Experiments show that 75 percent of the potential new concepts it finds are acceptable. The new algorithm can fully meet the requirements of information processing applications.
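The abstract does not give the algorithm's details, so the following is only a minimal sketch of how two of the named ingredients, string frequency statistics and string co-occurrence between websites, might be combined. Everything in it is an assumption made for illustration: the function names `candidate_strings` and `discover_new_terms`, the thresholds `min_freq` and `min_sites`, the plain character n-gram candidates (standing in for the paper's word segmentation and component combination), and the `known_words` dictionary filter. It is not the authors' implementation.

```python
from collections import Counter, defaultdict


def candidate_strings(text, min_len=2, max_len=4):
    """Yield every contiguous character string of length min_len..max_len.

    Assumption: raw character n-grams stand in for the candidates that the
    paper derives from word segmentation and component combination.
    """
    for n in range(min_len, max_len + 1):
        for i in range(len(text) - n + 1):
            yield text[i:i + n]


def discover_new_terms(pages_by_site, known_words, min_freq=3, min_sites=2):
    """Return candidate strings that are frequent and occur on several sites.

    pages_by_site: dict mapping a site name to a list of page texts.
    known_words:   set of strings already in the dictionary (skipped).
    min_freq:      hypothetical threshold on total occurrences.
    min_sites:     hypothetical threshold on distinct sites containing it.
    """
    freq = Counter()          # total occurrences of each candidate
    sites = defaultdict(set)  # sites on which each candidate appears
    for site, pages in pages_by_site.items():
        for page in pages:
            for s in candidate_strings(page):
                if s in known_words:
                    continue
                freq[s] += 1
                sites[s].add(site)
    return sorted(
        s for s in freq
        if freq[s] >= min_freq and len(sites[s]) >= min_sites
    )
```

Requiring a candidate to appear on more than one site is one simple way to filter out strings that are frequent only because a single site repeats them; the thresholds would need tuning on real data.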