Research on Chinese Keywords Extraction Based on Characters Sequence Annotation

王昊,邓三鸿,苏新宁
DOI: https://doi.org/10.11925/infotech.1003-3513.2011.12.06
2012-01-01
Abstract:Based on the whole Chinese booklist of a certain university library as well as the analysis of its book indexing information, the paper summarizes the features and extracting laws of Chinese keywords, and establishes a Chinese key- words extraction model based on characters sequence annotation, which proposes the basic idea and implementation scheme for extracting keywords. It verifies the feasibility, rationality and practicality of the model by large - scale experiments, and basically solves the problems of Chinese keywords extraction without executing words segmentation, which shows that characters sequence annotation is better than words sequence annotation.
What problem does this paper attempt to address?