Research on Keywords Indexing for Chinese Bibliography Based on Word Roles Annotation

Deng Sanhong, Wang Hao, Qin Jiahang, Su Xinning
DOI: https://doi.org/10.13530/j.cnki.jlis.2012.02.007
2012-01-01
Abstract: Automatic indexing of Chinese bibliographic records has become one of the most pressing problems in digital library construction. This paper introduces the Conditional Random Fields (CRFs) algorithm into keyword extraction for Chinese bibliography and builds a model of book content based on word role annotation. The model converts book content into sequences of words. On that basis, the paper proposes an approach that combines the construction of a word role space model with comprehensive use of the context features of the word sequence. The paper also verifies the rationality and practicality of the model through an experiment that automatically extracts keywords from titles and abstracts.
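To make the idea of word role annotation over a word sequence more concrete, here is a minimal sketch in Python. It is not the authors' implementation: the three-role scheme (`B` for the first word of a keyword, `I` for a continuation, `O` for any other word), the function names, and the one-word context window are all assumptions chosen for illustration, and the abstract does not specify the actual role set or feature templates. The CRF training step itself is omitted; the feature dictionaries produced here are the kind of per-token input a CRF toolkit would typically consume.

```python
from typing import Dict, List


def annotate_roles(tokens: List[str], keywords: List[List[str]]) -> List[str]:
    """Tag each token with a hypothetical word role: B, I, or O."""
    roles = ["O"] * len(tokens)
    for kw in keywords:
        n = len(kw)
        if n == 0:
            continue
        for i in range(len(tokens) - n + 1):
            # Only label a span that matches the keyword and is still unlabeled.
            if tokens[i:i + n] == kw and all(r == "O" for r in roles[i:i + n]):
                roles[i] = "B"
                for j in range(i + 1, i + n):
                    roles[j] = "I"
    return roles


def context_features(tokens: List[str], i: int, window: int = 1) -> Dict[str, str]:
    """Build context features for token i from its neighbors in the sequence."""
    feats = {"w0": tokens[i]}
    for d in range(1, window + 1):
        feats[f"w-{d}"] = tokens[i - d] if i - d >= 0 else "<BOS>"
        feats[f"w+{d}"] = tokens[i + d] if i + d < len(tokens) else "<EOS>"
    return feats


def extract_keywords(tokens: List[str], roles: List[str]) -> List[List[str]]:
    """Decode a role sequence (e.g. one predicted by a CRF) back into keywords."""
    found: List[List[str]] = []
    current: List[str] = []
    for token, role in zip(tokens, roles):
        if role == "B":
            if current:
                found.append(current)
            current = [token]
        elif role == "I" and current:
            current.append(token)
        else:
            if current:
                found.append(current)
            current = []
    if current:
        found.append(current)
    return found


# Illustrative, already-segmented title with hand-picked keywords.
tokens = ["automatic", "indexing", "for", "Chinese", "bibliography"]
keywords = [["automatic", "indexing"], ["bibliography"]]
roles = annotate_roles(tokens, keywords)
features = [context_features(tokens, i) for i in range(len(tokens))]
decoded = extract_keywords(tokens, roles)
```

In a full pipeline under these assumptions, `features` and `roles` from many annotated titles and abstracts would serve as training pairs for a CRF, and `extract_keywords` would be applied to the roles the trained model predicts for unseen records.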