Mining Construction Rules of Chinese Keyphrase Based on Rough Set Theory

Yuan-chao LIU,Xiao-long WANG,Zhi-ming XU,Bing-quan LIU
DOI: https://doi.org/10.3321/j.issn:0372-2112.2007.02.041
2007-01-01
Tien Tzu Hsueh Pao/Acta Electronica Sinica
Abstract:Phrase conveys more information than word, and can better represent main topic of one article. Most of keywords we referred to are actually in form of phrases. The problem is that extraction of keyphrase lacks guidance of some general rules. By taking advantage of the ability of rough set theory on data generalization and knowledge reduction, the manually labeled keyphrase corpus which come from People's Daily was mined and some construction rules of Chinese keyphrase has been generated. These rules can be used for automatic keyword extraction, and can also help people manually label keyword. The experimental results are promising: the performance of keyword extraction improved greatly after importing these rules.
What problem does this paper attempt to address?