Discovering Chinese Concept-In-Corpus

Jian-Chao Chen,Qi-Lun Zheng,Zhao Li
DOI: https://doi.org/10.1109/icmlc.2008.4620835
2008-01-01
Abstract:Concept is the basic of knowledge. A concept consists of a connotation and an extension. The paper comes up with a concept of concept-in-corpus which is a special kind of formal concept, and presents a discovering algorithm called FCWFT (filtering concept-word based on feature-tree) which automatically mine the connotation and the extension for a Chinese concept-in-corpus from corpus in Chinese. Our work is the first one attempting to mine formal concepts from free texts in the area of natural language processing. We test the algorithm with a large scale corpus. The result is encouraging.
What problem does this paper attempt to address?