Construction of an Extensible Chinese Word Segmentation System

Jin Huaxing, Dai Xinyu, Chen Jiajun
DOI: https://doi.org/10.3321/j.issn:1002-8331.2005.23.053
2005-01-01
Abstract: This paper presents a way to construct a highly extensible Chinese word segmentation system and describes a flexible software framework for it. A practically useful word segmentation system typically combines diverse methods. The framework accommodates rule-based, statistics-based, and hybrid methods, as well as different approaches to unknown named entity recognition. The paper also explains how to implement such a framework.
Keywords: Chinese word segmentation, statistical method, extensible, framework
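The abstract gives no implementation details, so the following is only a minimal sketch of what a pluggable segmentation framework of this kind might look like. Every class and method name here (`Segmenter`, `PostProcessor`, `Pipeline`, `ForwardMaxMatch`, `MergeKnownNames`) is a hypothetical illustration, not the authors' design. Forward maximum matching stands in for "a rule-based method", and a toy name list stands in for unknown named entity recognition.

```python
from abc import ABC, abstractmethod
from typing import Iterable, List, Sequence, Set


class Segmenter(ABC):
    """Hypothetical plug-in interface: any segmentation method implements it."""

    @abstractmethod
    def segment(self, text: str) -> List[str]:
        ...


class PostProcessor(ABC):
    """Hypothetical hook for steps such as unknown named entity recognition."""

    @abstractmethod
    def process(self, words: List[str]) -> List[str]:
        ...


class ForwardMaxMatch(Segmenter):
    """Rule-based stand-in: greedy longest dictionary match, left to right."""

    def __init__(self, lexicon: Set[str], max_len: int = 4):
        self.lexicon = lexicon
        self.max_len = max_len

    def segment(self, text: str) -> List[str]:
        words, i = [], 0
        while i < len(text):
            # Try the longest candidate first; a single character always matches.
            for size in range(min(self.max_len, len(text) - i), 0, -1):
                piece = text[i:i + size]
                if size == 1 or piece in self.lexicon:
                    words.append(piece)
                    i += size
                    break
        return words


class MergeKnownNames(PostProcessor):
    """Toy recognizer: merges two adjacent tokens that form a listed name."""

    def __init__(self, names: Iterable[str]):
        self.names = set(names)

    def process(self, words: List[str]) -> List[str]:
        out, i = [], 0
        while i < len(words):
            if i + 1 < len(words) and words[i] + words[i + 1] in self.names:
                out.append(words[i] + words[i + 1])
                i += 2
            else:
                out.append(words[i])
                i += 1
        return out


class Pipeline:
    """Runs one segmenter, then any number of post-processors in order."""

    def __init__(self, segmenter: Segmenter,
                 post_processors: Sequence[PostProcessor] = ()):
        self.segmenter = segmenter
        self.post_processors = list(post_processors)

    def run(self, text: str) -> List[str]:
        words = self.segmenter.segment(text)
        for processor in self.post_processors:
            words = processor.process(words)
        return words


if __name__ == "__main__":
    pipeline = Pipeline(ForwardMaxMatch({"中文", "分词", "系统"}))
    print(pipeline.run("中文分词系统"))
```

Under these assumptions, a statistics-based or hybrid method would be added as another `Segmenter` subclass, and a different entity recognizer as another `PostProcessor`, without changing `Pipeline`.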