A Method Combining Rule-based and Statistics-based Approaches for Chinese Word Segmentation

赵伟,戴新宇,尹存燕,陈家骏
DOI: https://doi.org/10.3969/j.issn.1001-3695.2004.03.008
2004-01-01
Abstract:Chinese automatic word segmentation is a basic task in the area of Chinese NLP.After summarizing and analyzing current techniques used in Chinese word segmentation,this paper presents a new method for word segmentation which is based on a marked corpus base.The method combines rule-based and corpus-based statistical methods.
What problem does this paper attempt to address?