Statistical Approach Based on POS for Chinese Time Word Disambiguity

DAI Jian-ying,HE Zhong-shi
DOI: https://doi.org/10.3969/j.issn.1000-582x.2005.09.014
2005-01-01
Abstract:Segmentation Ambiguity is an important factor influencing accuracy of Chinese auto-segmentation system.Time words include expressions both indicating exact time positions and those scattering in a period of time.On the foundations of modern Chinese corpus processing principles and certain type time word segmentation ambiguity,this paper proposes a statistical language model and corresponding approach based on maximum likelihood to solve the ambiguous problem,and it reaches a 90% accuracy which shows the effectiveness of the algorithm.
What problem does this paper attempt to address?