A Practical Approach To Resolving Combination Ambiguity In Chinese Word Segmentation

Ying Qin,Suxiang Zhang,Xiaojie Wang
DOI: https://doi.org/10.1109/ICOSP.2006.345823
2006-01-01
Abstract:In Chinese word segmentation task, combination ambiguity is one of challenges not being well settled. The main obstacle exists in the detection of ambiguous words in given texts and their proper segmentations. This paper puts forward a practical approach to automatically collecting ambiguous words and disambiguating based on Maximum Entropy principle. The experimental result reveals the approach of automatic collection ambiguous words can detect combination ambiguity effectively avoiding arduous manual work. As to the disambiguation based on Maximum Entropy, we investigate new features grounded on prior and contextual knowledge and achieve promising result.
What problem does this paper attempt to address?