Closed-Set Chinese Word Segmentation Based On Convolutional Neural Network Model

Zhipeng Xie
DOI: https://doi.org/10.1007/978-3-319-69005-6_3
2017-01-01
Abstract:This paper proposes a neural model for closed-set Chinese word segmentation. The model follows the character-based approach which assigns a class label to each character, indicating its relative position within the word it belongs to. To do so, it first constructs shallow representations of characters by fusing unigram and bigram information in limited context window via an element-wise maximum operator, and then build up deep representations from wider contextual information with a deep convolutional network. Experimental results have shown that our method achieves better closed-set performance compared with several state-of-the-art systems.
What problem does this paper attempt to address?