Abstract: Semi-Markov conditional random fields (Semi-CRFs) have been applied successfully to many segmentation problems, including Chinese word segmentation (CWS). The advantage of a Semi-CRF lies in its inherent ability to exploit properties of whole segments rather than of individual sequence elements. Despite this theoretical advantage, Semi-CRF is still not the preferred choice for CWS because its computational complexity is quadratic in the sentence length. In this paper, we propose a simple yet effective framework that helps Semi-CRF achieve performance comparable to CRF-based models at similar computational complexity. Specifically, we first apply a bi-directional long short-term memory (BiLSTM) network at the character level to model context information, and then use a simple but effective fusion layer to represent segment information. In addition, to model arbitrarily long segments in linear time, we propose a new model named Semi-CRF-Relay. Because segments are modeled directly, word features are easy to incorporate, and CWS performance can be improved merely by adding publicly available pre-trained word embeddings. Experiments on four popular CWS datasets show the effectiveness of the proposed methods. The source code and pre-trained embeddings for this paper are available at https://github.com/fastnlp/fastNLP/.
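To make the complexity argument concrete, the following is a minimal sketch of segment-level (semi-Markov) Viterbi decoding. It is not the paper's implementation and does not reproduce Semi-CRF-Relay: the function `semi_markov_viterbi`, the toy lexicon, and the scoring function `toy_score` are hypothetical stand-ins for the learned segment scores that a BiLSTM and fusion layer would provide, and label transitions between segments are omitted for brevity. The inner loop visits at most `max_len` start positions per end position, so decoding costs O(n · max_len) score evaluations; allowing segments of any length (`max_len = n`) gives the quadratic cost mentioned above.

```python
from typing import Callable, List, Tuple


def semi_markov_viterbi(
    n: int,
    seg_score: Callable[[int, int], float],
    max_len: int,
) -> List[Tuple[int, int]]:
    """Return the highest-scoring segmentation of positions 0..n-1.

    Segments are half-open spans (start, end) with length <= max_len.
    This is a zero-order sketch: segment scores are simply summed and
    no transition scores between adjacent segments are used.
    """
    best = [float("-inf")] * (n + 1)  # best[i]: best score of a segmentation of the first i characters
    best[0] = 0.0
    back = [0] * (n + 1)  # back[i]: start of the last segment in that segmentation

    for end in range(1, n + 1):
        # At most max_len candidate segments end at each position.
        for start in range(max(0, end - max_len), end):
            cand = best[start] + seg_score(start, end)
            if cand > best[end]:
                best[end] = cand
                back[end] = start

    # Follow the back-pointers from the end of the sentence.
    segments: List[Tuple[int, int]] = []
    end = n
    while end > 0:
        start = back[end]
        segments.append((start, end))
        end = start
    segments.reverse()
    return segments


# Toy example with a hypothetical lexicon in place of learned segment scores.
sentence = "我爱北京天安门"
lexicon = {"我": 1.0, "爱": 1.0, "北京": 3.0, "天安门": 4.0}


def toy_score(start: int, end: int) -> float:
    word = sentence[start:end]
    if word in lexicon:
        return lexicon[word]
    # Unknown single characters are neutral; unknown longer spans are penalized.
    return 0.0 if len(word) == 1 else -1.0


spans = semi_markov_viterbi(len(sentence), toy_score, max_len=3)
words = [sentence[s:e] for s, e in spans]
print(words)
```

Capping `max_len` keeps decoding linear in the sentence length, but segments longer than the cap can no longer be produced directly, which is the limitation the abstract says Semi-CRF-Relay is designed to address.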

LM Enhanced BiRNN-CRF for Joint Chinese Word Segmentation and POS Tagging.

Bi-directional LSTM Recurrent Neural Network for Chinese Word Segmentation

Bidirectional LSTM-CRF Attention-based Model for Chinese Word Segmentation

Dilated Convolutional Neural Network with Joint Training for Chinese Word Segmentation

Deep Learning for Chinese Word Segmentation and POS Tagging.

A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging.

Chinese Word Segmentation Via BiLSTM+Semi-CRF with Relay Node

Joint n-gram Chinese language modeling with an application to Chinese word segmentation

State-of-the-art Chinese Word Segmentation with Bi-LSTMs

Recurrent Neural Word Segmentation with Tag Inference

Chinese Word Segmentation Method on the Basis of Bidirectional Long-Short Term Memory Model

Chinese Lexical Analysis with Deep Bi-GRU-CRF Network

A Deep Attention Network for Chinese Word Segment.

A BiLSTM-CRF Based Approach to Word Segmentation in Chinese

A joint method for Chinese word segmentation and part-of-speech labeling based on deep neural network

A Feature-Enriched Neural Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging

Long Short-Term Memory Neural Networks for Chinese Word Segmentation.

LSTM-CRF Neural Network with Gated Self Attention for Chinese NER

A Deep Convolutional Neural Model for Character-Based Chinese Word Segmentation

A Unified Tagging Solution: Bidirectional LSTM Recurrent Neural Network with Word Embedding.