Joint Chinese Word Segmentation and Named Entity Recognition Based on Max-Margin Markov Networks

QIAO Wei,SUN Maosong
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2010.05.028
2010-01-01
Abstract:Chinese word segmentation(CWS) and named entity recognition(NER) are often separately processed.Max-margin Markov networks(M3N) were used to construct a joint CWS and NER scheme in which joint training and testing are performed.Experiments on the SIGHAN_2005 dataset show that M3N-based word segmenters outperforms CRFs-based word segmenters by 0.3%-2.0%.Experiments on the SIGHAN_2005 and SIGHAN_2006 datasets show that the joint CWS and NER scheme benefits both the two tasks with a CWS improvement of 1.5%-5.5% and an NER improvement of 5.7%-7.9%.The influence of the feature template and decoding on the scheme is also discussed in this paper.
What problem does this paper attempt to address?