A joint method for Chinese word segmentation and part-of-speech labeling based on deep neural network

Lichi Yuan
DOI: https://doi.org/10.1007/s00500-022-07093-w
IF: 3.732
2022-04-21
Soft Computing
Abstract:Aiming at the sequential tasks of Chinese word segmentation and part-of-speech labeling, this paper proposes a parallel model for word segmentation and part-of-speech labeling that combines BERT model, bidirectional long-short memory model, and conditional random field model, Markov family model (MFM) or Tree Probability (TLP). In part-of-speech labeling combined with MFM or TLP, the part-of-speech of the current word is not only related to the part-of-speech of the previous word, but also related to the current word itself. The use of the joint method helps to use part-of-speech information to achieve word segmentation, and organically combining the two is beneficial to eliminate ambiguity and improve the accuracy of part-of-speech labeling or word segmentation tasks. Experimental data shows that the joint model for part-of-speech labeling and Chinese word segmentation proposed in this paper can significantly enhances the precision of Chinese word segmentation and the accuracy of part-of-speech labeling.
computer science, artificial intelligence, interdisciplinary applications
What problem does this paper attempt to address?