Word-Based Method for Chinese Part-of-Speech Via Parallel and Adversarial Network

Kaiyu Huang,Jingxiang Cao,Zhuang Liu,Degen Huang
DOI: https://doi.org/10.1049/cje.2020.00.411
IF: 1.019
2022-01-01
Chinese Journal of Electronics
Abstract:Chinese part-of-speech(POS)tagging is an essential task for Chinese downstream natural lan-guage processing tasks.The accuracy of the Chinese POS task will drop dramatically by word-based methods be-cause of the segmentation errors and the word sparsity.Also,there are several Chinese POS tagging sets with dif-ferent criteria.Some of them only have a small-scale an-notated corpus and are hard to train.To this end,we propose a modified word-based transformer neural net-work architecture.Meanwhile,we utilize an adversarial transfer learning method that splits the architecture into shared and private parts.This work directly improves the ability of the word-based model,instead of adopting a joint character-based method.Extensive experiments show that our method achieves state-of-the-art perform-ance on all datasets,and more importantly,our method improves performance effectively for the word-based Chinese sequence labeling task.
What problem does this paper attempt to address?