Joint Training and Decoding Using Virtual Nodes for Cascaded Segmentation and Tagging Tasks.

Xian Qian,Qi Zhang,Yaqian Zhou,Xuanjing Huang,Lide Wu
2010-01-01
Abstract:Many sequence labeling tasks in NLP require solving a cascade of segmentation and tagging subtasks, such as Chinese POS tagging, named entity recognition, and so on. Traditional pipeline approaches usually suffer from error propagation. Joint training/decoding in the cross-product state space could cause too many parameters and high inference complexity. In this paper, we present a novel method which integrates graph structures of two sub-tasks into one using virtual nodes, and performs joint training and decoding in the factorized state space. Experimental evaluations on CoNLL 2000 shallow parsing data set and Fourth SIGHAN Bakeoff CTB POS tagging data set demonstrate the superiority of our method over cross-product, pipeline and candidate reranking approaches.
What problem does this paper attempt to address?