Study on Architectures for Chinese POS Tagging and Parsing

Hailong Cao,Yujie Zhang,Hitoshi Isahara
2008-01-01
Research in Computing Science
Abstract:How to deal with part of speech (POS) tagging is a very important problem when we build a syntactic parsing system. We could preprocess the text with a POS tagger before perform parsing in a pipelined approach. Alterna- tively, we could perform POS tagging and parsing simultaneously in an inte- grated approach. Few, if any, comparisons have been made on such architecture issues for Chinese parsing. This paper presents an in-depth study on this prob- lem. According to comparison experiments, we find that integrated approach can make significantly better performance both on Chinese parsing and un- known words POS tagging than the pipelined approach. As for known words POS tagging, we find that the two approaches get similar tagging accuracy, but the tagging results of integrated approach do lead to much better parsing per- formance. We also analyze the reasons account for the performance difference.
What problem does this paper attempt to address?