Analysis of Parts-of-speech Correspondence Between DCC and GKB

Likun QIU, Hui ZHAO, Shiwen YU, Xuefeng ZHU
DOI: https://doi.org/10.3969/j.issn.1003-0077.2017.05.001
2017-01-01
Abstract: Part-of-speech annotation has attracted extensive attention in Chinese information processing, Chinese grammar studies, and Chinese lexicography. Multiple part-of-speech systems have been proposed, and they differ significantly from one another. So far, little research has systematically compared different large-scale part-of-speech annotations. Based on the part-of-speech annotations in the Dictionary of Contemporary Chinese (DCC) and the Grammatical Knowledge-Base Dictionary (GKB), this paper proposes a mapping algorithm that automatically detects part-of-speech differences between the two dictionaries. We then analyze the differences and draw two conclusions: 1) about 83.5% of the part-of-speech annotations are identical; and 2) all the differences can be attributed to three causes: part-of-speech shifting, different part-of-speech annotation standards, and different senses.
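The abstract does not spell out the mapping algorithm, so the following is only a minimal sketch of the general idea, under stated assumptions: each dictionary is modeled as a map from a word to its set of part-of-speech tags, one tagset is translated into the other through a hand-written table, and the words whose tag sets still disagree are reported. The function name `compare_pos`, the table `TAG_MAP`, the tag labels, and the toy entries are all hypothetical and are not taken from the paper or from either dictionary.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical translation table from one tagset to the other.
# The real DCC and GKB tagsets are not given in the abstract.
TAG_MAP: Dict[str, str] = {"noun": "n", "verb": "v", "adj": "a"}

# Made-up toy entries, for illustration only.
TOY_DCC: Dict[str, Set[str]] = {
    "yanjiu": {"verb", "noun"},
    "hong": {"adj"},
    "pao": {"verb"},
}
TOY_GKB: Dict[str, Set[str]] = {
    "yanjiu": {"v"},
    "hong": {"a"},
    "pao": {"v"},
    "de": {"u"},  # present in only one dictionary, so it is not compared
}


def compare_pos(
    dict_a: Dict[str, Set[str]],
    dict_b: Dict[str, Set[str]],
    tag_map: Dict[str, str],
) -> Tuple[List[str], Dict[str, Tuple[Set[str], Set[str]]], float]:
    """Compare the POS tag sets of words that occur in both dictionaries.

    Returns the words with identical tag sets, the words whose tag sets
    differ (with both sets), and the share of identical words.
    """
    shared = sorted(set(dict_a) & set(dict_b))
    identical: List[str] = []
    different: Dict[str, Tuple[Set[str], Set[str]]] = {}
    for word in shared:
        # Translate dictionary A's tags into dictionary B's tagset;
        # tags missing from the table are kept unchanged.
        mapped = {tag_map.get(tag, tag) for tag in dict_a[word]}
        if mapped == dict_b[word]:
            identical.append(word)
        else:
            different[word] = (mapped, dict_b[word])
    ratio = len(identical) / len(shared) if shared else 0.0
    return identical, different, ratio
```

On the toy data, `compare_pos(TOY_DCC, TOY_GKB, TAG_MAP)` flags only `yanjiu`, which carries an extra tag in one dictionary. Deciding whether such a mismatch reflects part-of-speech shifting, a difference in annotation standards, or a difference in senses is the analysis step the abstract describes, and this sketch does not attempt it.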