Construction of Chinese Abstract Meaning Representation Corpus with Concept-to-word Alignment

Bin LI,Yuan WEN,Li SONG,Lijun BU,Weiguang QU,Nianwen XUE
DOI: https://doi.org/10.3969/j.issn.1003-0077.2017.06.013
2017-01-01
Abstract:As a new sentence-level meaning representation,abstract meaning representation(AMR)uses a rooted a-cyclic directed graph to represent the meaning of a sentence.A large AMR bank has been constructed for English, but the concepts of an AMR graph are not aligned to the words in a sentence,which increases the difficulty in manu-al annotation as well as automatic parsing.This paper describes the construction of a Chinese AMR corpus,based on guidelines adapted from English for Chinese-specific properties.We also designs an efficient annotation frame-work that incorporates concept-to-word alignment,taking advantage of the morphology-poor nature of Chinese.We have annotated the AMRs of 6 923 sentences selected from the Chinese TreeBank,among which 48% of the sen-tences are graphs,1% of the sentences are cycles,and 32% have non-projective subtrees.We plan to publicly re-lease this data for linguistic and NLP research.
What problem does this paper attempt to address?