Research on Chinese Negation and Speculation: Corpus Annotation and Identification

Bowei Zou,Guodong Zhou,Qiaoming Zhu
DOI: https://doi.org/10.1007/s11704-015-5101-2
IF: 2.6688
2016-01-01
Frontiers of Computer Science
Abstract:Identifying negative or speculative narrative fragments from facts is crucial for deep understanding on natural language processing (NLP). In this paper, we firstly construct a Chinese corpus which consists of three sub-corpora from different resources. We also present a general framework for Chinese negation and speculation identification. In our method, first, we propose a feature-based sequence labeling model to detect the negative or speculative cues. In addition, a cross-lingual cue expansion strategy is proposed to increase the coverage in cue detection. On this basis, this paper presents a new syntactic structure-based framework to identify the linguistic scope of a negative or speculative cue, instead of the traditional chunking-based framework. Experimental results justify the usefulness of our Chinese corpus and the appropriateness of our syntactic structure-based framework which has showed significant improvement over the state-of-the-art on Chinese negation and speculation identification.
What problem does this paper attempt to address?