Building Chinese Sense Annotated Corpus with the Help of Software Tools

Yunfang Wu, Peng Jin, Tao Guo, Shiwen Yu
DOI: https://doi.org/10.3115/1642059.1642080
2007-01-01
Abstract: This paper presents the procedure for building a Chinese sense-annotated corpus. A set of software tools is designed to help human annotators speed up annotation and maintain consistency. The tools include 1) a tagger for word segmentation and POS tagging, 2) an annotation interface for describing senses in the lexicon and annotating senses in the corpus, 3) a checker for maintaining consistency, 4) a transformer for converting text files to XML format, and 5) a counter for calculating sense frequency distributions.
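
As a rough illustration of what the last two tools in the abstract (the transformer and the counter) might do, the following is a minimal sketch and not the authors' implementation. The input format (`word/POS`, with an optional `#sense` suffix on annotated words), the XML element names `corpus`, `s`, and `w`, and the function names are all assumptions made for this example; the abstract does not specify any of them.

```python
from collections import Counter
import xml.etree.ElementTree as ET


def parse_token(token):
    """Split a hypothetical 'word/POS' or 'word/POS#sense' token.

    Returns (word, pos, sense), where sense is None if the token is untagged.
    """
    body, _, sense = token.partition("#")
    word, _, pos = body.rpartition("/")
    return word, pos, sense or None


def to_xml(lines):
    """Convert whitespace-separated tagged lines into an XML tree (assumed schema)."""
    corpus = ET.Element("corpus")
    for i, line in enumerate(lines, start=1):
        sent = ET.SubElement(corpus, "s", id=str(i))
        for token in line.split():
            word, pos, sense = parse_token(token)
            attrs = {"pos": pos}
            if sense:
                attrs["sense"] = sense
            w = ET.SubElement(sent, "w", attrs)
            w.text = word
    return corpus


def sense_counts(corpus):
    """Count how often each (word, sense) pair occurs among sense-tagged words."""
    return Counter(
        (w.text, w.get("sense"))
        for w in corpus.iter("w")
        if w.get("sense") is not None
    )


# Toy input in the assumed format; the sense IDs "1" and "2" are made up.
lines = ["他/r 打/v#1 球/n", "我/r 打/v#2 电话/n", "他/r 打/v#1 人/n"]
corpus = to_xml(lines)
counts = sense_counts(corpus)
xml_text = ET.tostring(corpus, encoding="unicode")
```

Here `counts` maps each (word, sense) pair to its frequency, and `xml_text` holds the serialized corpus. A real pipeline would also need to handle malformed tokens and tie the sense IDs to entries in the lexicon.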