Method of Building BFS-CTC: a Chinese Tagged Corpus of Sentential Semantic Structure
LUO Sen-lin, LIU Ying-ying, FENG Yang, HAN Lei, CHEN Gong, WANG Qian
DOI: https://doi.org/10.3969/j.issn.1001-0645.2012.03.019
2012-01-01
Abstract: Based on modern Chinese semantics, a Chinese sentential semantic model is built, and a Chinese tagged corpus, BFS-CTC (Beijing Forest Studio-Chinese Tagged Corpus), is then constructed according to that model. The corpus contains more than ten thousand sentences covering six Chinese syntactic types. Self-developed tools make it quick and convenient to tag sentences. BFS-CTC provides lexical, syntactic, and sentential semantic structure tagging information, so it can be used for comparative analysis of syntax and semantics or for horizontal analysis. In addition, the corpus scales well and can be used to generate more targeted extended tagged banks.