Building Chinese Zero Corpus from Discourse Perspective

Chen SHENG, Fang KONG, Guodong ZHOU
DOI: https://doi.org/10.13209/j.0479-8023.2018.057
2019-01-01
Abstract: To better handle Chinese zero elements, this paper presents a theoretical analysis from a discourse perspective and constructs the Chinese Discourse Zero Corpus (CDZC). First, the necessity of building the corpus is examined through a review of existing theories and data resources. Then, a top-down, forward-search annotation strategy, combined with human-machine collaboration, is used to complete the corpus annotation. Finally, a detailed statistical analysis shows that CDZC can fully reflect the characteristics of the Chinese language and provide a corpus resource for related research.