A Study on the Recovery of Omitted Constituents in Chinese Elliptical Sentences

Han Yan, Yiran Zhao, Peipei Sun, Yanqiu Shao
DOI: https://doi.org/10.1007/978-3-031-28953-8_20
2023-01-01
Abstract: Rule-based processing of an elliptical sentence corpus and the construction of a dataset are the basis for various elliptical recovery tasks. We clarify the scope of Chinese elliptical sentences from a linguistic point of view, manually annotate the existing elliptical corpus, and construct a dataset for elliptical recovery. Moreover, we address two problems that arise in the manual annotation process: differing criteria for judging ellipsis and differing degrees of granularity in recovering elliptical components. We then study the criteria for recovering elliptical components of Chinese elliptical sentences, based on the needs of elliptical recovery tasks and on computational operability. The linguistic explanation provides a reference for the dataset’s expansion in the subsequent tasks of omission position detection and omission-referent disambiguation.