Is Japanese CCGBank empirically correct? A case study of passive and causative constructions

Daisuke Bekki,Hitomi Yanaka
DOI: https://doi.org/10.48550/arXiv.2302.14708
2023-03-01
Abstract:The Japanese CCGBank serves as training and evaluation data for developing Japanese CCG parsers. However, since it is automatically generated from the Kyoto Corpus, a dependency treebank, its linguistic validity still needs to be sufficiently verified. In this paper, we focus on the analysis of passive/causative constructions in the Japanese CCGBank and show that, together with the compositional semantics of ccg2lambda, a semantic parsing system, it yields empirically wrong predictions for the nested construction of passives and causatives.
Computation and Language
What problem does this paper attempt to address?