Investigating the content and form of referring expressions in Mandarin: introducing the Mtuna corpus

Kees van Deemter,Le Sun,R. Sybesma,Xiao Li,Bo Chen,Muyun Yang
DOI: https://doi.org/10.18653/v1/W17-3532
2017-09-01
Abstract:East Asian languages are thought to handle reference differently from languages such as English, particularly in terms of the marking of definiteness and number. We present the first Data-Text corpus for Referring Expressions in Mandarin, and we use this corpus to test some initial hypotheses inspired by the theoretical linguistics literature. Our findings suggest that function words deserve more attention in Referring Expressions Generation than they have so far received, and they have a bearing on the debate about whether different languages make different trade-offs between clarity and brevity.
What problem does this paper attempt to address?