Emergence of Syntax Needs Minimal Supervision

Raphaël Bailly, Kata Gábor
DOI: https://doi.org/10.48550/arXiv.2005.01119
2020-05-03
Abstract: This paper is a theoretical contribution to the debate on the learnability of syntax from a corpus without explicit syntax-specific guidance. Our approach originates in the observable structure of a corpus, which we use to define and isolate grammaticality (syntactic information) and meaning/pragmatics information. We describe the formal characteristics of an autonomous syntax and show that it becomes possible to search for syntax-based lexical categories with a simple optimization process, without any prior hypothesis on the form of the model.
Computation and Language