Abstract: This paper investigates the problem of cross-lingual transfer parsing, which aims to induce dependency parsers for low-resource languages using only training data from a resource-rich language (e.g., English). Existing model transfer approaches typically exclude lexical features because they are not transferable across languages. In this paper, we bridge the lexical feature gap by using distributed feature representations and their composition. We provide two algorithms for inducing cross-lingual distributed representations of words, which map vocabularies from two different languages into a common vector space. Consequently, both lexical and non-lexical features can be used in our model for cross-lingual transfer. Furthermore, our framework is flexible enough to incorporate additional useful features such as cross-lingual word clusters. Our combined contributions achieve an average relative error reduction of 10.9% in labeled attachment score compared with the delexicalized parser, when trained on the English universal treebank and transferred to three other languages. Our model also significantly outperforms state-of-the-art delexicalized models augmented with projected cluster features on identical data. Finally, we demonstrate that our models can be further improved with minimal supervision from the target languages (e.g., 100 annotated sentences), which is of great significance for practical use.
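The headline figure above is a relative error reduction in labeled attachment score (LAS). As a reading aid, the sketch below shows how such a figure is commonly computed: the gain in LAS is divided by the baseline's error (100 minus its LAS). The function name, the language keys, and all scores are hypothetical and chosen only for illustration; they are not results from the paper. The sketch also assumes that the average is taken over per-language reductions, which the abstract does not state.

```python
def relative_error_reduction(baseline_las: float, new_las: float) -> float:
    """Return the fraction of the baseline's attachment errors removed.

    Both scores are percentages, with the baseline strictly below 100.
    """
    baseline_error = 100.0 - baseline_las
    if baseline_error <= 0:
        raise ValueError("baseline LAS must be below 100")
    return (new_las - baseline_las) / baseline_error


# Hypothetical (baseline LAS, new LAS) pairs, for illustration only.
hypothetical_scores = {
    "lang_a": (50.0, 55.0),
    "lang_b": (60.0, 64.0),
    "lang_c": (40.0, 46.0),
}

# Assumption: the average is the mean of the per-language reductions.
per_language = {
    lang: relative_error_reduction(baseline, new)
    for lang, (baseline, new) in hypothetical_scores.items()
}
average_rer = sum(per_language.values()) / len(per_language)

for lang, rer in per_language.items():
    print(f"{lang}: {rer:.1%}")
print(f"average: {average_rer:.1%}")
```

Because the denominator is the remaining error rather than the score itself, the same absolute LAS gain counts for more when the baseline is already strong.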
