Bilingual Lexicon Induction for Low-Resource Languages using Graph Matching via Optimal Transport

Kelly Marchisio,Ali Saad-Eldin,Kevin Duh,Carey Priebe,Philipp Koehn
DOI: https://doi.org/10.48550/arXiv.2210.14378
2022-10-25
Computation and Language
Abstract:Bilingual lexicons form a critical component of various natural language processing applications, including unsupervised and semisupervised machine translation and crosslingual information retrieval. We improve bilingual lexicon induction performance across 40 language pairs with a graph-matching method based on optimal transport. The method is especially strong with low amounts of supervision.
What problem does this paper attempt to address?