Learning Bilingual Sentiment-Specific Word Embeddings without Cross-Lingual Supervision

Yanlin Feng,Xiaojun Wan
DOI: https://doi.org/10.18653/v1/n19-1040
2019-01-01
Abstract:Word embeddings learned in two languages can be mapped to a common space to produce BilingualWord Embeddings (BWE). Unsupervised BWE methods learn such a mapping without any parallel data. However, these methods are mainly evaluated on tasks of word translation or word similarity. We show that these methods fail to capture the sentiment information and do not perform well enough on cross-lingual sentiment analysis. In this work, we propose UBiSE (Unsupervised Bilingual Sentiment Embeddings), which learns sentiment-specific word representations for two languages in a common space without any cross-lingual supervision. Our method only requires a sentiment corpus in the source language and pretrained monolingual embeddings of both languages. We evaluate our method on three language pairs for cross-lingual sentiment analysis. Experimental results show that our method outperforms previous unsupervised BWE methods and even supervised BWE methods. Our method succeeds for a distant language pair English-Basque.
What problem does this paper attempt to address?