Cross-Lingual Sentiment Classification with Bilingual Document Representation Learning

Xinjie Zhou, Xiaojun Wan, Jianguo Xiao
DOI: https://doi.org/10.18653/v1/p16-1133
2016-01-01
Abstract: Cross-lingual sentiment classification aims to adapt sentiment resources from a resource-rich language to a resource-poor language. In this study, we propose a representation learning approach that simultaneously learns vector representations for texts in both the source and the target languages. Unlike previous research, which learns only bilingual word embeddings, our Bilingual Document Representation Learning model (BiDRL) directly learns document representations. Both semantic and sentiment correlations are used to map the bilingual texts into the same embedding space. The experiments are based on the multilingual multi-domain Amazon review dataset. We use English as the source language and Japanese, German, and French as the target languages. The experimental results show that BiDRL outperforms the state-of-the-art methods for all the target languages.