Cross Lingual Opinion Analysis Via Transfer Learning.

Jun Xu,Ruifeng Xu,Yuxin Ding,Xiaolong Wang,Chunyu Kit
2010-01-01
Abstract:This paper presents the first attempt to apply instance-level transfer learning technique to cross lingual opinion analysis by using the translation of annotated corpus from other languages as the supplementary training data for the opinion classifier for target language. Firstly, Transfer AdaBoost algorithm (TrAdaBoost) is applied as the base transfer learning strategy, which makes use of few labeled examples in target language to leverage the large annotated corpus in other languages and selects the translated annotated data with good confidence as supplementary training samples to improve the opinion classifier for target language. Considering that the re-weighting scheme adopted in TrAdaBoost has the potential risk of over-discarding of source training examples, this algorithm is further improved by combining the bagging procedure and the boosting procedure of TrAdaBoost, named Transfer Boosting with Bagging (TrBB). The proposed two algorithms are evaluated on document-level and sentence-level Chinese opinion analysis, respectively. The achieved encouraging performances show that the proposed transfer learning based approaches improve the opinion analysis effectively by exploiting small training data in target language and large cross lingual training data.
What problem does this paper attempt to address?