Recognizing Cross-Lingual Textual Entailment with Co-Training Using Similarity and Difference Views

Jiang Zhao,Man Lan,Zheng-Yu Niu,Donghong Ji
DOI: https://doi.org/10.1109/ijcnn.2014.6889713
2014-01-01
Abstract:Cross-lingual textual entailment is a relatively new problem that detects the entailment relationship between two text fragments written in different languages. Previous work adopted machine learning algorithms and similarity measures as features to address this task. In order to overcome the high cost of human annotation and further improve the recognition performance, we present a novel co-training approach to solve this problem. We first use an off-the-shelf machine translation tool to eliminate the language gap between two texts. Then we measure the similarities and differences between two texts and regard them as sufficient and redundant views. We use those two views to conduct the co-training procedure to perform classification. Besides, a new effective Kullback-Leibler (KL) based criterion is proposed to select the results from all possible iterations. Experiments on cross-lingual datasets provided by SemEval 2013 show that our method significantly outperforms the baseline systems and previous work.
What problem does this paper attempt to address?