An analysis method of cross-lingual literature similarity

Jiao LIU,Rongyi CUI,Yahui ZHAO,Zhenguo ZHANG
DOI: https://doi.org/10.3969/j.issn.1004-4353.2016.02.012
2016-01-01
Abstract:We analyse different language literatures with sentence alignment and propose a cross-lingual litera-tures’similarity method based on multilingual topic correlation model.In this paper,the data model for the collected different language literatures is firstly gained by term-document matrix,which is obtained by the process of words segmentation,the adjustment and selection of words segmentation results,and the weight calculation of feature words.And then,multilingual topic correlation semantic space is built.The three differ-ent language literatures are represented in the semantic space where each topic is made up of the three langua-ges.Similarity calculation of different language literatures is completed by their correlation topic in the seman-tic space.Experiment results show that the similarity of different language literaturescan be calculated directly in the semantic space,the accuracy can be reached 90%,which verify the effectiveness of our method in calcu-lating the similarity of cross-lingual literatures.
What problem does this paper attempt to address?