Cross-language vocabulary synonym prediction method and device and electronic equipment

Sun Maosong,Qi Fanchao,Lin Yankai,Zhu Hao,Xie Ruobing,Liu Zhiyuan
2019-01-01
Abstract: Embodiments of the invention provide a cross-language vocabulary synonym prediction method and device and electronic equipment. The method comprises the following steps: determining the loss functions for learning the source language word vectors and the target language word vectors, respectively; determining the loss functions for word vector alignment and for synonym information fusion, respectively; selecting a certain number of source language and target language word pairs with the same semantics, based on the monolingual corpora of the source language and the target language; optimizing each loss function based on these word pairs and an established synonym knowledge base in the source language, to obtain bilingual word vectors belonging to the same semantic space; and, based on the bilingual word vectors, searching for the labeled synonyms of the source language words whose vectors are close to the word vector of a target word in the target language, and carrying out synonym prediction for that target word. According to the embodiments of the invention, the existing synonym knowledge base can be put to good use for synonym prediction on cross-language vocabulary, which effectively saves the labor cost and time cost of synonym prediction.
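The final step of the abstract, looking up source language words near a target word in the shared vector space and reusing their labels, can be sketched as follows. This is a minimal illustration and not the patented implementation: the function names, the cosine-similarity scoring, the similarity-weighted voting, the parameters `k` and `top_n`, and the toy vectors and labels are all assumptions introduced here.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def predict_labels(target_vec, source_vecs, source_labels, k=2, top_n=2):
    """Predict labels for a target word from its k nearest source words.

    Assumption: each label is scored by the summed similarity of the
    nearest source words that carry it in the knowledge base.
    """
    # Rank source words by similarity to the target word vector.
    sims = {w: cosine(target_vec, vec) for w, vec in source_vecs.items()}
    nearest = sorted(sims, key=sims.get, reverse=True)[:k]

    # Accumulate similarity-weighted votes for each knowledge-base label.
    scores = defaultdict(float)
    for word in nearest:
        for label in source_labels.get(word, ()):
            scores[label] += sims[word]

    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda lab: (-scores[lab], lab))[:top_n]


# Toy bilingual vectors, assumed to already share one semantic space.
source_vecs = {"apple": [0.9, 0.1], "pear": [0.8, 0.2], "car": [0.1, 0.9]}
# Hypothetical knowledge-base entries for the source language words.
source_labels = {"apple": ["fruit"], "pear": ["fruit", "tree"], "car": ["vehicle"]}
# Vector of an unlabeled target language word.
target_vec = [0.85, 0.15]

predicted = predict_labels(target_vec, source_vecs, source_labels)
print(predicted)
```

In this toy setup the target vector lies closest to "apple" and "pear", so the labels attached to those two words dominate the prediction. The sketch assumes the joint optimization of the loss functions has already been carried out and covers only the nearest-neighbor lookup.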