SSF: Sentence Similar Function Based on Word2vector Similar Elements

Xinpan Yuan,Songlin Wang,Lanjun Wan,Chengyuan Zhang
DOI: https://doi.org/10.3745/jips.02.0124
2019-01-01
Journal of Information Processing Systems
Abstract:In this paper, to improve the accuracy of long sentence similarity calculation, we proposed a sentence similarity calculation method based on a system similarity function. The algorithm uses word2vector as the system elements to calculate the sentence similarity. The higher accuracy of our algorithm is derived from two characteristics: one is the negative effect of penalty item, and the other is that sentence similar function (SSF) based on word2vector similar elements doesn't satisfy the exchange rule. In later studies, we found the time complexity of our algorithm depends on the process of calculating similar elements, so we build an index of potentially similar elements when training the word vector process. Finally, the experimental results show that our algorithm has higher accuracy than the word mover's distance (WMD), and has the least query time of three calculation methods of SSF.
What problem does this paper attempt to address?