A Korean Sentence Similarity Calculation Method Based on Sub-Word Level Information

Yinqin Wang,Xiaodong Yan,Xiaoqing Xie,Run A
DOI: https://doi.org/10.1109/cac53003.2021.9728370
2021-01-01
Abstract:Sentence similarity is the key technology in text summary and machine translation. In this paper, a Korean sentence similarity calculation model based on local inference and difference analysis(Kor-Sim) is presented, Kor-Sim resolves the problem of insufficient capture of potential semantic information by traditional similarity calculation method.First, the Korean word vector can be obtained by using GlowVe, secondly the embedding vector can be effectively represented by BiLSTM, thirdly the semantic expression of the sentence is strengthened by local inference.Finally, by using difference analysis, the minute semantic differences between sentences can be captured by the interaction of sentences.The experimental results show that the Kor-Sim model can improve the accuracy of the similarity calculation in Korean sentences.
What problem does this paper attempt to address?