CHINESE SHORT SENTENCE SIMILARITY CALCULATION BASED ON TREE-STRUCTURE CORPUS
Fei Hongxiao,Mo Tianchi,Lin Qing,Yang Yanqun,Tan Yeqing,Yan Xingjun
DOI: https://doi.org/10.3969/j.issn.1000-386x.2013.08.005
2013-01-01
Abstract:In many fields,such as document summarisation,personalised searching,detection of academic integrity,FQA and automatic translation,the short sentence similarity calculation is the core algorithm.Through introducing the tree-structure corpus,we accurately define the similarity of words and calculate it,and make further improvement on the Chinese short sentence similarity algorithm based on keywords sequence extraction.Results of experiment show that this method achieves expected effect in improving the accuracy of Chinese short sentence similarity calculation,and is more in line with people’s intuitive sense.