A Short Text Description Similarity Computation Method for Chinese Entities

Tianyi QIN,Chan LIN,Boyu SONG,Yi GUAN
DOI: https://doi.org/10.3969/j.issn.2095-2163.2015.02.010
2015-01-01
Abstract:Short text description for Chinese entities has features of statistical sparsity,semantic discretization and irregular vocabulary. This research analyses the relationship between sememe network and word similarity in Hownet and presents a short text description similarity computation method that consists of semantic similarity part and short text classification part. In the semantic similarity part,the method weakens the influence of Hownet’s shallow sememes and balances weights of sememes. In the short text classification part,the method transforms short texts into sememe vectors and classifies them according to the distribution of sememes in certain fields. Take average results of those two parts to generate short text de-scription similarity. Effectiveness of the method is proved by task 1 of Baidu knowledge map analyzing competition.
What problem does this paper attempt to address?