Improvement of Text Similarity Computing Algorithm Based on Attribute

Zheng-wu YUAN,Yu-sen LI,Xue-ying ZHANG
DOI: https://doi.org/10.3969/j.issn.1000-3428.2009.17.002
2009-01-01
Abstract:Documents similarity computing with attribute barycenter coordinate model is a relatively new method,but the semantic information easily loss and is inefficient. For resolving these problems,an improved algorithm based on the attribute barycenter coordinate is presented. The method is inspired from the satisfying degree function in decision-making assessment theory. Matching the points between the intersection of query line and document complex and document barycenter using the new algorithm can keep the character of document vector within the result and improve the precision as well as efficiency. Experimental results show that the recall,precision and value of F1 of the model can increase 2%~4%.
What problem does this paper attempt to address?