Similarity algorithm of text based on semantic understanding

Bo JIN, Yan-jun SHI, Hong-fei TENG
DOI: https://doi.org/10.3321/j.issn:1000-8608.2005.02.028
2005-01-01
Abstract: Text similarity computation is widely used in several fields, such as copy detection and information retrieval. Building on work in text similarity computation and semantic understanding, textual similarity computation can be extended to paragraph similarity, and paragraph similarity can in turn be extended to article similarity. A new set of textual similarity algorithms (covering words, sentences, and paragraphs) is presented. The algorithm computes the similarity rate of two texts. Compared with other similarity computation methods, it improves the recall rate.
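The abstract does not give the paper's formulas, so the following is only a minimal sketch of the layered idea it describes: a similarity defined on words is lifted to sentences, then to paragraphs, then to articles. Everything in it is an assumption made for illustration: the `SYNONYMS` table stands in for whatever semantic resource the authors use, the 0.8 synonym score is arbitrary, and the symmetric best-match average in `lift` is a generic aggregation, not necessarily the authors' method.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")

# Hypothetical synonym groups standing in for a real semantic resource.
SYNONYMS = [{"car", "automobile"}, {"fast", "quick"}]


def word_sim(a: str, b: str) -> float:
    """Toy semantic word similarity: 1.0 if identical, 0.8 if synonyms, else 0.0."""
    if a == b:
        return 1.0
    if any(a in group and b in group for group in SYNONYMS):
        return 0.8  # assumed score for synonyms, chosen arbitrarily
    return 0.0


def lift(unit_sim: Callable[[T, T], float]) -> Callable[[Sequence[T], Sequence[T]], float]:
    """Turn a similarity on units into a similarity on sequences of units."""

    def seq_sim(xs: Sequence[T], ys: Sequence[T]) -> float:
        if not xs or not ys:
            return 0.0
        # Average best match of each unit in xs against ys, and vice versa.
        forward = sum(max(unit_sim(x, y) for y in ys) for x in xs) / len(xs)
        backward = sum(max(unit_sim(x, y) for x in xs) for y in ys) / len(ys)
        return (forward + backward) / 2

    return seq_sim


sentence_sim = lift(word_sim)        # a sentence is a list of words
paragraph_sim = lift(sentence_sim)   # a paragraph is a list of sentences
article_sim = lift(paragraph_sim)    # an article is a list of paragraphs

if __name__ == "__main__":
    s1 = ["the", "car", "is", "fast"]
    s2 = ["the", "automobile", "is", "quick"]
    print(sentence_sim(s1, s2))
    print(paragraph_sim([s1], [s2]))
```

Because each level reuses the same `lift` helper, a change to the word-level measure propagates to every higher level, which mirrors the word-to-sentence-to-paragraph extension the abstract outlines.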