A Similarity Search Algorithm for Text Based on Inverted-index

YANG Jianwu,CHEN Xiaoou
DOI: https://doi.org/10.3969/j.issn.1000-3428.2005.05.001
2005-01-01
Abstract:For the dimensions sparseness of the text set, a new similarity search algorithm for text set is proposed, which is based on inverted-index. The algorithm can quickly gain a super-set for the targets by search on inverted-index. Experiments show that the algorithm is faster than the algorithm based on multi-dimension index for huge text set, while a little nicetylosing.
What problem does this paper attempt to address?