Search Results Clustering Based on a Linear Weighting Method of Similarity

Dequan Zheng,Haibo Liu,Tiejun Zhao
DOI: https://doi.org/10.1109/IALP.2011.72
2011-01-01
Abstract:The cluster of search results can facilitate users in finding the needed from massive information. But the effect of the traditional text clustering has been verified not good enough. Lingo Algorithm, which adopts LSI for clustering, generates candidate labels first, then distributes the documents, and forms the clusters finally. On the basis of Lingo Algorithm, this paper presents a linear weighted method of Single-Pass improvement, which integrates HowNet semantic similarity and cosine similarity, fuses and rediscovers clusters, and extracting the cluster labels. The experiments have showed that our method it achieves a good results in clusters in the form of purity and F-measure.
What problem does this paper attempt to address?