Tag Clustering Algorithm Using Object-based Feature Vector

ZHOU Jin,CHEN Chao,YU Neng-hai
DOI: https://doi.org/10.3969/j.issn.1000-1220.2012.03.016
2012-01-01
Abstract:In the social tagging systems,it often uses data mining techniques,such as clustering,to remedy the problems of tag redundancy and ambiguity.The current tag clustering algorithms are mainly based on the tag co-occurrence in different items,but these algorithms′ clustering precision and recall are relatively low,which can only calculate the similarity between two tags.This paper proposes a new tag clustering algorithm,which introduces an object-based feature vector to characterize a single tag.This feature vector can represent a tag exactly and can get a more accurate similarity between two tags by using cosine similarity formula.K-Means algorithm is used to cluster the users′ tags.The experiment shows that the algorithm proposed in this paper can get a more accurate clustering result.
What problem does this paper attempt to address?