Abstract: Data clustering is an effective method for data analysis and pattern recognition that has been applied in many fields, such as image segmentation, machine learning and data mining. It is the process of splitting multidimensional data into several groups, or clusters, based on some similarity measure. A cluster is usually defined by a cluster center. In general, features carry different amounts of information and therefore contribute differently to the clustering. The most meaningful features play an important role in explaining the differences among samples and should thus be given more attention in the clustering process to obtain an accurate grouping. To reflect the particular contributions of the features, this paper proposes a new feature-weighted affinity propagation (AP) clustering algorithm. In this method, all the features are evaluated by a feature analysis method. The training samples are those that lie near the centers of each group in the affinity propagation clustering result. The similarity matrix of AP is then updated using the weighted features, and a new clustering result is obtained. The radius that determines the training samples is a very important parameter in our method. We study it by means of the sum of weighted distances between any two samples clustered in the same group according to the clustering result. To demonstrate our method, three public data sets from UCI were used. The experimental results on the three data sets showed the superiority of the feature-weighted AP method.
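The two quantities the abstract relies on, a similarity matrix built from weighted features and the sum of weighted distances within clusters, can be illustrated with a minimal sketch. The abstract does not give the paper's formulas, so the weighted squared Euclidean distance used here is an assumption. It follows the negative squared Euclidean similarity commonly used with AP. The function names, the `weights` vector and the `labels` list are hypothetical, and the feature analysis step that would produce the weights is not shown.

```python
from typing import List, Sequence


def weighted_sq_distance(a: Sequence[float], b: Sequence[float],
                         weights: Sequence[float]) -> float:
    """Weighted squared Euclidean distance (assumed form, not the paper's)."""
    return sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b))


def weighted_similarity(samples: Sequence[Sequence[float]],
                        weights: Sequence[float]) -> List[List[float]]:
    """Pairwise similarity: negative weighted squared distance.

    The diagonal (the AP "preference" of each sample) is left at 0.0 as a
    placeholder; the caller is expected to set it before running AP.
    """
    n = len(samples)
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for k in range(i + 1, n):
            s = -weighted_sq_distance(samples[i], samples[k], weights)
            sim[i][k] = sim[k][i] = s
    return sim


def within_cluster_weighted_distance(samples: Sequence[Sequence[float]],
                                     weights: Sequence[float],
                                     labels: Sequence[int]) -> float:
    """Sum of weighted distances over all pairs sharing a cluster label."""
    total = 0.0
    n = len(samples)
    for i in range(n):
        for k in range(i + 1, n):
            if labels[i] == labels[k]:
                total += weighted_sq_distance(samples[i], samples[k], weights)
    return total


# Toy data: the second feature is down-weighted to 0.25 (illustrative values).
samples = [[0.0, 0.0], [2.0, 4.0], [0.0, 2.0]]
weights = [1.0, 0.25]
labels = [0, 1, 0]  # hypothetical cluster assignment

sim = weighted_similarity(samples, weights)
score = within_cluster_weighted_distance(samples, weights, labels)
```

In this toy case the down-weighted second feature makes samples 0 and 2 much more similar than an unweighted distance would suggest. The within-cluster sum could then be compared across candidate radii, which is one plausible reading of how the abstract studies that parameter.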

Analyze Blog′s Simularity Passing the New Clustering Algorithm Called Increase K-Means

Improved Blog Clustering Through Automated Weighting of Text Blocks

A Comparison Study of Clustering Algorithms for Microblog Posts

Finding User Clusters in Sina Microblog

Blogger clustering by utilizing link information

Computer Image Content Retrieval Considering K-Means Clustering Algorithm

Finding Community Structure of Bayesian Networks by Improved K-Means Algorithm

An Improved Parallel K-means Algorithm Based on MapReduce

The Parallel Implementation and Application of an Improved K-means Algorithm

A New Similarity in Clustering Through Users' Interest and Social Relationship

An Improved K-means Algorithm for Document Clustering

An improved clustering algorithm for web document

A Kind of Improved Data Clustering Algorithm in Web Log Mining

A Particle Swarm Optimization K-Means Algorithm for Mongolian Elements Clustering

A Novel K'-Means Algorithm For Clustering Analysis

A New Feature Weighted Affinity Propagation Clustering Algorithm

Chinese Web Comments Clustering Analysis with a Two-phase Method

An Improved K-means Algorithm Based on Mapreduce and Grid

Improvement Study and Application Based on K-Means Clustering Algorithm

An improved black hole algorithm designed for K-means clustering method

Novel Hybrid Hierarchical-K-means Clustering Method (H-K-means) for Microarray Analysis