Affinity Propagation Clustering Algorithm Based on Large-Scale Data-Set

Limin Wang,Kaiyue Zheng,Xing Tao,Xuming Han
DOI: https://doi.org/10.1080/1206212x.2018.1425184
2018-01-01
International Journal of Computers and Applications
Abstract:Affinity Propagation (AP) algorithm is not effective in processing large-scale data-sets, so the paper purposed an affinity propagation clustering algorithm based on large scale data-set, called LD-AP. First, we use the idea of grid clustering to divide large data-sets into small datasets and running AP in them to ensure the center of clustering. Then, we introduced the structure similarity matrix to calculate the distance of the cluster center. At last, we used Density peak Clustering Algorithm (DP) algorithm to cluster the cluster again. The experimental results show that the improved algorithm is better than the original algorithm in the clustering effect and computation speed.
What problem does this paper attempt to address?