Parallel K-PSO Based on MapReduce

Junjun Wang,Dongfeng Yuan,Mingyan Jiang
DOI: https://doi.org/10.1109/icct.2012.6511380
2012-01-01
Abstract:K-means is widely used in scientific research and commercial applications because of its simplicity and linearity. However, in faced of ever-growing amount of data and higher demand of cluster analysis today, how to improve the performance of K-means has become challenging and significant. So an improved method called parallel K-PSO which combines Particle Swarm Optimization (PSO) with K-means based on MapReduce is proposed in this paper. Firstly, it takes advantage of PSO to improve the global search ability of K-means, and then it makes K-means parallel with MapReduce to enhance its capability of processing massive data. We evaluate the proposed method through experimental results.
What problem does this paper attempt to address?