Parallel CSA-FCM Clustering Algorithm Based on MapReduce

Chunchun Cui,Runtong Zhang
DOI: https://doi.org/10.2991/saeme-17.2017.115
2017-01-01
Abstract:Fuzzy C-Means (FCM) algorithm is a kind of widely used clustering algorithm, which is widely used in pattern recognition, image processing, medical research and other fields. But FCM doesn't have better performance suppressing noise. A parallel clustering algorithm based on MapReduce is proposed in this paper, which combines Clonal Selection Algorithm and the algorithm uses intelligent optimization method to optimize the initial clustering center, and makes use of the global search ability of CSA to make the algorithm more robust. The algorithm process is designed to conform to the MapReduce programming model and it has the ability of dealing with large-scale dataset. The experiments prove that parallel Clonal Selection Algorithm-Fuzzy C-Means (CSA-FCM) can improves the searching performance and the noise immunity and has high speed up and scalability.
What problem does this paper attempt to address?