A DISTRIBUTED CLUSTERING ALGORITHM BASED ON MICRO-CLUSTERING

He Qingsong,Wu Chengrong,Zeng Jianping
DOI: https://doi.org/10.3969/j.issn.1000-386X.2011.01.081
2011-01-01
Abstract:With the development of informatisation,the information data is distributed to different departments and every department has the need to fully cooperate with each other in condition of its own information not being leaked;on the other hand,the concentrated calculation cannot satisfy the requirement of different application due to huge amount of the information data.The distributed data mining becomes one of the research hot-points in above background.In this paper,by dividing the system into core-nodes and periphery-nodes,we conduct the hierarchical management and reduce system's burden brought by the communication of information.The definition of micro-clustering will be presented in the paper and the algorithm is described in the periphery-nodes.Experiment illuminate that our distributed algorithm has similar accuracy rate as that of the concentrated K-means algorithm in condition of assuring no leakage of every department's data.This has demonstrated the feasibility and validity of the algorithm.
What problem does this paper attempt to address?