RESEARCH OF LARGE SAMPLE DATA CLUSTERING METHOD BASED ON IMPROVED ISODATA ALGORITHM

ZHANG Li-na,JIANG Xin-hua,NA Ri-su
DOI: https://doi.org/10.16853/j.cnki.1009-3575.2013.01.027
2013-01-01
Abstract:How to extract effective feature data form the large sample,complex structures and dispersion data is the key and difficult of the pattern recognition,the ISODATA algorithm is one of the common algorithm of large samples data clustering.While,the inadequacies of the algorithm is need to pre-determine initial cluster parameters.The paper proposed to measure the effectiveness of clustering based on the golden section method,the method can dynamically calculate the clustering metrics,and achieve effective clustering of large sample data.The results show that the method can select the most representative and best characteristic features from the original large sample data.
What problem does this paper attempt to address?