An Improved K-Means Algorithm of High-Dimensional Data

Cheng, Hong Zhang
DOI: https://doi.org/10.4028/www.scientific.net/amr.926-930.2968
2014-01-01
Advanced Materials Research
Abstract: This paper summarizes the characteristics of high-dimensional data and the difficulties of clustering such data, points out the shortcomings of traditional clustering algorithms when applied to high-dimensional data, and proposes an improved K-means algorithm for high-dimensional data clustering. The algorithm has better scalability and high efficiency, and is suitable for handling large document sets.
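For reference, the baseline that the abstract sets out to improve can be sketched as below. This is standard Lloyd-style K-means and not the paper's improved algorithm, whose details the abstract does not give. The function names `squared_distance` and `kmeans`, the random choice of initial centroids, the squared Euclidean distance, and the `max_iter` and `seed` parameters are all illustrative assumptions.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iter=100, seed=0):
    """Baseline Lloyd-style K-means; returns (centroids, labels).

    Illustrative only: this is the textbook procedure, not the improved
    variant proposed in the paper.
    """
    rng = random.Random(seed)
    # Assumption: initialise centroids with k distinct input points.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [-1] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the run has converged
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                dim = len(members[0])
                centroids[j] = [
                    sum(m[d] for m in members) / len(members)
                    for d in range(dim)
                ]
    return centroids, labels


# Two well-separated groups of three points each.
points = [(0, 0, 0), (0, 1, 0), (1, 0, 0),
          (10, 10, 10), (10, 11, 10), (11, 10, 10)]
centroids, labels = kmeans(points, k=2)
print(labels)
```

Every iteration compares each point with each centroid across all dimensions, so the cost per pass grows with the number of points, the number of clusters, and the dimensionality. That cost is one plausible reason the plain procedure scales poorly to large, high-dimensional document sets.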