Analysis of Euclidean Distance and Manhattan Distance in the K-Means Algorithm for Variations Number of Centroid K

R Suwanda,Z Syahputra,E M Zamzami
DOI: https://doi.org/10.1088/1742-6596/1566/1/012058
2020-06-01
Journal of Physics: Conference Series
Abstract:Abstract K-Means is a clustering algorithm based on a partition where the data only entered into one K cluster, the algorithm determines the number group in the beginning and defines the K centroid. The initial determination of the cluster center is very influential on the results of the clustering process in determining the quality of grouping. Better clustering results are often obtained after several attempts. The manhattan distance matrix method has better performance than the euclidean distance method. The author making the result of conducted testing with variations in the number of centroids (K) with a value of 2,3,4,5,6,7,8,9 and the authors having conclusions where the number of centroids 3 and 4 have a better iteration of values than the number of centroids that increasingly high and low based on the iris dataset.
What problem does this paper attempt to address?