A New Validity Index Based on Intra-Cluster Variation and Inter-Cluster Overlap

BEN Sheng-lan,SU Guang-da
DOI: https://doi.org/10.16136/j.joel.2010.02.024
2010-01-01
Abstract:The determination of cluster number is still an open problem for fuzzy C-means clustering In this paper,a new validity index is proposed to evaluate partition and determine the optimal number of clusters for fuzzy clustering. In a good partition,the similarities of patterns in a cluster should be maxi-mized and the clusters should be well separated Intra-cluster variation and inter-cluster overlap are de-fined to measure the similarities within a cluster and the separation between clusters respectively. The validity index is defined based on the two measurements. Experimental results on four artifiaal datasets and two real datasets show the effeaiveness and robustness of the proposed validity index.
What problem does this paper attempt to address?