Hybrid Clustering Solutions Fusion Based on Gated Three-way Decision.

Kaixiang Yang,Yifan Shi,Zhiwen Yu,Zhijie Zhong,Jichao Bi,Mengzhi Wang
DOI: https://doi.org/10.1109/ijcnn54540.2023.10191633
2023-01-01
Abstract:Consensus clustering methods provide better performance by fusing multiple clustering solutions in terms of accuracy, robustness and stability. However, most current methods suffer from different challenges: i) the high-dimensional problem; ii) limitations of single clustering method; iii) the optimal number of clusters selecting for a certain validity measure; iv) redundant clustering candidate attributes. To overcome the above limitations, we propose a hybrid clustering solutions fusion method based on gated three-way decision (HCFG) for data analysis. By integrating multiple clustering solutions and executing information fusion, HCFG enjoys four properties: (1) multiple random subspace generation strategy is utilized to generate diverse low-dimensional subspaces effectively; (2) a fusion framework that considers characteristics of both the soft clustering and hard clustering methods is designed, in which potential boundary of feature attribute sets is explored; (3) the optimal number of clusters is set by utilizing multiple clustering validity indices; (4) clustering solutions is considered as attributes and a gated three-way decision method is proposed to adaptively conduct attribute reductions. Extensive comparative experiments on 24 real-world data sets demonstrates the effectiveness and superiority of HCFG. Moreover, nonparametric tests are conducted to compare HCFG with multiple consensus clustering methods.
What problem does this paper attempt to address?