Consistency-oriented clustering ensemble via data reconstruction

Hengshan Zhang, Yun Wang, Yanping Chen, Jiaze Sun
DOI: https://doi.org/10.1007/s10489-024-05654-0
IF: 5.3
2024-07-20
Applied Intelligence
Abstract: Applying different distance measures to the same dataset produces different clustering results, so choosing a distance measure is difficult when prior knowledge is lacking. To address this issue, a consistency-oriented clustering ensemble via data reconstruction is developed. The approach removes the need to select a specific distance measure and achieves higher consistency between the clustering ensemble and the base clusterings while maintaining superior clustering performance. First, base clusterings are generated by clustering with different distance measures, and a definition of consistency is introduced. The ensemble process then updates the weights of the base clusterings until they reach consistency. At the same time, a data reconstruction process is integrated into the ensemble process to guarantee a high convergence rate and efficient clustering. Finally, the clustering ensemble result is obtained by balancing the consistency measure against clustering performance, improving both. Experiments verify the effectiveness of the proposed method and provide guidance on parameter settings.
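The pipeline outlined in the abstract (base clusterings from several distance measures, weights updated toward consistency, a final ensemble labeling) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the consistency definition, the weight update rule, or the data reconstruction step, so the sketch substitutes a generic weighted co-association ensemble and uses the Rand index as a stand-in consistency measure. All names (`k_medoids`, `rand_index`, `consensus`, `ensemble`), the 0.5 threshold, and the toy data are assumptions made for illustration.

```python
from itertools import combinations

# Three distance measures; each one yields its own base clustering.
def euclidean(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5

def manhattan(p, q):
    return sum(abs(a - b) for a, b in zip(p, q))

def chebyshev(p, q):
    return max(abs(a - b) for a, b in zip(p, q))

def k_medoids(points, k, dist, max_iter=20):
    """Tiny deterministic k-medoids with farthest-first initialisation."""
    medoids = [0]
    while len(medoids) < k:
        # Next medoid: the point farthest from its nearest chosen medoid.
        medoids.append(max(
            range(len(points)),
            key=lambda i: min(dist(points[i], points[m]) for m in medoids)))
    labels = []
    for _ in range(max_iter):
        # Assign every point to its nearest medoid.
        labels = [min(range(k), key=lambda c: dist(p, points[medoids[c]]))
                  for p in points]
        new_medoids = []
        for c in range(k):
            # Keep the old medoid if a cluster happens to be empty.
            members = [i for i, l in enumerate(labels) if l == c] or [medoids[c]]
            new_medoids.append(min(
                members,
                key=lambda i: sum(dist(points[i], points[j]) for j in members)))
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return labels

def rand_index(a, b):
    """Fraction of point pairs on which two labelings agree (stand-in consistency)."""
    pairs = list(combinations(range(len(a)), 2))
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return agree / len(pairs)

def consensus(base, weights, threshold=0.5):
    """Merge points whose weighted co-association exceeds the threshold."""
    n = len(base[0])
    labels = list(range(n))  # every point starts in its own group
    for i, j in combinations(range(n), 2):
        co = sum(w for w, lab in zip(weights, base) if lab[i] == lab[j])
        if co > threshold:
            old, new = labels[j], labels[i]
            labels = [new if l == old else l for l in labels]
    return labels

def ensemble(base, rounds=5):
    """Alternate between building a consensus and reweighting base clusterings."""
    weights = [1 / len(base)] * len(base)
    result = consensus(base, weights)
    for _ in range(rounds):
        scores = [rand_index(result, lab) for lab in base]
        total = sum(scores) or 1.0  # guard against division by zero
        weights = [s / total for s in scores]
        result = consensus(base, weights)
    return result, weights

# Hypothetical toy data: two well-separated groups of three points.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
base = [k_medoids(points, 2, d) for d in (euclidean, manhattan, chebyshev)]
result, weights = ensemble(base)
consistency = sum(rand_index(result, lab) for lab in base) / len(base)
print(result, weights, consistency)
```

On well-separated data such as this toy set, the three distance measures agree, so the weights stay uniform. On ambiguous data the base clusterings may differ, and the reweighting then favours those closest to the consensus.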
computer science, artificial intelligence
What problem does this paper attempt to address?