Hashing-based Affinity Matrix for Dominant Set Clustering

Qihua Li,Xing Tian,Wing W. Y. Ng,Marcello Pelillo
DOI: https://doi.org/10.1016/j.neucom.2022.06.067
IF: 6
2022-01-01
Neurocomputing
Abstract:Dominant set clustering has been widely used to solve a variety of problems such as image segmentation, video analysis, and image retrieval. However, the key problem of it is the need of a given affinity matrix to provide similarity for every pair of data points. The affinity matrix is either user-given or computed using a pairwise computation, which is impractical for big data situations. Therefore, this work proposes a semi-supervised hashing-based affinity matrix computation method (HAM) for dominant set clustering (HAM-DSC) for cases of semi-supervised clustering where labels are available to a portion of data. The HAM computes affinity matrix efficiently based on Hamming distance between learned hash codes, which preserve semantic similarities effectively. To deal with unsupervised clustering problems, an unsupervised extension of the HAM-DSC is also given in this work. Experimental results on 7 real-world datasets show that the HAM-DSC yields an overall better performance together with lower time costs compared to dominant set clustering using other affinity matrix computation methods. (c) 2022 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?