On improvability of hash clustering data from different sources by bipartite graph

Jianxi Zhao,Xiaonan Wang,Qingrong Zou,Fangyuan Kang,Jingfu Peng,Fan Wang
DOI: https://doi.org/10.1007/s10044-022-01125-9
IF: 2.307
2022-12-07
Pattern Analysis and Applications
Abstract:Clustering is a long-standing challenging task in pattern recognition and computer vision. In recent years, with development of multimedia technologies and explosive growth of data, the access to data is various. Clustering data from different feature sources has attracted considerable attentions for the remarkable clustering performance due to exploiting complementary information from data of different features. However, in fact, existing work often suffers from heavy computational load that restricts their capacity for large-scale datasets and most existing methods about fusing multi-view data are not perfect enough, namely, the quality of the joint matrix learned from multiple sources is not high enough. In this paper, we propose an efficient and effective clustering approach which combines sparse subspace learning with a bipartite graph for binary/hash codes produced from data of different sources by a collaborative discrete representation learning model that is an efficient and effective data fusion and binary code learning method. The bipartite graph that owns low time complexity is constructed to extract local geometric structure information for improving clustering performance. Extensive experiments performed on four benchmark datasets validate the efficiency and effectiveness of the proposed approach in comparison with ten state-of-the-art methods.
computer science, artificial intelligence
What problem does this paper attempt to address?