Reliable Node Similarity Matrix Guided Contrastive Graph Clustering

Yunhui Liu, Xinyi Gao, Tieke He, Tao Zheng, Jianhua Zhao, Hongzhi Yin

2024-08-07

Abstract: Graph clustering, which involves the partitioning of nodes within a graph into disjoint clusters, holds significant importance for numerous subsequent applications. Recently, contrastive learning, known for utilizing supervisory information, has demonstrated encouraging results in deep graph clustering. This methodology facilitates the learning of favorable node representations for clustering by attracting positively correlated node pairs and distancing negatively correlated pairs within the representation space. Nevertheless, a significant limitation of existing methods is their inadequacy in thoroughly exploring node-wise similarity. For instance, some hypothesize that the node similarity matrix within the representation space is identical, ignoring the inherent semantic relationships among nodes. Given the fundamental role of instance similarity in clustering, our research investigates contrastive graph clustering from the perspective of the node similarity matrix. We argue that an ideal node similarity matrix within the representation space should accurately reflect the inherent semantic relationships among nodes, ensuring the preservation of semantic similarities in the learned representations. In response to this, we introduce a new framework, Reliable Node Similarity Matrix Guided Contrastive Graph Clustering (NS4GC), which estimates an approximately ideal node similarity matrix within the representation space to guide representation learning. Our method introduces node-neighbor alignment and semantic-aware sparsification, ensuring the node similarity matrix is both accurate and efficiently sparse. Comprehensive experiments conducted on 8 real-world datasets affirm the efficacy of learning the node similarity matrix and the superior performance of NS4GC.
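To make the notion of a node similarity matrix in the representation space concrete, the following is a minimal sketch that computes cosine similarities between node embeddings from two views. It is an illustration only, not the NS4GC implementation: the function names, the toy embeddings, and the use of cosine similarity are all assumptions that do not come from the abstract.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)

def similarity_matrix(view_a, view_b):
    """Cosine similarity between every node in view_a and every node in view_b."""
    a = [l2_normalize(v) for v in view_a]
    b = [l2_normalize(v) for v in view_b]
    return [[sum(x * y for x, y in zip(u, v)) for v in b] for u in a]

# Hypothetical toy embeddings: nodes 0 and 1 point in nearly the same
# direction (semantically close), node 2 points elsewhere.
view_a = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
view_b = [[1.0, 0.1], [0.8, 0.0], [0.1, 1.0]]

S = similarity_matrix(view_a, view_b)  # S[i][j]: node i (view A) vs node j (view B)
```

In this toy case the entry for nodes 0 and 1 is high while the entries involving node 2 are low. If the abstract's "identical" is read as an identity-matrix target (an interpretation, not something the abstract states), such a target would push the similarity between nodes 0 and 1 toward zero despite their closeness, which is the kind of mismatch the abstract criticizes.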

Machine Learning

What problem does this paper attempt to address?

The paper addresses how to make effective use of node-to-node similarity in graph clustering. Existing contrastive learning methods for graph clustering do not fully explore this similarity: some assume that the node similarity matrix within the representation space is identical, ignoring the inherent semantic relationships among nodes. As a result, the learned node representations do not preserve semantic similarity well. To address this, the authors propose a new framework, Reliable Node Similarity Matrix Guided Contrastive Graph Clustering (NS4GC), which estimates an approximately ideal node similarity matrix and uses it to guide representation learning. Node-neighbor alignment and semantic-aware sparsification keep the estimated matrix both accurate and efficiently sparse. Experiments on 8 real-world datasets support the value of learning the node similarity matrix and show superior clustering performance for NS4GC.
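The two ingredients named above can be pictured with a rough sketch. The summary gives no formulas, so everything here is an assumption for illustration: the alignment term is taken to be the mean of one minus similarity over linked node pairs, and sparsification is taken to be a simple threshold on off-diagonal entries. The names `neighbor_alignment_loss`, `sparsify`, and `threshold` are hypothetical and are not from the paper.

```python
def neighbor_alignment_loss(S, edges):
    """Mean of (1 - similarity) over linked node pairs (assumed form).

    Lower values mean that neighboring nodes have more similar representations.
    """
    if not edges:
        return 0.0
    return sum(1.0 - S[i][j] for i, j in edges) / len(edges)

def sparsify(S, threshold):
    """Zero out off-diagonal similarities below the threshold (assumed rule)."""
    n = len(S)
    return [
        [S[i][j] if (i == j or S[i][j] >= threshold) else 0.0 for j in range(n)]
        for i in range(n)
    ]

# Hypothetical similarity matrix for three nodes; nodes 0 and 1 are linked.
S = [
    [1.0, 0.9, 0.1],
    [0.9, 1.0, 0.2],
    [0.1, 0.2, 1.0],
]
edges = [(0, 1)]

loss = neighbor_alignment_loss(S, edges)  # small, since the linked pair is already similar
S_sparse = sparsify(S, threshold=0.5)     # keeps the (0, 1) entry, drops weak entries
```

Under these assumptions, the alignment term rewards agreement between linked nodes, while thresholding removes weak similarities so that the guiding matrix stays sparse. The actual NS4GC objectives may differ from both.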
