Abstract: Multi-view document clustering, which learns common representations from multiple views to achieve a consistent partition, has attracted a growing body of work. Although existing methods have demonstrated promising performance in various applications, they learn view representations without considering the goal of a consistent clustering partition. In this paper, we propose a Multi-view document Clustering model with Joint Contrastive learning (MCJC) to address this issue. Our model learns the view representations with a joint contrastive learning module that introduces a task-specific objective, so that it effectively achieves consistency in both the cluster-wise and feature-wise hidden spaces. Meanwhile, in the clustering module, we collect the view-level cluster agreement and the document-level clustering partition to refine the contrastive learning and obtain document assignments. As a result, the proposed model uses the joint contrastive module to learn clustering-friendly representations and multi-level clustering to achieve better clustering performance. Extensive experiments on real datasets demonstrate that our model achieves state-of-the-art clustering effectiveness.

Multi-view Document Clustering with Joint Contrastive Learning