Adaptive Structural Enhanced Representation Learning for Deep Document Clustering

Jingjing Xue, Ruizhang Huang, Ruina Bai, Yanping Chen, Yongbin Qin, Chuan Lin
DOI: https://doi.org/10.1007/s10489-024-05791-6
IF: 5.3
2024-01-01
Applied Intelligence
Abstract: Structural deep document clustering methods, which leverage both structural information and inherent data properties to learn document representations using deep neural networks for clustering, have recently garnered increased research interest. However, the structural information used in these methods is usually static and remains unchanged during the clustering process. This can negatively impact the clustering results if the initial structural information is inaccurate or noisy. In this paper, we present an adaptive structural enhanced representation learning network for document clustering. This network can adjust the structural information with the help of clustering partitions and consists of two components: an adaptive structure learner, which automatically evaluates and adjusts structural information at both the document and term levels to facilitate the learning of more effective structural information, and a structural enhanced representation learning network. The latter integrates this adjusted structural information to enhance text document representations while reducing noise, thereby improving the clustering results. The iterative process between clustering results and the adaptive structural enhanced representation learning network promotes mutual optimization, progressively enhancing model performance. Extensive experiments on various text document datasets demonstrate that the proposed method outperforms several state-of-the-art methods.
Figure: The overall framework of the adaptive structural enhanced representation learning network.
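The abstract does not specify how the structural information is adjusted. As a rough illustration of the general idea only (using clustering partitions to revise a document graph), the sketch below rescales edge weights according to whether two documents currently share a cluster. The function name `adjust_graph`, the `boost` and `damp` factors, and the rescaling rule itself are assumptions made for this example; they are not taken from the paper.

```python
def adjust_graph(adjacency, labels, boost=1.5, damp=0.5):
    """Rescale edge weights using the current cluster partition.

    Hypothetical rule for illustration: edges between documents in the
    same cluster are strengthened, edges across clusters are weakened.
    """
    n = len(adjacency)
    adjusted = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            weight = adjacency[i][j]
            if i == j or weight == 0:
                continue  # no self-loops; absent edges stay absent
            factor = boost if labels[i] == labels[j] else damp
            adjusted[i][j] = weight * factor
    return adjusted


# Toy document graph: document 0 is linked to documents 1 and 2.
adjacency = [
    [0.0, 1.0, 1.0],
    [1.0, 0.0, 0.0],
    [1.0, 0.0, 0.0],
]
labels = [0, 0, 1]  # current cluster assignment per document

adjusted = adjust_graph(adjacency, labels)
```

In an iterative scheme of the kind the abstract describes, a step like this would alternate with representation learning and re-clustering, so that each new partition informs the next version of the graph.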