Multi-view subspace text clustering

Maha Fraj,Mohamed Aymen Ben HajKacem,Nadia Essoussi
DOI: https://doi.org/10.1007/s10844-024-00897-2
2024-10-06
Journal of Intelligent Information Systems
Abstract:Text clustering has become an important challenge in artificial intelligence since several applications require to automatically organize documents into homogeneous topics. Given the availability of several text representation models, text documents can be organized through a multi-view text clustering approach. In this context, we propose a new subspace multi-view text clustering method (MVSTC). The proposed method offers a rich representation of text by integrating several models to detect different aspects of text such as syntactic, topic, and semantic features. MVSTC is capable of discovering latent correlations between documents by projecting the data onto a topological map. MVSTC seeks a subspace representation based on a low-rank and sparse representation to capture the global and local structure of multi-view textual data. Extensive experiments on real text data sets demonstrate that our method outperforms the existing multi-view clustering methods in terms of several evaluation metrics.
computer science, information systems, artificial intelligence
What problem does this paper attempt to address?