Interpretable Multi-View Clustering

Mudi Jiang, Lianyu Hu, Zengyou He, Zhikui Chen
2024-05-04
Abstract: Multi-view clustering has become a significant area of research, with numerous methods proposed over the past decades to enhance clustering accuracy. However, in many real-world applications, it is crucial to demonstrate a clear decision-making process, specifically, to explain why samples are assigned to particular clusters. There remains a notable gap in developing interpretable methods for clustering multi-view data. To fill this gap, we make the first attempt in this direction by introducing an interpretable multi-view clustering framework. Our method begins by extracting embedded features from each view and generating pseudo-labels to guide the initial construction of the decision tree. Subsequently, it iteratively optimizes the feature representation for each view while refining the interpretable decision tree. Experimental results on real datasets demonstrate that our method not only provides a transparent clustering process for multi-view data but also delivers performance comparable to state-of-the-art multi-view clustering methods. To the best of our knowledge, this is the first effort to design an interpretable clustering framework specifically for multi-view data, opening a new avenue in this field.
Machine Learning
What problem does this paper attempt to address?
The paper addresses the lack of interpretability in multi-view clustering (MVC). Existing multi-view clustering methods have substantially improved clustering accuracy, but they do not explain their results. In many practical applications, it is crucial to expose the decision-making process, i.e., to explain why a particular sample is assigned to a specific cluster.
To address this, the authors propose an interpretable multi-view clustering framework. The method first extracts embedded features from each view and generates pseudo-labels to guide the initial construction of a decision tree. It then iteratively optimizes the feature representation of each view together with the interpretable decision tree in a joint optimization framework. According to the summary, this improves clustering accuracy while making the clustering results more transparent and trustworthy.
Experimental results show that the proposed method matches the clustering quality of state-of-the-art multi-view clustering methods while also providing an interpretable clustering process for multi-view data. The method also significantly outperforms existing interpretable clustering methods designed for single-view data.
In summary, the main contributions of this paper are:
1. Proposing a new multi-view clustering algorithm that introduces interpretability into the multi-view clustering domain, opening up a new research direction.
2. Designing a joint optimization clustering framework that simultaneously improves embedded feature representations and the decision tree, thereby enhancing clustering accuracy and model interpretability.
3. Demonstrating through experimental results that the method maintains performance comparable to existing state-of-the-art multi-view clustering methods while offering higher interpretability.
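
The initialization stage summarized above (per-view embeddings, then pseudo-labels, then a decision tree) can be illustrated with a minimal sketch. This is not the authors' implementation: PCA stands in for the embedded-feature extractor, k-means stands in for the pseudo-label generator, the synthetic two-view data and the function names `make_two_view_data` and `pseudo_label_tree` are hypothetical, and the iterative joint optimization step is omitted entirely. It assumes NumPy and scikit-learn are available.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.tree import DecisionTreeClassifier, export_text


def make_two_view_data(n_per_cluster=50, seed=0):
    """Hypothetical toy data: 3 clusters observed through two views."""
    rng = np.random.default_rng(seed)
    centers_v1 = np.array([[0.0, 0.0, 0.0], [6.0, 6.0, 0.0], [0.0, 6.0, 6.0]])
    centers_v2 = np.array([[0.0, 0.0], [5.0, 0.0], [0.0, 5.0]])
    truth = np.repeat(np.arange(3), n_per_cluster)
    view1 = centers_v1[truth] + rng.normal(scale=0.5, size=(truth.size, 3))
    view2 = centers_v2[truth] + rng.normal(scale=0.5, size=(truth.size, 2))
    return [view1, view2], truth


def pseudo_label_tree(views, n_clusters, seed=0):
    """Embed each view, derive pseudo-labels, fit a small tree on raw features."""
    # Step 1: per-view embedding (PCA is an assumed stand-in, not the paper's extractor).
    embedded = [PCA(n_components=2, random_state=seed).fit_transform(v) for v in views]
    # Step 2: pseudo-labels from k-means on the concatenated embeddings (assumed choice).
    kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    pseudo = kmeans.fit_predict(np.hstack(embedded))
    # Step 3: a tree over the original features, with at most one leaf per cluster,
    # so every cluster assignment is explained by a short path of threshold tests.
    raw = np.hstack(views)
    tree = DecisionTreeClassifier(max_leaf_nodes=n_clusters, random_state=seed)
    tree.fit(raw, pseudo)
    return tree, pseudo, raw


views, truth = make_two_view_data()
tree, pseudo, raw = pseudo_label_tree(views, n_clusters=3)

# Name each raw feature by view and column so the printed rules are readable.
names = [f"view{v + 1}_f{j}" for v, x in enumerate(views) for j in range(x.shape[1])]
print(export_text(tree, feature_names=names))
print("agreement with pseudo-labels:", (tree.predict(raw) == pseudo).mean())
```

Capping the tree at one leaf per cluster keeps each explanation to a handful of feature thresholds, at the cost of possibly disagreeing with the pseudo-labels on harder data. In the framework described above, that disagreement is what the iterative refinement of representations and tree is meant to reduce.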