NeurCAM: Interpretable Neural Clustering via Additive Models

Nakul Upadhya,Eldan Cohen
2024-08-24
Abstract:Interpretable clustering algorithms aim to group similar data points while explaining the obtained groups to support knowledge discovery and pattern recognition tasks. While most approaches to interpretable clustering construct clusters using decision trees, the interpretability of trees often deteriorates on complex problems where large trees are required. In this work, we introduce the Neural Clustering Additive Model (NeurCAM), a novel approach to the interpretable clustering problem that leverages neural generalized additive models to provide fuzzy cluster membership with additive explanations of the obtained clusters. To promote sparsity in our model's explanations, we introduce selection gates that explicitly limit the number of features and pairwise interactions leveraged. Additionally, we demonstrate the capacity of our model to perform text clustering that considers the contextual representation of the texts while providing explanations for the obtained clusters based on uni- or bi-word terms. Extensive experiments show that NeurCAM achieves performance comparable to black-box methods on tabular datasets while remaining interpretable. Additionally, our approach significantly outperforms other interpretable clustering approaches when clustering on text data.
Machine Learning
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? The paper "NEURCAM: Neural Clustering Additive Model for Explainable Clustering" aims to address the issue of explainable clustering algorithms. Specifically, the authors propose a new method called the Neural Clustering Additive Model (NeurCAM), which achieves explainable clustering through Neural Generalized Additive Models. The main objectives of the paper are as follows: 1. **Enhancing the interpretability of clustering**: - Traditional decision tree methods suffer from a significant drop in interpretability as the tree size increases when dealing with complex problems. NeurCAM provides fuzzy cluster membership through Neural Generalized Additive Models and maintains model interpretability through additive explanations. 2. **Introducing a selection gate mechanism**: - To promote the sparsity of model explanations, NeurCAM introduces a selection gate mechanism that explicitly limits the number of features and pairwise interactions used by the model. This helps reduce model complexity, making it easier to understand and validate. 3. **Interpretability in text clustering**: - NeurCAM also demonstrates its applicability in text clustering, considering contextual representations of text and providing cluster explanations based on single-word or bi-word terms. 4. **Balancing performance and interpretability**: - Experimental results show that NeurCAM performs comparably to black-box methods on tabular datasets while maintaining high interpretability. In text clustering tasks, NeurCAM significantly outperforms other explainable clustering methods. ### Main contributions of the paper 1. **Proposing a new explainable clustering method**: - Utilizing Neural Generalized Additive Models (Neural GAMs) to provide fuzzy cluster membership and generate explanations in the original feature space. 2. **Introducing a selection gate mechanism**: - Through the selection gate mechanism, users can explicitly limit the number of features and pairwise interactions used by the model, encouraging the sparsity of explanations. 3. **Demonstrating effectiveness on various datasets**: - Extensive experiments prove the effectiveness of NeurCAM in creating high-quality clusters and demonstrate the interpretability it provides. 4. **Innovative application in text clustering**: - Using transformer-based embeddings, NeurCAM can provide explanations based on single-word and bi-word terms in text clustering tasks, considering the structure and contextual information of documents. ### Summary NeurCAM addresses the issue of declining interpretability in traditional explainable clustering methods for complex problems by combining Neural Generalized Additive Models and a selection gate mechanism. It achieves a good balance between performance and interpretability, performing well across various datasets, with significant advantages in text clustering tasks.