Improved Deep Embedding Clustering with Ensemble Learning

黄栋,王昌栋,赖剑煌,黄宇翔
DOI: https://doi.org/10.3778/j.issn.1673-9418.2007010
2021-01-01
Abstract:Recently the rapid development of the deep learning technique has provided a powerful tool for the clustering research, and has given rise to quite a number of deep neural network-based clustering methods. Among these methods, deep embedding clustering (DEC) has been drawing increasing attention, due to its advantage in performing deep representation learning and optimizing clustering assignment simultaneously. However, one limita-tion to DEC lies in its sensitivity to the hyper-parameter λ, which often requires manual fine-tuning. To address this problem, this paper presents an improved deep embedding clustering method with ensemble learning (IDECEL). Instead of searching for a single optimal hyper-parameter, this paper makes use of a set of diversified hyper-parameters λ to construct an ensemble of diversified base clusterings. By exploiting the concept of entropy, this paper evaluates the uncertainty of the clusters in these base clusterings and weights them accordingly. Further, this paper constructs a locally weighted bipartite graph between base clusters and data samples, and efficiently partitions it to obtain a better clustering result. Experimental results on multiple datasets show that the proposed IDECEL method not only alleviates the hyper-parameter sensitivity problem in DEC, but also exhibits more robust clustering performance than several other deep clustering and ensemble clustering methods.
What problem does this paper attempt to address?