Sparse Clustering Algorithm Based on Multi-Domain Dimensionality Reduction Autoencoder

Yu Kang,Erwei Liu,Kaichi Zou,Xiuyun Wang,Huaqing Zhang

DOI: https://doi.org/10.3390/math12101526

IF: 2.4

2024-05-15

Mathematics

Abstract:The key to high-dimensional clustering lies in discovering the intrinsic structures and patterns in data to provide valuable information. However, high-dimensional clustering faces enormous challenges such as dimensionality disaster, increased data sparsity, and reduced reliability of the clustering results. In order to address these issues, we propose a sparse clustering algorithm based on a multi-domain dimensionality reduction model. This method achieves high-dimensional clustering by integrating the sparse reconstruction process and sparse L1 regularization into a deep autoencoder model. A sparse reconstruction module is designed based on the L1 sparse reconstruction of features under different domains to reconstruct the data. The proposed method mainly contributes in two aspects. Firstly, the spatial and frequency domains are combined by taking into account the spatial distribution and frequency characteristics of the data to provide multiple perspectives and choices for data analysis and processing. Then, a neural network-based clustering model with sparsity is conducted by projecting data points onto multi-domains and implementing adaptive regularization penalty terms to the weight matrix. The experimental results demonstrate superior performance of the proposed method in handling clustering problems on high-dimensional datasets.

mathematics

What problem does this paper attempt to address?

This paper proposes a new method called Sparse Multiview Domain Regularized (SMDR) clustering algorithm to solve the high-dimensional data clustering problem. In high-dimensional data, clustering analysis faces challenges due to the curse of dimensionality, data sparsity, and reduced reliability of clustering results. To address these issues, the paper integrates sparse reconstruction process and sparse L1 regularization into deep autoencoder models, reconstructing data through L1 sparse reconstruction in different domains, and combining spatial and frequency domain information to provide a multi-perspective view for data analysis and processing. The main contributions of SMDR are as follows: 1. It combines spatial distribution and frequency characteristics, analyzing data through the joint spatial and frequency domain to provide a more comprehensive understanding of the data. 2. It implements adaptive regularization penalty on the weight matrix to perform data point projection on multiple domains, achieving a neural network-based sparse clustering model. Experimental results show that SMDR performs superiorly in solving the clustering problem of high-dimensional datasets. Compared with existing clustering methods such as TELL, SLRR, and LSC, SMDR demonstrates higher accuracy in evaluation metrics such as NMI, ARI, and ACC, which proves its effectiveness and robustness in handling high-dimensional data clustering.

Sparse Clustering Algorithm Based on Multi-Domain Dimensionality Reduction Autoencoder

Deep Clustering and Visualization for End-to-End High-Dimensional Data Analysis.

Matrix Factorization and Deep Autoencoder based Clustering Scheme for Large-scale UAV Networks

Unsupervised Anomaly Detection Based on Deep Autoencoding and Clustering

Deep subspace clustering to achieve jointly latent feature extraction and discriminative learning

Deep clustering based on embedded auto-encoder

Multi-view deep subspace clustering via level-by-level guided multi-level features learning

Semi-supervised Hierarchical Clustering Analysis for High Dimensional Data

Deep Embedding Clustering Based on Residual Autoencoder

Pseudo-supervised Deep Subspace Clustering

Auto-Encoder Based Dimensionality Reduction

Deep Spectral Clustering using Dual Autoencoder Network

DAC: Deep Autoencoder-based Clustering, a General Deep Learning Framework of Representation Learning

Deep Discriminative Latent Space for Clustering

Multi-dimensional weighted deep subspace clustering with feature classification

Multi-kernel Fuzzy Clustering Based on Auto-Encoder for Fmri Functional Network.

Exploring structural components in autoencoder-based data clustering

Residual encoder-decoder network for deep subspace clustering

Soft Subspace Fuzzy Clustering with Dimension Affinity Constraint

SCDRHA: A scRNA-Seq Data Dimensionality Reduction Algorithm Based on Hierarchical Autoencoder

Multi-Scale Deep Subspace Clustering With Discriminative Learning