Abstract:Feature selection techniques are widely being used as a preprocessing step to train machine learning algorithms to circumvent the curse of dimensionality, overfitting, and computation time challenges. Projection-based methods are frequently employed in feature selection, leveraging the extraction of linear relationships among features. The absence of nonlinear information extraction among features is notable in this context. While auto-encoder based techniques have recently gained traction for feature selection, their focus remains primarily on the encoding phase, as it is through this phase that the selected features are derived. The subtle point is that the performance of auto-encoder to obtain the most discriminative features is significantly affected by decoding phase. To address these challenges, in this paper, we proposed a novel feature selection based on auto-encoder to not only extracting nonlinear information among features but also decoding phase is regularized as well to enhance the performance of algorithm. In this study, we defined a new model of auto-encoder to preserve the topological information of reconstructed close to input data. To geometric structure of input data is preserved in projected space using Laplacian graph, and geometrical projected space is preserved in reconstructed space using a suitable term (abstract Laplacian graph of reconstructed data) in optimization problem. Preserving abstract Laplacian graph of reconstructed data close to Laplacian graph of input data affects the performance of feature selection and we experimentally showed this. Therefore, we show an effective approach to solve the objective of the corresponding problem. Since this approach can be mainly used for clustering aims, we conducted experiments on ten benchmark datasets and assessed our propped method based on clustering accuracy and normalized mutual information (NMI) metric. Our method obtained considerable superiority over recent state-of-the-art techniques in terms of NMI and accuracy.

Unsupervised feature selection using sparse manifold learning: Auto-encoder approach

Feature Selection and Multi-Kernel Learning for Sparse Representation on a Manifold

Sparse Graph Embedding Unsupervised Feature Selection.

Sparse and Flexible Projections for Unsupervised Feature Selection

Unsupervised Feature Selection Via Local Structure Learning and Sparse Learning

Unsupervised Feature Selection Algorithm Based on Sparse Representation

Unsupervised feature extraction by low-rank and sparsity preserving embedding

Autoencoder Inspired Unsupervised Feature Selection

Joint Feature Selection and Extraction with Sparse Unsupervised Projection

Sparse Representation Preserving for Unsupervised Feature Selection

G-Optimal Feature Selection with Laplacian regularization

Unsupervised feature selection method based on dual manifold learning and dual spatial latent representation

Robust Unsupervised Feature Selection by Nonnegative Sparse Subspace Learning.

Sparse Representation-Based Approach for Unsupervised Feature Selection

Sparse Feature Selection Via Fast Embedding Spectral Analysis

Unsupervised Feature Selection Based on Self-Representation Sparse Regression and Local Similarity Preserving

Semi-Supervised Feature Selection Via Sparse Rescaled Linear Square Regression.

Feature Selective Projection with Low-Rank Embedding and Dual Laplacian Regularization

Unsupervised Feature Selection by Nonnegative Sparsity Adaptive Subspace Learning

Feature selection based on non-negative spectral feature learning and adaptive rank constraint

Double-Structured Sparsity Guided Flexible Embedding Learning for Unsupervised Feature Selection