Abstract:<p>Feature selection is a widely used dimension reduction technique to select feature subsets because of its interpretability. Many methods have been proposed and achieved good results, in which the relationships between adjacent data points are mainly concerned. But the possible associations between data pairs that are not adjacent are always neglected. Different from previous methods, we propose a novel and very simple approach for unsupervised feature selection, named MMFS (Multi-step Markov Probability Relationship for Feature Selection). The idea is using multi-step Markov transition probability to describe the relation between any data pair. Two ways from the positive and negative viewpoints are employed respectively to keep the data structure after feature selection. From the positive viewpoint, the maximum transition probability that can be reached in a certain number of steps is used to describe the relation between two points. Then, the features which can keep the compact data structure are selected. From the viewpoint of negative, the minimum transition probability that can be reached in a certain number of steps is used to describe the relation between two points. On the contrary, the features that least maintain the loose data structure are selected. The two ways can also be combined. Thus three algorithms are proposed. Our main contributions are a novel feature section approach which uses multi-step transition probability to characterize the data structure, and three algorithms proposed from the positive and negative aspects for keeping data structure and select the features to preserve such structure. The performance of our approach is compared with the state-of-the-art methods on eight real-world data sets, and the experimental results show that the proposed MMFS is effective in unsupervised feature selection.</p>

Supervised feature selection method via potential value estimation

U^2F^2S^2 : Uncovering Feature-level Similarities for Unsupervised Feature Selection

A Feature Selection Method Based on Feature Grouping and Genetic Algorithm

Feature Selection Via Scaling Factor Integrated Multi-Class Support Vector Machines

Unsupervised feature selection via multi-step markov probability relationship

Feature Selection Based on Data Clustering

An information-theoretic feature selection method based on estimation of Markov blanket

Neurodynamics-driven supervised feature selection.

Unsupervised feature selection via discrete spectral clustering and feature weights

Feature Selection Approach Based on Improved Fuzzy C-Means with Principle of Refined Justifiable Granularity

A fusion of centrality and correlation for feature selection

Unsupervised feature selection by learning exponential weights

A Feature Selection Method Using Conditional Correlation Dispersion and Redundancy Analysis

An evolutionary feature selection method based on probability-based initialized particle swarm optimization

Fast feature selection for interval-valued data through kernel density estimation entropy

An Approximate Markov Blanket Feature Selection Algorithm

Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods

Unsupervised Feature Selection Algorithm Based on Dual Manifold Re-ranking

Heuristic feature selection method for clustering

Feature selection using feature ranking, correlation analysis and chaotic binary particle swarm optimization