Abstract:The optimal global feature subset cannot be found easily due to the high cost, and most swarm intelligence optimization-based feature selection methods are inefficient in handling high-dimensional data. In this study, a two-stage feature selection model based on fuzzy neighborhood rough sets (FNRS) and binary whale optimization algorithm (BWOA) is developed. First, to denote the fuzziness of samples for mixed data with symbolic and numerical features, fuzzy neighborhood similarity is presented to study the similarity matrix and fuzzy membership degree, and the lower and upper approximations can be developed to present new FNRS model. Fuzzy neighborhood-based uncertainty measures such as dependence degree, knowledge granularity, and entropy measures are studied. From the viewpoints of algebra and information, fuzzy knowledge granularity conditional entropy is presented to form a preselected feature reduction set in the first stage. Second, the cosine curve change is added to develop a new control factor, which slows down the convergence rate of BWOA in the early iteration to fully explore the global, and accelerates the convergence rate in the late iteration. Integrating dependence degree with fuzzy knowledge granularity conditional entropy, a new fitness function is designed for selecting an optimal feature subset in this second stage. Two strategies are fused to avoid BWOA falling into the local optimum: the population partition strategy with the adaptive neighborhood search radius to divide the whale population and the local interference strategy of the elite subgroup to adjust the whale position update. Finally, a two-stage feature selection algorithm is designed, where the Fisher score algorithm is employed to preliminarily delete those redundancy features of high-dimensional datasets. Experiments on six UCI datasets and five gene expression datasets show that our algorithm is valid compared to other related algorithms.

Performance Optimization of Fractal Dimension Based Feature Selection Algorithm

The practical method of fractal dimensionality reduction based on z-ordering technique

On Combining Fractal Dimension with GA for Feature Subset Selecting

Unsupervised dimensionality reduction based on fractal dimension and genetic algorithm

A Two Phases Unsupervised Sequential Forward Fractal Dimensionality Reduction Algorithm

Invariant optimal feature selection: A distance discriminant and feature ranking based solution

TSFNFS: two-stage-fuzzy-neighborhood feature selection with binary whale optimization algorithm

Fractal feature selection model for enhancing high-dimensional biological problems

A Lite Fireworks Algorithm with Fractal Dimension Constraint for Feature Selection

MODIFIED KERNEL-BASED NONLINEAR FEATURE EXTRACTION

A Modified Sequential Deep Floating Search Algorithm For Feature Selection

Fast Attribute Selection Algorithm Based on Fractal Dimension

Fractal Autoencoders for Feature Selection

FSDR: A Novel Deep Learning-based Feature Selection Algorithm for Pseudo Time-Series Data using Discrete Relaxation

Feature Selection with Discernibility and Independence Criteria

FF-Based Feature Selection for Improved Classification of Medical Data

Feature Selection Approach Based on Improved Fuzzy C-Means with Principle of Refined Justifiable Granularity

A More Efficient Branch and Bound Algorithm for Feature Selection

Performance Optimization of a Fuzzy Entropy based Feature Selection and Classification Framework

Discriminative feature selection with directional outliers correcting for data classification

A High Performance Algorithm For Text Feature Automatic Selection