Abstract:Existing feature selection methods easily neglect the distribution of data, and require most of the neighborhood radius in neighborhood rough sets (NRS) to be selected artificially. These limitations result in the misclassification of samples. To address these drawbacks, this paper presents a mixed measure-based feature selection method using the Fisher score and an NRS model. First, the variation coefficient of the features in different decision classes is defined to depict the dispersion degree of different features, based on which, the neighborhood class is described to develop a novel NRS model. The concepts of dependency degree, neighborhood knowledge granularity, and average neighborhood entropy are defined, and then a mixed measure combining the information and algebra views is proposed to measure the uncertainty in neighborhood decision systems. Second, the average correlation degree of the feature subset is computed to assess the redundancy of the reduced feature subset. By combining the classification accuracy of the selected features, the reduction rate of the classification result, and the average correlation degree of the reduced feature set, we can construct an adaptive neighborhood radius function to avoid the artificial selection of the optimal neighborhood radius. Then, an optimal feature subset can be obtained according to the internal and external significance of the features. Third, the variation coefficient of the samples in different decision classes in each feature is defined to compute the dispersion degree of the samples, and the average of all samples in each feature is added to the between-class scatter to eliminate the effect of the different measurement dimensions of the features; then, the Fisher score model is improved to eliminate the noise of the high-dimensional data. Finally, a heuristic feature selection algorithm with the Fisher score based on the new NRS model is designed to select an optimal feature subset. Experimental results applied to five low-dimensional UCI datasets and nine high-dimensional gene expression datasets showed that the developed algorithm is effective and can select an optimal reduced subset with high classification accuracy when compared with some of the latest algorithms.

Semi-supervised Filter Feature Selection Based on Natural Laplacian Score and Maximal Information Coefficient

Semi-supervised feature selection by minimum neighborhood redundancy and maximum neighborhood relevancy

Feature Selection with Conditional Mutual Information Considering Feature Interaction

Semi-supervised feature selection based on discernibility matrix and mutual information

Graph-Based Semi-supervised Feature Selection with Application to Automatic Spam Image Identification

Feature Selection with Attributes Clustering by Maximal Information Coefficient

Laplacian Score for Feature Selection.

Mixed Measure-Based Feature Selection Using the Fisher Score and Neighborhood Rough Sets

Locality Sensitive Semi-Supervised Feature Selection

Joint local structure preservation and redundancy minimization for unsupervised feature selection

A fusion of centrality and correlation for feature selection

MVMR-FS : Non-parametric feature selection algorithm based on Maximum inter-class Variation and Minimum Redundancy

Unsupervised Feature Analysis with Class Margin Optimization

Unsupervised Feature Selection Using Nonnegative Spectral Analysis.

A new filter feature selection algorithm for classification task by ensembling pearson correlation coefficient and mutual information

Feature Selection Via Scaling Factor Integrated Multi-Class Support Vector Machines

Feature selection for multi-label classification by maximizing full-dimensional conditional mutual information

Efficient Semi-Supervised Feature Selection with Noise Insensitive Trace Ratio Criterion

LLE Score: A New Filter-Based Unsupervised Feature Selection Method Based on Nonlinear Manifold Embedding and Its Application to Image Recognition

An Adaptive Feature Selection Method for Microarray Data Analysis

Feature Selection with Missing Labels Using Multilabel Fuzzy Neighborhood Rough Sets and Maximum Relevance Minimum Redundancy