Abstract:Abstract Feature selection is usually a separate procedure which can not beneﬁt from result of the data exploration. In this paper,we propose a unsupervised feature selection method which could reuse a speciﬁc data exploration result. Furthermore, ouralgorithm follows the idea of clustering attributes and combines two state-of-the-art data analyzing methods, that’s maximalinformation coeﬃcient and aﬃnity propagation. Classiﬁcation problems with diﬀerent classiﬁers were tested to validation ourmethod and others. Data experiments result exhibits our unsupervised algorithm is comparable with classical feature selectionmethods and even outperforms some supervised learning algorithms. Data simulation with one credit dataset of our own froma bank of China shows the capability of our method for real world application. Keywords: feature selection, feature clustering, maximal information coeﬃcient, aﬃnity propagation 1. IntroductionData mining shows powerful capability for automatically identifying valuable and potential information fromdata,solotsofareahavebeenproﬁtfromit,suchasexpertsystem,decisionsupportandﬁnancialforecast[1]. Dueto the widespread use of the Internet and the emergence of bioinformatics, the dimensionality of dataset becomelarger and larger. Datasets with hundreds and thousands of attributes may cause the “curse of dimensionality”problem. Furthermore, some of traditional classiﬁcation and clustering algorithms can not work properly. Oneof the most feasible technique to cope with this problem is feature reduction. Feature reduction refers to theresearch of methods which have the reduced dimensions present the original data[2]. In general point of view,there are two categories of feature reduction, namely feature selection(or variable selection), feature extraction(orfeature transform). The former one tries to construct a new feature space by transforming the original featurespace into lower dimensional ones such as PCA and LLE which have been given broad appeal[3]. However, thetransformation result from feature extraction is quite diﬃcult to interpret and explain. This drawback limits theuse of this kind of means in some area. The latter type, by contrast, does not make any transformation, but ﬁltersout some meaningless attributes from original data. In other words, this category of processes chooses a subsetfrom the original feature space.

Information Technology and Quantitative Management , ITQM 2013

Feature Selection with Attributes Clustering by Maximal Information Coefficient

U^2F^2S^2 : Uncovering Feature-level Similarities for Unsupervised Feature Selection

Feature Selection Using Hierarchical Feature Clustering

A New Supervised Feature Selection Method for Pattern Classification.

Feature Selection Based on Quality of Information.

Unsupervised feature selection for multi-cluster data

Feature Selection Based on Data Clustering

A fusion of centrality and correlation for feature selection

A Novel Unsupervised Feature Selection Method for Bioinformatics Data Sets Through Feature Clustering

Feature Selection: A Data Perspective

Unsupervised Feature Selection Based on Feature Relevance

Subspace Learning for Unsupervised Feature Selection Via Matrix Factorization.

A Cost-Sensitive Feature Selection Method for High-Dimensional Data

Unsupervised attribute reduction algorithm framework based on spectral clustering and attribute significance function

A New Unsupervised Feature Selection Algorithm Using Similarity-Based Feature Clustering.

A Supervised Feature Selection Algorithm through Minimum Spanning Tree Clustering

Unsupervised Feature Selection with Graph Learning Via Low-Rank Constraint

Robust Unsupervised Feature Selection Via Matrix Factorization

Effective Feature Selection Using Feature Vector Graph for Classification

Boosting feature selection using information metric for classification