Feature Selection Algorithm Based on Neighborhood Decision Distinguishing Rate

Wenzhi ZHU, Gangquan SI, Yanbin ZHANG
DOI: https://doi.org/10.7652/xjtuxb201302004
2013-01-01
Abstract: Current feature selection algorithms based on the neighborhood rough set (NRS) model cannot evaluate numerical datasets directly; a discretization procedure is needed to transform the datasets into discrete form, which inevitably loses useful decision information. To address this difficulty, a feature selection algorithm based on the neighborhood effective information rate is proposed. From the viewpoint of neighborhood granulation, the relation between decision discernibility and decision distribution is analyzed, and the neighborhood decision certainty (Nc) is defined to indicate the distinguishing capability of each individual neighborhood granule. The neighborhood decision distinguishing rate (NDDR) of a feature subset, which evaluates the ability of the subspace to approximate the decision space, is established from the sum of the Nc values of the information granules induced by the corresponding feature space. Nominal and numerical datasets can then be handled within the same feature selection framework. Simulations and an application show that the proposed algorithm outperforms other NRS-based algorithms.
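The abstract does not give the formulas for Nc or NDDR, so the following is only an illustrative sketch of the general idea, not the authors' definitions. It assumes numerical features, a Euclidean neighborhood of hypothetical radius `delta`, a stand-in Nc equal to the share of a granule that carries the same decision as its center sample, and an NDDR-like score equal to the sum of those values normalized by the number of samples. The greedy forward search and all function names are likewise assumptions made for illustration.

```python
from math import sqrt


def neighborhood(X, i, features, delta):
    """Indices of samples within distance delta of sample i on the chosen features."""
    def dist(a, b):
        return sqrt(sum((a[f] - b[f]) ** 2 for f in features))
    return [j for j in range(len(X)) if dist(X[i], X[j]) <= delta]


def decision_certainty(y, i, granule):
    """Assumed stand-in for Nc: share of the granule with the same decision as sample i."""
    same = sum(1 for j in granule if y[j] == y[i])
    return same / len(granule)  # granule always contains i, so it is never empty


def nddr(X, y, features, delta):
    """Assumed NDDR-like score: sum of per-granule certainties divided by sample count."""
    n = len(X)
    total = sum(
        decision_certainty(y, i, neighborhood(X, i, features, delta))
        for i in range(n)
    )
    return total / n


def greedy_select(X, y, delta, eps=1e-9):
    """Forward selection: add the feature that raises the score most, stop when gain <= eps."""
    remaining = set(range(len(X[0])))
    selected = []
    best = nddr(X, y, selected, delta)  # baseline: every sample in one granule
    while remaining:
        scored = [(nddr(X, y, selected + [f], delta), f) for f in sorted(remaining)]
        score, f = max(scored, key=lambda t: t[0])  # lowest index wins ties
        if score - best <= eps:
            break
        selected.append(f)
        remaining.remove(f)
        best = score
    return selected, best


if __name__ == "__main__":
    # Toy data: feature 0 separates the two decisions, feature 1 does not.
    X = [[0.1, 0.5], [0.2, 0.4], [0.9, 0.5], [0.8, 0.4]]
    y = [0, 0, 1, 1]
    print(greedy_select(X, y, delta=0.3))
```

Because the score is computed directly on neighborhoods in the original feature space, no discretization step is involved, which is the property the abstract emphasizes. Handling nominal features would require a different distance (for example a 0/1 mismatch), which this sketch leaves out.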