MMBDE: A Two-stage Hybrid Feature Selection Method from Microarray Data

Weidong Xie,Yuhuan Chi,Linjie Wang,Kun Yu,Wei Li
DOI: https://doi.org/10.1109/bibm52615.2021.9669496
2021-01-01
Abstract:The discovery of diagnostically significant genes from microarray data is essential for disease diagnosis and drug research. However, the difficulty of analyzing microarray data comes from its high dimensionality and small sample size. Feature selection can effectively remove irrelevant and redundant features, reduce data dimensionality, and improve the accuracy of classifiers. This paper proposes a two-stage hybrid feature selection method MMBDE based on the improved min-Redundancy and Max-Relevance (mRMR) and the improved Binary Differential Evolution (BDE) algorithm. The improved mRMR is used to reduce the feature dimensionality at a coarse-scale significantly. In contrast, the improved BDE is used to refine the feature dimensionality at fine-scale further and select the best features. The experimental results show that MMBDE successfully reduces the dimensionality of microarray gene expression data, obtains high classification accuracy, and extracts effective features closely related to diseases from microarray gene expression data. The relevant datasets and codes can be obtained from https://github.com/xwdshiwo/MMBDE.
What problem does this paper attempt to address?