Subspace Selection for Nonlinear Feature Extraction Based on Kernel Partial Least Squares

Hua-Long Bu,Guo-Zheng Lib,Xue-Qiang Zeng,Jing Xiaa
2008-01-01
Abstract:Feature extraction is one of the most widely used methods for finding a proper representation of data from their original features which is a fundamental problem in machine learning. Although kernel feature extraction methods can obtain nonlinear novel features for further classification and other tasks, there are still irrelevant and redundant features, so using feature selection to select the most discriminative and informative features for classification or data analysis is important, but there are few attentions to it until now. Here we propose a novel method which firstly uses Kernel Partial Least Squares as a nonlinear feature extraction method to get a basis set, and then uses the genetic algorithm, one of the most wildly used feature selection algorithms, to select the most discriminative features. The selected features form a subspace of the kernel space, where different state-of-the-art classification algorithms can be applied for classification. For experiment validation, we use two kinds of classifier: Support Vector Machines and the K Nearest Neighbor. Experimental results on three microarray datasets validate the effectiveness of our method.
What problem does this paper attempt to address?