Generalized Discriminant Analysis for Tumor Classification with Gene Expression Data

Wen-Hui Yang,Dao-Qing Dai,Hong Yan
DOI: https://doi.org/10.1109/icmlc.2006.259021
2006-01-01
Abstract:DNA microarray technology is the latest and the most advanced tool for parallel measuring of the activity and interactions of thousands of genes. The challenge is that the data dimension is large compared to the number of data points, which leads to small sample size (SSS) problem. Principal component analysis plus linear discriminant analysis (PCA+LDA) is a well-known technique to cope with this problem, however, it cannot completely solve the SSS problem. In this paper we propose two novel discriminant techniques. Experimental results on gene expression data sets demonstrate that our methods have good discriminating power and outperform the direct linear discriminant analysis, moreover they are more stable than the PCA+LDA approach
What problem does this paper attempt to address?