Research on Classifying Method Based on Linear Discriminant Analysis for Gene Expression Data

ZHANG Xu-dong, WANG Ya-dong, LI Xia, SU Xiao-hong
DOI: https://doi.org/10.3969/j.issn.1672-5565.2006.01.003
2006-01-01
Abstract: Gene expression data cannot be classified accurately by the Fisher method, because the number of variables (genes) far exceeds the number of samples. This paper proposes a new method that analyzes gene expression data using a series of modified Fisher discriminants, called Fisher_List. The distinctive idea of the algorithm is that every class of gene expression data has its own decision threshold. Each decision threshold combines information from all samples with information from a few individual samples that are especially important for classification. Experiments compare the new algorithm with LogitBoost, AdaBoost, the k-nearest-neighbor classifier, classification trees, and support vector machines. The results show that the new algorithm effectively improves classification accuracy on gene expression data and performs best among the methods compared.
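The abstract's starting point, that the Fisher method fails when genes far outnumber samples, can be illustrated with a minimal sketch. This is not the paper's Fisher_List algorithm; it only shows the standard reason the classical Fisher discriminant is ill-posed here: the pooled within-class scatter matrix has rank at most (samples minus classes), so it is singular and cannot be inverted. The data are synthetic, and the sizes (20 samples, 200 genes, 2 classes) and the function name `within_class_scatter` are assumptions made for illustration.

```python
import numpy as np


def within_class_scatter(X, y):
    """Pooled within-class scatter matrix S_w used by Fisher's discriminant."""
    n_features = X.shape[1]
    S_w = np.zeros((n_features, n_features))
    for label in np.unique(y):
        X_class = X[y == label]
        centered = X_class - X_class.mean(axis=0)  # remove the class mean
        S_w += centered.T @ centered
    return S_w


# Hypothetical sizes: far more genes than samples, as in expression data.
n_samples, n_genes, n_classes = 20, 200, 2

rng = np.random.default_rng(0)
X = rng.normal(size=(n_samples, n_genes))  # synthetic "expression" values
y = np.array([0] * 10 + [1] * 10)          # two balanced classes

S_w = within_class_scatter(X, y)
rank = np.linalg.matrix_rank(S_w)

# S_w is n_genes x n_genes, but its rank cannot exceed n_samples - n_classes,
# so it is singular and the classical Fisher direction S_w^{-1}(m1 - m0)
# is not defined without some modification or regularization.
print(f"S_w shape: {S_w.shape}, rank: {rank}")
```

Any method for this setting has to work around that singularity in some way, for example by selecting genes, regularizing the scatter matrix, or modifying the discriminant itself.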