PCA AND PLS FOR GASTRIC CANCER SUBTYPE CLASSIFICATION

Jian Li
2009-01-01
ACTA BIOPHYSICA SINICA
Abstract: Gastric cancer is one of the most common malignant tumors in the world. Medicine still has no uniform method for classifying it. Under the Lauren classification, a gastric cancer is either of the intestinal type or the diffuse type. Knowing the subtype is important for deciding how to treat the disease. Studying cancer through gene expression data is currently an active research topic and is expected to have a strong impact on the diagnosis and treatment of gastric cancer. Gene expression profiles generally have few samples and many dimensions, partly because the experiments are expensive. As a result, traditional classification methods often fail, and the dimensionality of the data should be reduced before classification. In this paper, the authors applied partial least squares (PLS) and principal component analysis (PCA) to the classification of gastric cancer. Two different gastric cancer data sets were used, and the classification results obtained with the two methods were compared with support vector machine (SVM) and k-nearest neighbor (KNN). The experiments showed that both PLS and PCA performed well as dimension reduction methods and that the resulting classification was also good. The paper also discusses the merits and demerits of the two methods.
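The workflow the abstract describes (reduce the dimension, then classify) can be illustrated with a minimal sketch. This is not the authors' code: the data are synthetic, every function name is hypothetical, only the first PCA or PLS component is kept, and a KNN classifier with leave-one-out evaluation is assumed as the downstream step (SVM is omitted for brevity). It is meant only to show that PCA picks a direction from the expression matrix alone, while PLS also uses the class labels.

```python
import math

def center(X):
    """Return column means and a mean-centered copy of X (a list of rows)."""
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    return means, [[row[j] - means[j] for j in range(p)] for row in X]

def unit(v):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v]

def project(Xc, w):
    """Scores Xc w: one value per sample."""
    return [sum(a * b for a, b in zip(row, w)) for row in Xc]

def xt_times(Xc, u):
    """Product Xc^T u: one value per feature."""
    return [sum(Xc[i][j] * u[i] for i in range(len(Xc)))
            for j in range(len(Xc[0]))]

def pca_direction(Xc, iters=200):
    """First principal axis via power iteration on Xc^T Xc (labels unused)."""
    w = unit([1.0] * len(Xc[0]))
    for _ in range(iters):
        w = unit(xt_times(Xc, project(Xc, w)))
    return w

def pls_direction(Xc, y):
    """First PLS weight vector: proportional to Xc^T (y - mean(y))."""
    y_mean = sum(y) / len(y)
    return unit(xt_times(Xc, [v - y_mean for v in y]))

def knn_predict(train_scores, train_labels, score, k=3):
    """Majority vote among the k training scores nearest to `score`."""
    nearest = sorted(zip(train_scores, train_labels),
                     key=lambda pair: abs(pair[0] - score))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > k else 0

def loo_accuracy(X, y, method):
    """Leave-one-out accuracy; the reduction is refit on each training fold."""
    hits = 0
    for i in range(len(X)):
        X_tr, y_tr = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        means, Xc = center(X_tr)
        w = pls_direction(Xc, y_tr) if method == "pls" else pca_direction(Xc)
        held_out = sum((a - m) * b for a, m, b in zip(X[i], means, w))
        hits += knn_predict(project(Xc, w), y_tr, held_out) == y[i]
    return hits / len(X)

def toy_data(n=12, p=40):
    """Synthetic stand-in: few samples, many features, 5 informative ones."""
    y = [i % 2 for i in range(n)]  # 0 and 1 are two hypothetical subtypes
    X = [[(3.0 * y[i] if j < 5 else 0.0) + math.sin(1.7 * i + 0.9 * j * j)
          for j in range(p)] for i in range(n)]
    return X, y

X, y = toy_data()
for method in ("pca", "pls"):
    print(method, loo_accuracy(X, y, method))
```

Refitting the projection inside each leave-one-out fold keeps the held-out sample from influencing the reduced space, which matters most when samples are as scarce as the abstract describes.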