Shrunken centroids regularized discriminant analysis as a promising strategy for metabolomics data exploration
Chen Chen,Zhi-Min Zhang,Mei-Lan Ouyang,Xinbo Liu,Lunzhao Yi,Yi-Zeng Liang,Chao-Ping Zhang
DOI: https://doi.org/10.1002/cem.2685
IF: 2.5
2014-10-30
Journal of Chemometrics
Abstract:Metabolomics datasets generated by modern analytical instruments tend to be increasingly complex. In this study, a recent method named shrunken centroids regularized discriminant analysis (SCRDA) has been introduced and applied in the exploration of metabolomics dataset. It is a supervised method for variable selection, discriminant analysis and biomarker screening. By regularizing the estimate of the within‐class covariance matrix, SCRDA can deal with the singularity issue of linear discriminant analysis. Then a shrinkage estimator is applied to perform variable selection. The method presented is illustrated through the simulated datasets and three complex metabolomics datasets. Commonly used orthogonal partial least squares discriminant analysis and two other similar statistical methods, penalized linear discriminant analysis and nearest shrunken centroids, are used for comparisons. The results illustrate that SCRDA has some desirable abilities in variable selection, classification and prediction. Moreover, the biomarkers identified by SCRDA are further demonstrated to be in accordance with the biochemical research. It has been proved that SCRDA can be applied as a promising strategy in metabolomics. Copyright © 2014 John Wiley & Sons, Ltd. We applied shrunken centroids regularized discriminant analysis in the exploration of metabolomics dataset. By regularizing the estimate of the within‐class covariance matrix, it can deal with the singularity issue in linear discriminant analysis and the shrinkage estimator can be applied to perform variable selection. Shrunken centroids regularized discriminant analysis has some desirable abilities in variable selection, classification, and prediction because it estimates the correlations between variables.
chemistry, analytical,instruments & instrumentation,mathematics, interdisciplinary applications,automation & control systems,computer science, artificial intelligence,statistics & probability