Power enhancement via multivariate outlier testing with gene expression arrays
Adam L. Asare, Zhong Gao, Vincent J. Carey, Richard Wang, Vicki Seyfert-Margolis
DOI: https://doi.org/10.1093/bioinformatics/btn591
IF: 5.8
2008-11-16
Bioinformatics
Abstract: MOTIVATION: As the use of microarrays in human studies continues to increase, stringent quality assurance is necessary to ensure accurate experimental interpretation. We present a formal approach for microarray quality assessment that is based on dimension reduction of established measures of signal and noise components of expression followed by parametric multivariate outlier testing. RESULTS: We applied our approach to several data resources. First, as a negative control, we found that the Affymetrix and Illumina contributions to MAQC data were free from outliers at a nominal outlier flagging rate of alpha = 0.01. Second, we created a tunable framework for artificially corrupting intensity data from the Affymetrix Latin Square spike-in experiment to allow investigation of sensitivity and specificity of quality assurance (QA) criteria. Third, we applied the procedure to 507 Affymetrix microarray GeneChips processed with RNA from human peripheral blood samples. We show that exclusion of arrays by this approach substantially increases inferential power, or the ability to detect differential expression, in large clinical studies. AVAILABILITY: http://bioconductor.org/packages/2.3/bioc/html/arrayMvout.html and http://bioconductor.org/packages/2.3/bioc/html/affyContam.html affyContam (credentials: readonly/readonly)
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
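
Illustrative sketch (not from the paper or from arrayMvout): the abstract describes dimension reduction of per-array QA measures followed by parametric multivariate outlier testing. The Python below shows one simple way such a pipeline could look, under stated assumptions. The three QA metrics and their values are hypothetical, the two-component reduction is done by power iteration, and the outlier rule is a plain chi-square cutoff on squared Mahalanobis distance, which is a simpler stand-in for whatever test the authors' package applies. The one "corrupted" array is simulated by shifting every metric by 30 nominal standard deviations.

```python
import math
import random


def standardize(rows):
    """Center each column and scale it to unit sample variance."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    sds = [math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / (n - 1))
           for j in range(p)]
    return [[(r[j] - means[j]) / sds[j] for j in range(p)] for r in rows]


def covariance(rows):
    """Sample covariance matrix of already-centered rows."""
    n, p = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(p)]
            for i in range(p)]


def leading_axes(cov, k, iters=500):
    """Approximate the top-k eigenvectors by power iteration with deflation."""
    p = len(cov)
    mat = [row[:] for row in cov]
    axes = []
    for _ in range(k):
        v = [float(j + 1) for j in range(p)]  # arbitrary non-symmetric start
        for _ in range(iters):
            w = [sum(mat[i][j] * v[j] for j in range(p)) for i in range(p)]
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]
        lam = sum(v[i] * mat[i][j] * v[j] for i in range(p) for j in range(p))
        axes.append(v)
        # Remove the component just found so the next pass finds the next axis.
        mat = [[mat[i][j] - lam * v[i] * v[j] for j in range(p)]
               for i in range(p)]
    return axes


def flag_outliers(rows, alpha=0.01):
    """Flag rows whose squared Mahalanobis distance in the 2-D score space
    exceeds the upper-alpha quantile of a chi-square with 2 df."""
    z = standardize(rows)
    axes = leading_axes(covariance(z), k=2)
    scores = [[sum(a[j] * r[j] for j in range(len(r))) for a in axes]
              for r in z]
    n = len(scores)
    # 2x2 sample covariance of the scores (scores are already centered).
    s11 = sum(a * a for a, _ in scores) / (n - 1)
    s22 = sum(b * b for _, b in scores) / (n - 1)
    s12 = sum(a * b for a, b in scores) / (n - 1)
    det = s11 * s22 - s12 * s12
    d2 = [(s22 * a * a - 2.0 * s12 * a * b + s11 * b * b) / det
          for a, b in scores]
    cutoff = -2.0 * math.log(alpha)  # closed form for chi-square with 2 df
    flagged = [i for i, d in enumerate(d2) if d > cutoff]
    return flagged, d2, cutoff


# Hypothetical QA metrics per array (names and values are illustrative only):
# average background, scale factor, and a spread-of-residuals measure.
rng = random.Random(0)
centers = [50.0, 1.0, 0.3]
spreads = [5.0, 0.1, 0.03]
arrays = [[rng.gauss(m, s) for m, s in zip(centers, spreads)]
          for _ in range(20)]
arrays.append([m + 30 * s for m, s in zip(centers, spreads)])  # corrupted array

flagged, d2, cutoff = flag_outliers(arrays, alpha=0.01)
print("cutoff:", round(cutoff, 3))
print("flagged array indices:", flagged)
```

Because the mean and covariance here are estimated from all arrays, including the corrupted one, several simultaneous outliers can mask each other; a robust or sequential test would handle that case better than this sketch does.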