Simultaneous Dimension Reduction and Adjustment for Confounding Variation

Zhixiang Lin, Can Yang, Ying Zhu, John Duchi, Yao Fu, Yong Wang, Bai Jiang, Mahdi Zamanighomi, Xuming Xu, Mingfeng Li, Nenad Sestan, Hongyu Zhao, Wing Hung Wong
DOI: https://doi.org/10.1073/pnas.1617317113
2016-01-01
Abstract: Dimension reduction methods are commonly applied to high-throughput biological datasets. However, the results can be hindered by confounding factors, either biological or technical in origin. In this study, we extend principal component analysis (PCA) to propose AC-PCA for simultaneous dimension reduction and adjustment for confounding (AC) variation. We show that AC-PCA can adjust for (i) variations across individual donors present in a human brain exon array dataset and (ii) variations of different species in a model organism ENCODE RNA sequencing dataset. Our approach is able to recover the anatomical structure of neocortical regions and to capture the shared variation among species during embryonic development. For gene selection purposes, we extend AC-PCA with sparsity constraints and propose and implement an efficient algorithm. The methods developed in this paper can also be applied to more general settings.