Jnmfma: a Joint Non-Negative Matrix Factorization Meta-Analysis of Transcriptomics Data

Hong-Qiang Wang,Chun-Hou Zheng,Xing-Ming Zhao
DOI: https://doi.org/10.1093/bioinformatics/btu679
IF: 5.8
2014-01-01
Bioinformatics
Abstract:MOTIVATION Tremendous amount of omics data being accumulated poses a pressing challenge of meta-analyzing the heterogeneous data for mining new biological knowledge. Most existing methods deal with each gene independently, thus often resulting in high false positive rates in detecting differentially expressed genes (DEG). To our knowledge, no or little effort has been devoted to methods that consider dependence structures underlying transcriptomics data for DEG identification in meta-analysis context. RESULTS This article proposes a new meta-analysis method for identification of DEGs based on joint non-negative matrix factorization (jNMFMA). We mathematically extend non-negative matrix factorization (NMF) to a joint version (jNMF), which is used to simultaneously decompose multiple transcriptomics data matrices into one common submatrix plus multiple individual submatrices. By the jNMF, the dependence structures underlying transcriptomics data can be interrogated and utilized, while the high-dimensional transcriptomics data are mapped into a low-dimensional space spanned by metagenes that represent hidden biological signals. jNMFMA finally identifies DEGs as genes that are associated with differentially expressed metagenes. The ability of extracting dependence structures makes jNMFMA more efficient and robust to identify DEGs in meta-analysis context. Furthermore, jNMFMA is also flexible to identify DEGs that are consistent among various types of omics data, e.g. gene expression and DNA methylation. Experimental results on both simulation data and real-world cancer data demonstrate the effectiveness of jNMFMA and its superior performance over other popular approaches. AVAILABILITY AND IMPLEMENTATION R code for jNMFMA is available for non-commercial use via http://micblab.iim.ac.cn/Download/. CONTACT hqwang@ustc.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
What problem does this paper attempt to address?