A Normalization-Free and Nonparametric Method Sharpens Large-Scale Transcriptome Analysis and Reveals Common Gene Alteration Patterns in Cancers
Qi-Gang Li,Yong-Han He,Huan Wu,Cui-Ping Yang,Shao-Yan Pu,Song-Qing Fan,Li-Ping Jiang,Qiu-Shuo Shen,Xiao-Xiong Wang,Xiao-Qiong Chen,Qin Yu,Ying Li,Chang Sun,Xiangting Wang,Jumin Zhou,Hai-Peng Li,Yong-Bin Chen,Qing-Peng Kong
DOI: https://doi.org/10.7150/thno.19425
IF: 11.6
2017-01-01
Theranostics
Abstract:Heterogeneity in transcriptional data hampers the identification of differentially expressed genes (DEGs) and understanding of cancer, essentially because current methods rely on cross-sample normalization and/or distribution assumption-both sensitive to heterogeneous values. Here, we developed a new method, Cross-Value Association Analysis (CVAA), which overcomes the limitation and is more robust to heterogeneous data than the other methods. Applying CVAA to a more complex pan-cancer dataset containing 5,540 transcriptomes discovered numerous new DEGs and many previously rarely explored pathways/processes; some of them were validated, both in vitro and in vivo, to be crucial in tumorigenesis, e.g., alcohol metabolism (ADH1B), chromosome remodeling (NCAPH) and complement system (Adipsin). Together, we present a sharper tool to navigate large-scale expression data and gain new mechanistic insights into tumorigenesis.