CancerSplicingQTL: a Database for Genome-Wide Identification of Splicing QTLs in Human Cancer

Jianbo Tian, Zhihua Wang, Shufang Mei, Nan Yang, Yang, Juntao Ke, Ying Zhu, Yajie Gong, Danyi Zou, Xiating Peng, Xiaoyang Wang, Hao Wan, Rong Zhong, Jiang Chang, Jing Gong, Leng Han, Xiaoping Miao
DOI: https://doi.org/10.1093/nar/gky954
IF: 14.9
2018-01-01
Nucleic Acids Research
Abstract: Alternative splicing (AS) is a widespread process that increases structural transcript variation and proteome diversity. Aberrant splicing patterns are frequently observed in cancer initiation, progression, prognosis and therapy. Increasing evidence has demonstrated that AS events can be modulated by genetic variants. Identifying splicing quantitative trait loci (sQTLs), genetic variants that affect AS events, is an important step toward fully understanding the contribution of genetic variants to disease development. However, no database has yet been developed to systematically analyze sQTLs across multiple cancer types. Using genotype data from The Cancer Genome Atlas and corresponding AS values calculated by TCGASpliceSeq, we developed a computational pipeline to identify sQTLs from 9 026 tumor samples in 33 cancer types. In total, we identified 4 599 598 sQTLs across all cancer types. We further performed survival analyses and identified 17 072 sQTLs associated with patient overall survival times. Furthermore, using genome-wide association study (GWAS) catalog data, we identified 1 180 132 sQTLs overlapping with known GWAS linkage disequilibrium regions. Finally, we constructed a user-friendly database, CancerSplicingQTL (http://www.cancersplicingqtl-hust.com/), for users to conveniently browse, search and download data of interest. This database provides an informative sQTL resource for further characterizing the potential functional roles of SNPs that control transcript isoforms in human cancer.
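To make the notion of an sQTL concrete, the following is a minimal sketch of a single variant-versus-splicing-event association test. It is not the authors' pipeline, whose details the abstract does not give. It assumes that genotypes are coded as minor-allele dosages (0, 1, 2) and that the AS value is a percent-spliced-in (PSI) fraction between 0 and 1; the function names `sqtl_slope` and `permutation_p_value` and the toy data are hypothetical, and a permutation test stands in for whatever significance test the pipeline actually uses. A real analysis would also adjust for covariates such as population structure, which this sketch omits.

```python
import random


def sqtl_slope(dosages, psi):
    """Least-squares slope of PSI on genotype dosage (change in PSI per allele)."""
    n = len(dosages)
    if n != len(psi) or n < 3:
        raise ValueError("need at least 3 paired samples")
    mean_g = sum(dosages) / n
    mean_p = sum(psi) / n
    sxx = sum((g - mean_g) ** 2 for g in dosages)
    if sxx == 0:
        # No genotype variation in these samples, so the slope is undefined.
        raise ValueError("variant is monomorphic in these samples")
    sxy = sum((g - mean_g) * (p - mean_p) for g, p in zip(dosages, psi))
    return sxy / sxx


def permutation_p_value(dosages, psi, n_perm=1000, seed=0):
    """Two-sided empirical p-value obtained by shuffling PSI across samples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    observed = abs(sqtl_slope(dosages, psi))
    shuffled = list(psi)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(sqtl_slope(dosages, shuffled)) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)


# Toy data (hypothetical): nine tumor samples, three per genotype class.
dosages = [0, 0, 0, 1, 1, 1, 2, 2, 2]
psi = [0.10, 0.12, 0.11, 0.30, 0.28, 0.33, 0.52, 0.49, 0.55]

slope = sqtl_slope(dosages, psi)
p_value = permutation_p_value(dosages, psi)
print(f"slope per allele: {slope:.3f}, permutation p-value: {p_value:.4f}")
```

In a genome-wide setting, a test of this kind would be repeated for every variant and AS event pair, so the resulting p-values would need multiple-testing correction before any pair is called an sQTL.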