The impact of common variants on gene expression in the human brain: from RNA to protein to schizophrenia risk
Qiuman Liang,Yi Jiang,Annie W. Shieh,Dan Zhou,Rui Chen,Feiran Wang,Meng Xu,Mingming Niu,Xusheng Wang,Dalila Pinto,Yue Wang,Lijun Cheng,Ramu Vadukapuram,Chunling Zhang,Kay Grennan,Gina Giase,The PsychENCODE Consortium,Kevin P. White,Junming Peng,Bingshan Li,Chunyu Liu,Chao Chen,Sidney H. Wang
DOI: https://doi.org/10.1101/2023.06.04.543603
2024-12-24
Abstract:: The impact of genetic variants on gene expression has been intensely studied at the transcription level, yielding invaluable insights into the association between genes and the risk of complex disorders, such as schizophrenia (SCZ). However, the downstream impact of these variants and the molecular mechanisms connecting transcription variation to disease risk are not well understood. : We quantitated ribosome occupancy in prefrontal cortex samples of the BrainGVEX cohort. Together with transcriptomics and proteomics data from the same cohort, we performed cis-Quantitative Trait Locus (QTL) mapping and identified 3,253 expression QTLs (eQTLs), 1,344 ribosome occupancy QTLs (rQTLs), and 657 protein QTLs (pQTLs) out of 7,458 genes from 185 samples. Of the eQTLs identified, only 34% have their effects propagated to the protein level. Further analysis on the effect size of prefrontal cortex eQTLs identified from an independent dataset clearly replicated the post-transcriptional attenuation of eQTL effects. We identified omics-specific QTLs and investigated their potential in driving disease risks. Using a variant based approach, we found expression-specific QTLs (esQTLs) for 1,553 genes, ribosome-occupancy-specific QTLs (rsQTLs) for 155 genes, and protein-specific QTLs (psQTLs) for 161 genes. Among these omics-specific QTL, 38 showed strong colocalization with brain associated disorder GWAS signals, 29 of them are esQTLs. Because a gene could contain multiple QTL signals, each could either be shared across omics or omics-specific, we aggregated QTL signals from each omics for each gene and found 11 brain associated disorder risk genes that are driven predominantly by omics-specific QTL, all of them are driven by variants impacting transcriptional regulation. This gene-based approach also enabled us to categorize risk genes containing both omics-specific and shared QTL signals. The limited number of GWAS colocalization discoveries from gene-based omics-specific mapping, however, prompted us to take a complementary approach to investigate the functional relevance of genes driven predominantly by attenuated eQTL signals. Using S-PrediXcan we identified 74 SCZ risk genes across the three omics, 30% of which were novel, and 67% of these risk genes were confirmed to be causal in a MR-Egger test using data from the corresponding omics. Notably, 52 out of the 74 risk genes were identified using eQTL data and 68% of these SCZ-risk-gene-driving eQTLs show little to no evidence of driving corresponding variations at the protein level. : The effect of eQTLs on gene expression in the prefrontal cortex is commonly attenuated post-transcriptionally. Many of the attenuated eQTLs still correlate with GWAS signals of brain associated complex disorders, indicating the possibility that these eQTL variants drive disease risk through mechanisms other than regulating protein expression level. Further investigation is needed to elucidate the mechanistic link between attenuated eQTLs and brain associated complex disorders.
Biology