Overlap between COPD genetic association results and transcriptional quantitative trait loci

Aabida Saferali, Wonji Kim, Robert Chase, NHLBI TransOmics in Precision Medicine (TOPMed), Chris Vollmers, Edwin K Silverman, Michael Cho, Peter Castaldi, Craig P Hersh
DOI: https://doi.org/10.1101/2024.07.08.24310079
2024-07-08
Abstract: Rationale: Genome-wide association studies (GWAS) have identified multiple genetic loci associated with chronic obstructive pulmonary disease (COPD). When integrated with GWAS results, expression quantitative trait locus (eQTL) studies can provide insight into biological mechanisms involved in disease by identifying single nucleotide polymorphisms (SNPs) that contribute to whole gene expression. However, there are multiple genetically driven regulatory and isoform-specific effects that cannot be detected in traditional eQTL analyses. Here, we identify SNPs that are associated with alternative splicing (sQTLs), in addition to eQTLs, to identify novel functions for COPD-associated genetic variants. Methods: We performed RNA sequencing on whole blood from 3,743 subjects in the COPDGene Study. We also used RNA sequencing data from lung tissue of 1,241 subjects from the Lung Tissue Research Consortium (LTRC), and whole genome sequencing data on all subjects. Associations between all SNPs within 1,000 kb of a gene (cis-) and splice and gene expression quantifications were tested using tensorQTL. In COPDGene, a total of 11,869,333 SNPs were tested for association with 58,318 splice clusters, and in LTRC, 8,792,206 SNPs were tested for association with 70,094 splice clusters. We assessed colocalization with COPD-associated SNPs from a published GWAS[1]. Results: After adjustment for multiple statistical testing, we identified 28,110 splice-sites corresponding to 3,889 unique genes that were significantly associated with genotype in COPDGene whole blood, and 58,258 splice-sites corresponding to 10,307 unique genes associated with genotype in LTRC lung tissue. We found that 7,576 sQTL splice-sites corresponding to 2,110 sQTL genes were shared between whole blood and lung, while 20,534 sQTL splice-sites in 3,518 genes were unique to blood and 50,682 splice-sites in 9,677 genes were unique to lung. 
To determine what proportion of COPD-associated SNPs were associated with transcriptional splicing, we performed colocalization analysis between COPD GWAS and sQTL data, and found that 38 genomic windows, corresponding to 38 COPD GWAS loci, had evidence of colocalization between QTLs and COPD. The top five colocalizations between COPD and lung sQTLs include NPNT, FBXO38, HHIP, NTN4 and BTC. Conclusions: A total of 38 COPD GWAS loci contain evidence of sQTLs, suggesting that analysis of sQTLs in whole blood and lung tissue can provide novel insights into disease mechanisms.
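
The cis- definition in the Methods (all SNPs within 1,000 kb of a gene) can be illustrated with a minimal Python sketch. This is not the tensorQTL implementation. The function name `cis_snps` and the use of a single anchor coordinate per gene or splice cluster are assumptions made for illustration, since the abstract does not state how the window is anchored.

```python
from bisect import bisect_left, bisect_right

# 1,000 kb cis window, as stated in the Methods.
CIS_WINDOW_BP = 1_000_000


def cis_snps(snp_positions, anchor_bp, window_bp=CIS_WINDOW_BP):
    """Return SNP positions within window_bp of anchor_bp (inclusive).

    snp_positions: base-pair positions on one chromosome, sorted ascending.
    anchor_bp: hypothetical anchor coordinate for the gene or splice cluster
               (the choice of anchor is an assumption, not from the paper).
    """
    lo = bisect_left(snp_positions, anchor_bp - window_bp)
    hi = bisect_right(snp_positions, anchor_bp + window_bp)
    return snp_positions[lo:hi]


# Toy example: only SNPs at most 1,000 kb from the anchor are kept.
positions = [100, 500_000, 1_500_000, 2_500_000, 2_500_001, 4_000_000]
print(cis_snps(positions, anchor_bp=1_500_000))
```

Binary search on sorted positions keeps the lookup cheap when the same SNP list is queried for many genes or splice clusters, which matters at the scale of millions of SNPs reported above.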
Genetic and Genomic Medicine
What problem does this paper attempt to address?
The main objective of this paper is to gain new insight into the pathogenesis of chronic obstructive pulmonary disease (COPD) by analyzing the overlap between COPD-associated genetic variants and the transcriptome. Specifically, the research team conducted the following work:

1. **Research Background**: COPD is a complex disease in which genetic factors play a significant role in disease susceptibility. Although genome-wide association studies (GWAS) have identified multiple genetic loci associated with COPD, the specific pathogenic mechanisms of these loci remain unclear.
2. **Research Methods**:
   - RNA sequencing data were obtained from whole blood of 3,743 individuals in the COPDGene study and from lung tissue of 1,241 individuals in the Lung Tissue Research Consortium (LTRC).
   - Single nucleotide polymorphisms (SNPs) located within 1,000 kb of a gene were tested for association with splicing and gene expression quantifications, yielding splicing quantitative trait loci (sQTLs) and expression quantitative trait loci (eQTLs).
   - Bayesian methods were used to assess whether there is colocalization between GWAS results and sQTL/eQTL data, i.e., to determine whether they share the same causal variants.
3. **Main Findings**:
   - In the COPDGene whole blood samples, 28,110 splice sites corresponding to 3,889 unique genes were significantly associated with genotype; in the LTRC lung tissue samples, 58,258 splice sites corresponding to 10,307 unique genes were significantly associated with genotype.
   - 38 COPD GWAS loci were found to have sQTL evidence, suggesting that analyzing sQTLs can provide new perspectives for understanding disease mechanisms.
   - The top five colocalizations between COPD and lung sQTLs involve NPNT, FBXO38, HHIP, NTN4, and BTC, which may be involved in the pathogenesis of COPD through the regulation of splicing.

In summary, this paper aims to explore the molecular mechanisms of COPD by analyzing the impact of genetic variations on splicing and to identify potential new therapeutic targets.
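
The Bayesian colocalization step mentioned above can be illustrated with a small Python sketch. It assumes a coloc-style approach (per-SNP Wakefield approximate Bayes factors combined into posterior probabilities for five hypotheses, H0 to H4, under a single-causal-variant assumption). The text here does not name the software or priors used, so the function names, the prior standard deviation, and the priors `p1`, `p2`, `p12` below are illustrative assumptions rather than the authors' settings.

```python
import math


def logsumexp(xs):
    """Numerically stable log(sum(exp(x) for x in xs))."""
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))


def logdiffexp(a, b):
    """log(exp(a) - exp(b)); requires a > b."""
    return a + math.log1p(-math.exp(b - a))


def log_abf(beta, se, prior_sd):
    """Wakefield log approximate Bayes factor for one SNP."""
    z = beta / se
    r = prior_sd ** 2 / (prior_sd ** 2 + se ** 2)
    return 0.5 * (math.log(1.0 - r) + r * z * z)


def coloc_posteriors(labf_gwas, labf_qtl, p1=1e-4, p2=1e-4, p12=1e-5):
    """Posterior probabilities of H0..H4 for one genomic window.

    H3: both traits have a causal variant, but different ones.
    H4: both traits share one causal variant (colocalization).
    Priors are assumed values, not taken from the paper.
    """
    if len(labf_gwas) != len(labf_qtl) or len(labf_gwas) < 2:
        raise ValueError("need the same SNPs (at least two) for both traits")
    both = [a + b for a, b in zip(labf_gwas, labf_qtl)]
    s1, s2, s12 = logsumexp(labf_gwas), logsumexp(labf_qtl), logsumexp(both)
    log_h = {
        "H0": 0.0,
        "H1": math.log(p1) + s1,
        "H2": math.log(p2) + s2,
        "H3": math.log(p1) + math.log(p2) + logdiffexp(s1 + s2, s12),
        "H4": math.log(p12) + s12,
    }
    total = logsumexp(list(log_h.values()))
    return {h: math.exp(v - total) for h, v in log_h.items()}


# Toy window of three SNPs with made-up effect sizes (not real data).
se = 0.1
gwas = [log_abf(b, se, 0.15) for b in (0.60, 0.05, 0.02)]
sqtl_same_snp = [log_abf(b, se, 0.15) for b in (0.55, 0.03, 0.01)]
sqtl_other_snp = [log_abf(b, se, 0.15) for b in (0.03, 0.01, 0.55)]
print(coloc_posteriors(gwas, sqtl_same_snp))   # H4 dominates
print(coloc_posteriors(gwas, sqtl_other_snp))  # H3 outweighs H4
```

In the first toy window the GWAS and sQTL signals peak at the same SNP, so most posterior mass falls on H4. In the second, the peaks sit on different SNPs and H3 receives more support than H4. A window would typically be called colocalized when the H4 posterior exceeds a chosen threshold, which is also an analysis choice not specified here.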