Integration of Population-Level Genotype Data with Functional Annotation Reveals Over-Representation of Long Noncoding RNAs at Ovarian Cancer Susceptibility Loci
Brett M. Reid,Jennifer B. Permuth,Y. Ann Chen,Jamie K. Teer,Alvaro N. A. Monteiro,Zhihua Chen,Jonathan Tyrer,Andrew Berchuck,Georgia Chenevix-Trench,Jennifer A. Doherty,Ellen L. Goode,Edwin S. Iverson,Kate Lawrenson,Celeste L. Pearce,Paul D. Pharoah,Catherine M. Phelan,Susan J. Ramus,Mary Anne Rossing,Joellen M. Schildkraut,Jin Q. Cheng,Simon A. Gayther,Thomas A. Sellers
DOI: https://doi.org/10.1158/1055-9965.epi-16-0341
2016-01-01
Cancer Epidemiology Biomarkers & Prevention
Abstract:Abstract Background: Genome-wide association studies (GWAS) have identified multiple loci associated with epithelial ovarian cancer (EOC) susceptibility, but further progress requires integration of epidemiology and biology to illuminate true risk loci below genome-wide significance levels (P < 5 × 10−8). Most risk SNPs lie within non–protein-encoding regions, and we hypothesize that long noncoding RNA (lncRNA) genes are enriched at EOC risk regions and represent biologically relevant functional targets. Methods: Using imputed GWAS data from about 18,000 invasive EOC cases and 34,000 controls of European ancestry, the GENCODE (v19) lncRNA database was used to annotate SNPs from 13,442 lncRNAs for permutation-based enrichment analysis. Tumor expression quantitative trait locus (eQTL) analysis was performed for sub-genome-wide regions (1 × 10−5 > P > 5 × 10−8) overlapping lncRNAs. Results: Of 5,294 EOC-associated SNPs (P < 1.0 × 10−5), 1,464 (28%) mapped within 53 unique lncRNAs and an additional 3,484 (66%) SNPs were correlated (r2 > 0.2) with SNPs within 115 lncRNAs. EOC-associated SNPs comprised 130 independent regions, of which 72 (55%) overlapped with lncRNAs, representing a significant enrichment (P = 5.0 × 10−4) that was more pronounced among a subset of 5,401 lncRNAs with active epigenetic regulation in normal ovarian tissue. EOC-associated lncRNAs and their putative promoters and transcription factors were enriched for biologically relevant pathways and eQTL analysis identified five novel putative risk regions with allele-specific effects on lncRNA gene expression. Conclusions: lncRNAs are significantly enriched at EOC risk regions, suggesting a mechanistic role for lncRNAs in driving predisposition to EOC. Impact: lncRNAs represent key candidates for integrative epidemiologic and functional studies. Further research on their biologic role in ovarian cancer is indicated. Cancer Epidemiol Biomarkers Prev; 26(1); 116–25. ©2016 AACR.