Informed genome-wide association analysis with family history as a secondary phenotype identifies novel loci of lung cancer.

Julia G Poirier,Paul Brennan,James D McKay,Margaret R Spitz,Heike Bickeböller,Angela Risch,Geoffrey Liu,Loic Le Marchand,Shelley Tworoger,John McLaughlin,Albert Rosenberger,Joachim Heinrich,Irene Brüske,Thomas Muley,Brian E Henderson,Lynne R Wilkens,Xuchen Zong,Yafang Li,Ke Hao,Wim Timens,Yohan Bossé,Don D Sin,Ma'en Obeidat,Christopher I Amos,Rayjean J Hung
DOI: https://doi.org/10.1002/gepi.21882
2015-01-01
Genetic Epidemiology
Abstract:Lung cancer is the leading cause of cancer death worldwide. Although several genetic variants associated with lung cancer have been identified in the past, stringent selection criteria of genome-wide association studies (GWAS) can lead to missed variants. The objective of this study was to uncover missed variants by using the known association between lung cancer and first-degree family history of lung cancer to enrich the variant prioritization for lung cancer susceptibility regions. In this two-stage GWAS study, we first selected a list of variants associated with both lung cancer and family history of lung cancer in four GWAS (3,953 cases, 4,730 controls), then replicated our findings for 30 variants in a meta-analysis of four additional studies (7,510 cases, 7,476 controls). The top ranked genetic variant rs12415204 in chr10q23.33 encoding FFAR4 in the Discovery set was validated in the Replication set with an overall OR of 1.09 (95% CI = 1.04, 1.14, P=1.63x 10(-4)). When combining the two stages of the study, the strongest association was found in rs1158970 at Ch4p15.2 encoding KCNIP4 with an OR of 0.89 (95% CI = 0.85, 0.94, P = 9.64 x 10(-6)). We performed a stratified analysis of rs12415204 and rs1158970 across all eight studies by age, gender, smoking status, and histology, and found consistent results across strata. Four of the 30 replicated variants act as expression quantitative trait loci (eQTL) sites in 1,111 nontumor lung tissues and meet the genome-wide 10% FDR threshold.
What problem does this paper attempt to address?