Two-stage genome-wide association studies with dna pooling and genetic model selection

Min Yuan,YaNing Yang,Gang Zheng
IF: 1.4
2009-01-01
Statistica Sinica
Abstract:The two-stage design is a common cost-effective approach for genome-wide association studies. The first stage serves as a screening to identify a subset of single-nucleotide polymorphisms (SNPs) from 100,000 to 500,000 SNPs using a fraction of case-control samples. In the second stage, only the selected SNPs are genotyped using the remaining case-control samples. On the other hand, DNA pooling is another common strategy to save genotyping cost. In this article, we propose a method using DNA pooling in the first stage and genotype-based analysis in the second stage. A joint analysis to combine both stages is applied to a two-stage design with DNA pooling when the underlying genetic model is known. When the genetic model is unknown, we use a robust procedure in the joint analysis by applying genetic model selection in the second stage based on the difference of Hardy-Weinberg disequilibrium coefficients between cases and controls. Performance of our method and comparison with other approaches are investigated by simulation studies.
What problem does this paper attempt to address?