Abstract: With millions of single-nucleotide polymorphisms (SNPs) identified and characterized, genomewide association studies have begun to identify susceptibility genes for complex traits and diseases. These studies involve the characterization and analysis of very-high-resolution SNP genotype data for hundreds or thousands of individuals. We describe a computationally efficient approach to testing association between SNPs and quantitative phenotypes, which can be applied to whole-genome association scans. In addition to observed genotypes, our approach allows estimation of missing genotypes, resulting in substantial increases in power when genotyping resources are limited. We estimate missing genotypes probabilistically using the Lander-Green or Elston-Stewart algorithms and combine high-resolution SNP genotypes for a subset of individuals in each pedigree with sparser marker data for the remaining individuals. We show that power is increased whenever phenotype information for ungenotyped individuals is included in analyses and that high-density genotyping of just three carefully selected individuals in a nuclear family can recover >90% of the information available if every individual were genotyped, for a fraction of the cost and experimental effort. To aid in study design, we evaluate the power of strategies that genotype different subsets of individuals in each pedigree and make recommendations about which individuals should be genotyped at a high density. To illustrate our method, we performed genomewide association analysis for 27 gene-expression phenotypes in 3-generation families (Centre d'Etude du Polymorphisme Humain pedigrees), in which genotypes for ~860,000 SNPs in 90 grandparents and parents are complemented by genotypes for ~6,700 SNPs in a total of 168 individuals. In addition to increasing the evidence of association at 15 previously identified cis-acting associated alleles, our genotype-inference algorithm allowed us to identify associated alleles at 4 cis-acting loci that were missed when analysis was restricted to individuals with the high-density SNP data. Our genotype-inference algorithm and the proposed association tests are implemented in software that is available for free.
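The idea of testing association with probabilistically estimated genotypes can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that posterior genotype probabilities for each individual are already available (for example, from a pedigree-likelihood calculation), converts them to an expected allele count, and regresses a quantitative phenotype on that count. The function names `expected_dosage` and `dosage_association` and the toy data are hypothetical. The sketch also treats individuals as independent, which a real family-based test must not do, and uses a normal approximation for the p-value.

```python
from math import erfc, sqrt


def expected_dosage(probs):
    """Expected count of the tested allele, given (P(g=0), P(g=1), P(g=2))."""
    p0, p1, p2 = probs
    if abs(p0 + p1 + p2 - 1.0) > 1e-9:
        raise ValueError("genotype probabilities must sum to 1")
    return p1 + 2.0 * p2


def dosage_association(dosages, phenotypes):
    """Least-squares regression of phenotype on dosage.

    Returns (slope, z statistic, two-sided p-value from a normal
    approximation). Assumes independent individuals, which is a
    simplification relative to any family-based test.
    """
    n = len(dosages)
    mean_x = sum(dosages) / n
    mean_y = sum(phenotypes) / n
    sxx = sum((x - mean_x) ** 2 for x in dosages)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dosages, phenotypes))
    beta = sxy / sxx
    rss = sum(
        (y - mean_y - beta * (x - mean_x)) ** 2
        for x, y in zip(dosages, phenotypes)
    )
    se = sqrt(rss / (n - 2) / sxx)
    z = beta / se
    p_value = erfc(abs(z) / sqrt(2.0))
    return beta, z, p_value


# Toy data (invented for illustration): the first three individuals are
# genotyped, so one genotype has probability 1; the last three are
# ungenotyped and carry only posterior probabilities.
genotype_probs = [
    (1.0, 0.0, 0.0),
    (0.0, 1.0, 0.0),
    (0.0, 0.0, 1.0),
    (0.5, 0.5, 0.0),
    (0.0, 0.5, 0.5),
    (0.25, 0.5, 0.25),
]
phenotypes = [1.0, 2.1, 2.9, 1.6, 2.4, 2.0]

dosages = [expected_dosage(p) for p in genotype_probs]
beta, z, p_value = dosage_association(dosages, phenotypes)
```

In this sketch, ungenotyped individuals still contribute their phenotypes through an expected genotype, which is the intuition behind the power gain described above; the within-family correlation that a family-based analysis needs to model is deliberately left out.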

Simultaneous Analysis of Common and Rare Variants in Complex Traits: Application to SNPs (Scarvasnp)

A Robust Model-free Approach for Rare Variants Association Studies Incorporating Gene-Gene and Gene-Environmental Interactions

A LASSO-based Approach to Analyzing Rare Variants in Genetic Association Studies

SCAMPI: A scalable statistical framework for genome-wide interaction testing harnessing cross-trait correlations

Methods for the Analysis and Interpretation for Rare Variants Associated with Complex Traits.

A Novel SNP-set Analytical Method Without Distinguishing Common Variants or Rare Variants in Genome-Wide Association Study

Accelerating Sparse Canonical Correlation Analysis for Large Brain Imaging Genetics Data

Mapping structural variants to rare disease genes using long-read whole genome sequencing and trait-relevant polygenic scores

Novel Association Strategy with Copy Number Variation for Identifying New Risk Loci of Human Diseases

Approach of Fusing Multiple Tests to Analyzing Rare Genetic Variants

Identifying rare variants using a Bayesian regression approach

A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions.

A Probabilistic Method for Identifying Rare Variants Underlying Complex Traits

A Robust and Powerful Set-Valued Approach to Rare Variant Association Analyses of Secondary Traits in Case-Control Sequencing Studies.

Methods for Association Analysis and Meta‐Analysis of Rare Variants in Families

Extending Rare-Variant Testing Strategies: Analysis of Noncoding Sequence and Imputed Genotypes

Comparing Partial Least Square Approaches in A Gene- or Region-Based Association Study for Multiple Quantitative Phenotypes

Family-Based Association Tests for Genomewide Association Scans

Subset scanning for multi-trait analysis using GWAS summary statistics

A statistical framework for powerful multi-trait rare variant analysis in large-scale whole-genome sequencing studies

RVTESTS: an Efficient and Comprehensive Tool for Rare Variant Association Analysis Using Sequence Data.