Abstract

Background: The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between a polymorphic marker and a putative disease locus. Most widely implemented methods for such analyses are vulnerable to several key demographic factors, and they deliver poor statistical power for detecting genuine associations together with a high false positive rate. Here, we present a likelihood-based statistical approach that properly accounts for the non-random nature of case–control samples with regard to the genotypic distribution at the loci in the populations under study, and that offers the flexibility to test for genetic association in the presence of confounding factors such as population structure and non-random sampling.

Results: We implemented this novel method, together with several popular methods from the GWAS literature, to re-analyze recently published Parkinson’s disease (PD) case–control samples. The real data analysis and computer simulations show that, compared with its rivals, the new method confers not only significantly improved statistical power for detecting associations but also robustness to the difficulties stemming from non-random sampling and genetic structure. In particular, the new method detected 44 significant SNPs within 25 chromosomal regions of size < 1 Mb, whereas only 6 SNPs in two of these regions were previously detected by trend-test-based methods. It discovered two SNPs located 1.18 Mb and 0.18 Mb from the PD candidates FGF20 and PARK8, without incurring false positive risk.

Conclusions: We developed a novel likelihood-based method that provides adequate estimation of LD and other population model parameters from case and control samples, allows easy integration of such samples from multiple genetically divergent populations, and thus confers statistically robust and powerful GWAS analyses. On the basis of simulation studies and analysis of real datasets, we demonstrated significant improvement of the new method over the non-parametric trend test, which is the most widely implemented test in the GWAS literature.
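For orientation, the baseline that the abstract compares against, the non-parametric trend test, can be sketched as follows. This is a minimal illustration of the standard Cochran-Armitage trend test on a single SNP, not the likelihood-based method proposed in the paper. The function name `trend_test`, the additive weights (0, 1, 2), and the genotype counts in the usage line are assumptions made for the example.

```python
import math


def trend_test(cases, controls, weights=(0, 1, 2)):
    """Cochran-Armitage trend test for a 2 x k genotype table.

    cases, controls: genotype counts ordered by number of risk alleles.
    weights: per-genotype scores; (0, 1, 2) is the additive coding
             (an assumption of this sketch, not taken from the paper).
    Returns (chi-square statistic with 1 df, asymptotic p-value).
    """
    if not (len(cases) == len(controls) == len(weights)):
        raise ValueError("cases, controls and weights must have equal length")

    totals = [r + s for r, s in zip(cases, controls)]  # per-genotype totals
    n_cases = sum(cases)
    n_controls = sum(controls)
    n = n_cases + n_controls

    sum_tr = sum(t * r for t, r in zip(weights, cases))
    sum_tn = sum(t * m for t, m in zip(weights, totals))
    sum_t2n = sum(t * t * m for t, m in zip(weights, totals))

    # Closed form of T^2 / Var(T) for the trend statistic T.
    numerator = n * (n * sum_tr - n_cases * sum_tn) ** 2
    denominator = n_cases * n_controls * (n * sum_t2n - sum_tn ** 2)
    if denominator == 0:
        raise ValueError("degenerate table: no variation in group or genotype")

    chi2 = numerator / denominator
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    # Made-up genotype counts, for illustration only.
    chi2, p = trend_test([10, 20, 30], [30, 20, 10])
    print(f"chi2 = {chi2:.3f}, p = {p:.3g}")
```

In a genome-wide scan this statistic would be computed per SNP and the resulting p-values compared against a multiple-testing threshold. The test uses only the observed genotype counts and has no built-in adjustment for population structure or non-random sampling, the confounders that the abstract says the likelihood-based approach is designed to handle.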

Accounting for multiple comparisons in a genome-wide association study (GWAS)

A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms

EBT: a Statistic Test Identifying Moderate Size of Significant Features with Balanced Power and Precision for Genome-Wide Rate Comparisons

Using Alternative Definitions of Controls to Increase Statistical Power in GWAS

A Robust and Efficient Statistical Method for Genetic Association Studies Using Case and Control Samples from Multiple Cohorts

Comparison of dimension reduction based logistic regression models for case control genome wide association study: principal components analysis vs. partial least squares

On Combining Data From Genome-Wide Association Studies to Discover Disease-Associated SNPs

A nonparametric test for association with multiple loci in the retrospective case-control study

Impact of genotyping errors on statistical power of association tests in genomic analyses: A case study

In search of causal variants: refining disease association signals using cross-population contrasts

An integrated approach to reduce the impact of minor allele frequency and linkage disequilibrium on variable importance measures for genome-wide data

Adjusting for principal components can induce spurious associations in genome-wide association studies in admixed populations

Revisiting the genome-wide significance threshold for common variant GWAS

Efficient and powerful familywise error control in genome-wide association studies using generalized linear models

Evaluating variations of genotype calling: a potential source of spurious associations in genome-wide association studies

Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data

Controlling the Rate of GWAS False Discoveries

Alternative Methods for H1 Simulations in Genome Wide Association Studies

Gene‐based association tests in family samples using GWAS summary statistics

Detect and Adjust for Population Stratification in Population-Based Association Study Using Genomic Control Markers: an Application of Affymetrix Genechip® Human Mapping 10K Array

Robust Reference Powered Association Test of Genome-Wide Association Studies