Abstract:Abstract The use of genetic polymorphism data to understand the dynamics of adaptation and identify the loci that are involved has become a major pursuit of modern evolutionary genetics. In addition to the classical “hard sweep” hitchhiking model, recent research has drawn attention to the fact that the dynamics of adaptation can play out in a variety of different ways and that the specific signatures left behind in population genetic data may depend somewhat strongly on these dynamics. One particular model for which a large number of empirical examples are already known is that in which a single derived mutation arises and drifts to some low frequency before an environmental change causes the allele to become beneficial and sweeps to fixation. Here, we pursue an analytical investigation of this model, bolstered and extended via simulation study. We use coalescent theory to develop an analytical approximation for the effect of a sweep from standing variation on the genealogy at the locus of the selected allele and sites tightly linked to it. We show that the distribution of haplotypes that the selected allele is present on at the time of the environmental change can be approximated by considering recombinant haplotypes as alleles in the infinite-alleles model. We show that this approximation can be leveraged to make accurate predictions regarding patterns of genetic polymorphism following such a sweep. We then use simulations to highlight which sources of haplotypic information are likely to be most useful in distinguishing this model from neutrality, as well as from other sweep models, such as the classic hard sweep and multiple-mutation soft sweeps. We find that in general, adaptation from a unique standing variant will likely be difficult to detect on the basis of genetic polymorphism data from a single population time point alone, and when it can be detected, it will be difficult to distinguish from other varieties of selective sweeps. Samples from multiple populations and/or time points have the potential to ease this difficulty.

Robust identification of local adaptation from allele frequencies

Using Environmental Correlations to Identify Loci Underlying Local Adaptation

Demography-adjusted tests of neutrality based on genome-wide SNP data

The Population Genetic Signature of Polygenic Local Adaptation

Improving population-specific allele frequency estimates by adapting supplemental data: an empirical Bayes approach

Bayesian Inference of Natural Selection from Allele Frequency Time Series

A Probabilistic Method for Testing and Estimating Selection Differences Between Populations.

Testing for Associations between Loci and Environmental Gradients Using Latent Factor Mixed Models

DnaSAM: Software to perform neutrality testing for large datasets with complex null models

Needles in the Haystack: Identifying Individuals Present in Pooled Genomic Data

An integrated approach to reduce the impact of minor allele frequency and linkage disequilibrium on variable importance measures for genome-wide data

BlockFeST: Bayesian calculation of region-specific FST to detect local adaptation

Properties of neutrality tests based on allele frequency spectrum

The Precision and Power of Population Branch Statistics in Identifying the Genomic Signatures of Local Adaptation

A Genome-Scan Method to Identify Selected Loci Appropriate for Both Dominant and Codominant Markers: A Bayesian Perspective

A Coalescent Model for a Sweep of a Unique Standing Variant

Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data

A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data

Testing for Ancient Selection Using Cross-population Allele Frequency Differentiation

Efficient test for deviation from Hardy Weinberg Equilibrium with known or ambiguous typing in highly polymorphic loci

Localizing Post-Admixture Adaptive Variants with Object Detection on Ancestry-Painted Chromosomes