Abstract:The initial belief that haplotype block boundaries and haplotypes were largely shared across populations was a foundation for constructing a haplotype map of the human genome using common SNP markers. The HapMap data document the generality of a block-like pattern of linkage disequilibrium (LD) with regions of low and high haplotype diversity but differences among the populations. Studies of many additional populations demonstrate that LD patterns can be highly variable among populations both across and within geographic regions. Because of this variation, emphasis has shifted to the generalizability of tagSNPs, those SNPs that capture the bulk of variation in a region. We have examined the LD and tagSNP patterns based upon over 2000 individual samples in 38 populations and 134 SNPs in 10 genetically independent loci for a total of 517 kb with an average density of 1 SNP/5 kb. Four different 'block' definitions and the pairwise LD tagSNP selection algorithm have been applied. Our results not only confirm large variation in block partition among populations from different regions (agreeing with previous studies including the HapMap) but also show that significant variation can occur among populations within geographic regions. None of the block-defining algorithms produces a consistent pattern within or across all geographic groups. In contrast, tagSNP transferability is much greater than the similarity of LD patterns and, although not perfect, some generalizations of transferability are possible. The analyses show an asymmetric pattern of tagSNP transferability coinciding with the subsetting of variation attributed to the spread of modern humans around the world.

Haplotype Block Partition with Limited Resources and Applications to Human Chromosome 21 Haplotype Data.

Dynamic programming algorithms for haplotype block partitioning: applications to human chromosome 21 haplotype data

A Dynamic Programming Algorithm for Haplotype Block Partitioning and Its Application in Association Studies.

Dynamic Programming Algorithms for Haplotype Block Partitioning and Tag SNP Selection Using Haplotype Data or Genotype Data

The Effect of Haplotype-Block Definitions on Inference of Haplotype-Block Structure and Htsnps Selection

[Analysis and Application of SNP and Haplotype in the Human Genome].

HapBlock: Haplotype Block Partitioning and Tag SNP Selection Software Using a Set of Dynamic Programming Algorithms.

Haplotype Block Partitioning and Tag SNP Selection Using Genotype Data and Their Applications to Association Studies

Inference of missing SNPs and information quantity measurements for haplotype blocks.

Genome-wide Compatible SNP Intervals and Their Properties

HaploBlockFinder: Haplotype Block Analyses

An Overview of the Haplotype Problems and Algorithms.

Htsnper1.0: Software for Haplotype Block Partition and Htsnps Selection.

Linkage Disequilibrium Sharing and Haplotype-Tagged SNP Portability Between Populations.

Significant Variation in Haplotype Block Structure but Conservation in Tagsnp Patterns among Global Populations

Long-range Polony Haplotyping of Individual Human Chromosome Molecules

Selecting Additional Tag SNPs for Tolerating Missing Data in Genotyping.

A general approach to single-nucleotide polymorphism discovery

Linear Algebraic Tag SNP Selection and Haplotype Reconstruction

The effect of single nucleotide polymorphism identification strategies on estimates of linkage disequilibrium.

Large-scale Genotyping of Complex DNA