Abstract:

Background: The genealogical histories of individuals within populations are of interest to studies that aim to uncover detailed pedigree information as well as overall quantitative population demographic histories. However, the quantitative analysis of individual genealogical histories has been hampered by incomplete pedigree records and by a lack of objective, quantitative detail in pedigree information. Although complete pedigree information for most individuals is difficult to track beyond a few generations, a person’s genealogical history can be described using their genetic relatives, as revealed by identity by descent (IBD) segments: long genomic segments shared by two individuals within a population that are identical due to inheritance from common ancestors. When modern biobanks collect genotype information for a significant fraction of a population, a person’s dense genetic connections can be traced using such IBD segments, offering opportunities to characterize individuals in the context of the underlying population. Here, we conducted an individual-centric analysis of IBD segments among the UK Biobank participants, who represent 0.7% of the UK population.

Results: We made a high-quality call set of IBD segments over 5 cM among all 500,000 UK Biobank participants. On average, one UK individual shares IBD segments with 14,000 UK Biobank participants, whom we refer to as “relatives.” Using these segments, approximately 80% of a person’s genome can be imputed. We subsequently propose genealogical descriptors based on the genetic connections of relative cohorts of individuals sharing at least one IBD segment and show that such descriptors offer important information about one’s genetic makeup, personal genealogical history, and social behavior. By analyzing the number of relatives sharing segments of different lengths, we identified a group, potentially British Jews, that has a distinct pattern of familial expansion history. Finally, using the enrichment of relatives in one’s neighborhood, we identified regional variation in the personal preference for living closer to one’s extended family.

Conclusions: Our analysis revealed genetic makeup, personal genealogical history, and social behaviors at the population scale, opening possibilities for further studies of individuals’ genetic connections in biobank data.
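
The relative count described in the Results can be illustrated with a minimal sketch. It is not the authors' pipeline: the input format (tuples of two sample IDs and a segment length in cM), the function name `relatives_by_individual`, and the toy data are all hypothetical, and the 5 cM cutoff is treated here as inclusive, which is an assumption.

```python
from collections import defaultdict

MIN_CM = 5.0  # length threshold taken from the abstract; inclusive here by assumption


def relatives_by_individual(segments, min_cm=MIN_CM):
    """Map each individual to the set of individuals with whom they share
    at least one IBD segment of length >= min_cm.

    `segments` is an iterable of (id_a, id_b, length_cm) tuples
    (a hypothetical input format chosen for illustration).
    """
    relatives = defaultdict(set)
    for id_a, id_b, length_cm in segments:
        if id_a != id_b and length_cm >= min_cm:
            # IBD sharing is symmetric, so record the link in both directions.
            relatives[id_a].add(id_b)
            relatives[id_b].add(id_a)
    return relatives


# Toy data (hypothetical): several segments may link the same pair.
segments = [
    ("A", "B", 7.2),
    ("A", "B", 5.5),   # second segment for the same pair, counted once
    ("A", "C", 4.1),   # below the threshold, ignored
    ("B", "C", 12.0),
    ("C", "D", 5.0),
]

relatives = relatives_by_individual(segments)
counts = {person: len(rel) for person, rel in relatives.items()}
mean_relatives = sum(counts.values()) / len(counts)
```

Storing relatives as sets means a pair that shares several segments is still counted as one relative, which matches the abstract's definition of sharing "at least one IBD segment."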

Personalized genealogical history of UK individuals inferred from biobank-scale IBD segments
