Fast simulation of identity-by-descent segments

Seth D Temple, Sharon R Browning, Elizabeth A Thompson
DOI: https://doi.org/10.1101/2024.12.13.628449
2024-12-16
Abstract: The worst-case runtime complexity to simulate identity-by-descent segments is quadratic in sample size. We propose two main techniques to reduce the compute time, which are motivated by coalescent and recombination processes. We observe average runtimes to simulate detectable IBD segments around a locus that scale approximately linearly in sample size and take a couple of seconds for sample sizes less than ten thousand. In contrast, we find that existing methods to simulate IBD segments take minutes to hours for sample sizes exceeding a few thousand. When using IBD segments to study recent positive selection around a locus, our efficient algorithm makes feasible statistical inferences that would be otherwise intractable.
Biology