Efficient whole genome haplotyping and high-throughput single molecule phasing with barcode-linked reads

David Redin, Tobias Frick, Hooman Aghelpasand, Jennifer Theland, Max Käller, Erik Borgström, Remi-Andre Olsen, Afshin Ahmadian
DOI: https://doi.org/10.1101/356121
2018-06-26
Abstract: The future of human genomics is one that seeks to resolve the entirety of genetic variation through sequencing. The prospect of utilizing genomics for medical purposes requires cost-efficient and accurate base calling, long-range haplotyping capability, and reliable calling of structural variants. Short-read sequencing has led the development towards such a future but has struggled to meet the latter two of these needs [1]. To address this limitation, we developed a technology that preserves the molecular origin of short sequencing reads, with an insignificant increase in sequencing costs. We demonstrate a novel library preparation method that enables whole genome haplotyping, long-range phasing of single DNA molecules, and de novo genome assembly through barcode-linked reads (BLR). Millions of random barcodes are used to reconstruct megabase-scale phase blocks and call structural variants. We also highlight the versatility of our technology by generating libraries from different organisms using only picograms to nanograms of input material.