Graphasing: phasing diploid genome assembly graphs with single-cell strand sequencing

Mir Henglin,Maryam Ghareghani,William T. Harvey,David Porubsky,Sergey Koren,Evan E. Eichler,Peter Ebert,Tobias Marschall
DOI: https://doi.org/10.1186/s13059-024-03409-1
IF: 17.906
2024-10-12
Genome Biology
Abstract:Haplotype information is crucial for biomedical and population genetics research. However, current strategies to produce de novo haplotype-resolved assemblies often require either difficult-to-acquire parental data or an intermediate haplotype-collapsed assembly. Here, we present Graphasing, a workflow which synthesizes the global phase signal of Strand-seq with assembly graph topology to produce chromosome-scale de novo haplotypes for diploid genomes. Graphasing readily integrates with any assembly workflow that both outputs an assembly graph and has a haplotype assembly mode. Graphasing performs comparably to trio phasing in contiguity, phasing accuracy, and assembly quality, outperforms Hi-C in phasing accuracy, and generates human assemblies with over 18 chromosome-spanning haplotypes.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?