Abstract:The functional consequences of structural variants (SVs) in mammalian genomes are challenging to study. This is due to several factors, including: 1) their numerical paucity relative to other forms of standing genetic variation such as single nucleotide variants (SNVs) and short insertions or deletions (indels); 2) the fact that a single SV can involve and potentially impact the function of more than one gene and/or regulatory element; and 3) the relative immaturity of methods to generate and map SVs, either randomly or in targeted fashion, in or model systems. Towards addressing these challenges, we developed , a straightforward method that enables the multiplex generation and mapping of several major forms of SVs (deletions, inversions, translocations) throughout a mammalian genome. is based on the integration of “shuffle cassettes’’ to the genome, wherein each shuffle cassette contains components that facilitate its site-specific recombination (SSR) with other integrated shuffle cassettes (via Cre-loxP), its mapping to a specific genomic location (via T7-mediated transcription or IVT), and its identification in single-cell RNA-seq (scRNA-seq) data (via T7-mediated transcription or IST). In this proof-of-concept, we apply to induce and map thousands of genomic SVs in mouse embryonic stem cells (mESCs) in a single experiment. Induced SVs are rapidly depleted from the cellular population over time, possibly due to Cre-mediated toxicity and/or negative selection on the rearrangements themselves. Leveraging T7 IST of barcodes whose positions are already mapped, we further demonstrate that we can efficiently genotype which SVs are present in association with each of many single cell transcriptomes in scRNA-seq data. Finally, preliminary evidence suggests our method may be a powerful means of generating extrachromosomal circular DNAs (ecDNAs). Looking forward, we anticipate that may be broadly useful for the systematic exploration of the functional consequences of SVs on gene expression, the chromatin landscape, and 3D nuclear architecture. We further anticipate potential uses for modeling of ecDNAs, as well as in paving the path to a minimal mammalian genome.

Simultaneous de novo calling and phasing of genetic variants at chromosome-scale using NanoStrand-seq

Pseudo-Sanger Sequencing: Massively Parallel Production of Long and Near Error-Free Reads Using NGS Technology

Gapless assembly of complete human and plant chromosomes using only nanopore sequencing

NanoSNP: a progressive and haplotype-aware SNP caller on low-coverage nanopore sequencing data

Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation

Development of Multiomics in Situ Pairwise Sequencing (Mip-Seq) for Single-cell Resolution Multidimensional Spatial Omics

Efficient whole genome haplotyping and high-throughput single molecule phasing with barcode-linked reads

NanoVar: accurate characterization of patients' genomic structural variants using low-depth nanopore sequencing

Phased nanopore assembly with Shasta and modular graph phasing with GFAse

Large-scale Genotyping of Complex DNA

Long range haplotyping of paired-homologous chromosomes by single-chromosome sequencing of a single cell

Nanopore sequencing and assembly of a human genome with ultra-long reads

scNanoSeq-CUT&Tag: a single-cell long-read CUT&Tag sequencing method for efficient chromatin modification profiling within individual cells

A molecular orbital study of 2,4,5-trihydroxyphenethylamine and related polyhydroxyphenethylamines.

Fully phased human genome assembly without parental data using single-cell strand sequencing and long reads

A New Approach to Decode DNA Methylome and Genomic Variants Simultaneously from Double Strand Bisulfite Sequencing.

Targeted phasing of 2–200 kilobase DNA fragments with a short-read sequencer and a single-tube linked-read library method

De Novoassembly of Human Genome at Single-Cell Levels

Chromosome-scale, haplotype-resolved assembly of human genomes

Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads

Multiplex generation and single cell analysis of structural variants in a mammalian genome