Comparative analysis of CRISPR/Cas9-targeted nanopore sequencing approaches in repeat expansion disorders

Louise Benarroch, Pierre-Yves Boelle, Helene Madry, Badreddine Mohand Oumoussa, Nobuyuki Eura, Ichizo Nishino, Karim Labreche, Guillaume Bassez, Tanya Stojkovic, Genevieve Gourdon, Gisele Bonne, Stephanie Tome
DOI: https://doi.org/10.1101/2024.12.04.626786
2024-12-07
Abstract: More than 50 repeat expansion disorders have been identified, with long-read sequencing marking a new milestone in the diagnosis of these disorders. Despite these major achievements, the comprehensive characterization of short tandem repeats in a pathological context remains challenging, primarily due to their inherent characteristics such as motif complexity, high GC content, and variable length. In this study, our aim was to thoroughly characterize repeat expansions in two neuromuscular diseases: myotonic dystrophy type 1 (DM1) and oculopharyngodistal myopathy (OPDM) using CRISPR/Cas9-targeted long-read sequencing (Oxford Nanopore Technologies, ONT). We conducted precise analyses of the DM1 and OPDM loci, determining repeat size, repeat length distribution, expansion architecture and DNA methylation, using three different basecallers (Guppy, Bonito and Dorado). We demonstrated the importance of the basecalling strategy in repeat expansion characterization. We proposed guidelines to perform CRISPR/Cas9-targeted long-read sequencing (no longer supported by ONT), from library preparation to bioinformatic analyses. Finally, we showed, for the first time, somatic mosaicism, hypermethylation of LRP12 loci in symptomatic patients and changes in the repeat tract structure of OPDM patients. We propose a strategy based on CRISPR/Cas9-enrichment long-read sequencing for repeat expansion diseases, which could be readily applicable in research but also in diagnostic settings.
Bioinformatics
What problem does this paper attempt to address?
This paper addresses how to accurately characterize repeat expansions in repeat expansion disorders, specifically myotonic dystrophy type 1 (DM1) and oculopharyngodistal myopathy (OPDM), using CRISPR/Cas9-targeted long-read sequencing. The study aims to:

1. **Characterize repeat expansions precisely**: Determine repeat size, repeat length distribution, expansion architecture, and DNA methylation status.
2. **Evaluate the influence of the basecaller**: Compare three basecallers (Guppy, Bonito, and Dorado) in repeat expansion characterization to identify the most suitable basecalling strategy.
3. **Develop guidelines for CRISPR/Cas9-targeted long-read sequencing**: Provide detailed steps and recommendations from library preparation to bioinformatic analysis.
4. **Report somatic mosaicism and hypermethylation for the first time**: Show somatic mosaicism in OPDM patients and describe hypermethylation of the LRP12 locus in symptomatic patients.
5. **Explore changes in repeat tract structure**: Identify, for the first time, changes in repeat structure in three genes (LRP12, GIPC1, and NOTCH2NLC) in OPDM patients.

Together, these objectives aim to provide a reliable and efficient method for the diagnosis and study of repeat expansion disorders.