Abstract:Nucleophilic addition of bisulfite to pyrimidine bases has been known for a half century, and the reaction has been in use for at least a quarter of a century for identifying 5-methylcytidine in DNA. This account focuses on the chemistry of bisulfite with pseudouridine, an isomer of the RNA nucleoside uridine in which the uracil base is connected to C1' of ribose via C5 instead of N1. Pseudouridine, Ψ, is the most common nucleotide modification found in cellular RNA overall, in part due to its abundance in rRNAs and tRNAs. It has a stabilizing influence on RNA structure because N1 is now available for additional hydrogen bonding and because the heterocycle is slightly better at π stacking. The isomerization of U to Ψ in RNA strands is catalyzed by 13 different enzymes in humans and 11 in E. coli; some of these enzymes are implicated in disease states which is testament to the biological importance of pseudouridine in cells. Recently, pseudouridine came into the limelight as the key modification that, after N1 methylation, enables mRNA vaccines to be delivered efficiently into human tissue with minimal generation of a deleterious immunogenic response. Here we describe the bisulfite reaction with pseudouridine which gives rise to a chemical sequencing method to map the modified base in the epitranscriptome. Unlike the reaction with cytidine, the addition of bisulfite to Ψ leads irreversibly to form an adduct that is bypassed during cDNA synthesis by reverse transcriptases yielding a characteristic deletion signature. Although there were hints to the structure of the bisulfite adduct(s) 30 to 50 years ago, it took modern spectroscopic and computational methods to solve the mystery. Raman spectroscopy along with extensive NMR, ECD, and computational work led to the assignment of the major product as the (R) diastereomer of an oxygen adduct at C1' of a ring-opened pseudouridine. Mechanistically, this arose from a succession of conjugate addition, E2 elimination, and a [2,3] sigmatropic rearrangement, all of which are stereodefined reactions. A minor reaction with excess bisulfite led to the (S) isomer of a S-adducted SO3- group. Understanding structure and mechanism aided the design of a Ψ-specific sequencing reaction and guided attempts to improve the utility and specificity of the method. Separately, we have been investigating the use of nanopore direct RNA sequencing, a single-molecule method that directly analyzes RNA strands isolated from cells after end-ligation of adaptor sequences. By combining the electrical current and base-calling data from the nanopore with dwell-time analysis from the helicase employed to deliver RNA to the nanopore, we were able to map Ψ sites in nearly all sequence contexts. This analysis was employed to find Ψ residues in the SARS-CoV-2 vRNA, to analyze the sequence context effects of mRNA vaccine synthesis via in vitro transcription, and to evaluate the impact of stress on chemical modifications in the E. coli ribosome. Most recently, we found that bisulfite treatment of RNA leading to Ψ adducts could modulate the nanopore signal to help in mapping modifications of low occupancy.

Quantitative sequencing using BID-seq uncovers abundant pseudouridines in mammalian mRNA at base resolution

BID-seq for transcriptome-wide quantitative sequencing of mRNA pseudouridine at base resolution

Absolute quantitative and base-resolution sequencing reveals comprehensive landscape of pseudouridine across the human transcriptome

Quantitative Profiling of Pseudouridylation Landscape in the Human Transcriptome

A bisulfite-assisted and ligation-based qPCR amplification technology for locus-specific pseudouridine detection at base resolution

Quantitative mapping of pseudouridines in bacteria RNA

Integrative analysis of nanopore direct RNA sequencing data reveals a role of PUS7-dependent pseudouridylation in regulation of m6A and m5C modifications

Interferon inducible pseudouridine modification in human mRNA by quantitative nanopore profiling

Quantitative profiling of pseudouridylation dynamics in native RNAs with nanopore sequencing

Bisulfite and Nanopore Sequencing for Pseudouridine in RNA

Quantitative detection of pseudouridine in RNA by mass spectrometry

Decoding Pseudouridine: an Emerging Target for Therapeutic Development.

Chemical pulldown reveals dynamic pseudouridylation of the mammalian transcriptome

mRNA psi profiling using nanopore DRS reveals cell-type-specific pseudouridylation

Promoted Read-through and Mutation Against Pseudouridine-CMC by an Evolved Reverse Transcriptase

Integrative analysis of nanopore direct RNA sequencing data reveals a role of PUS7-dependent pseudouridylation in regulation of m 6 A and m 5 C modifications

Nanopore signal deviations from pseudouridine modifications in RNA are sequence-specific: quantification requires dedicated synthetic controls

Transcriptome-wide analysis of pseudouridylation of mRNA and non-coding RNAs in Arabidopsis

Transcriptome-wide analysis of pseudouridylation in Drosophila melanogaster

NanoMUD: Profiling of pseudouridine and N1-methylpseudouridine using Oxford Nanopore direct RNA sequencing