[Advances in mapping analysis of ribonucleic acid modifications through sequencing]
Jun Xiong,Tian Feng,Bi-Feng Yuan
DOI: https://doi.org/10.3724/SP.J.1123.2023.12025
Abstract:Over 170 chemical modifications have been discovered in various types of ribonucleic acids (RNAs), including messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), and small nuclear RNA (snRNA). These RNA modifications play crucial roles in a wide range of biological processes such as gene expression regulation, RNA stability maintenance, and protein translation. RNA modifications represent a new dimension of gene expression regulation known as the "epitranscriptome". The discovery of RNA modifications and the relevant writers, erasers, and readers provides an important basis for studies on the dynamic regulation and physiological functions of RNA modifications. Owing to the development of detection technologies for RNA modifications, studies on RNA epitranscriptomes have progressed to the single-base resolution, multilayer, and full-coverage stage. Transcriptome-wide methods help discover new RNA modification sites and are of great importance for elucidating the molecular regulatory mechanisms of epitranscriptomics, exploring the disease associations of RNA modifications, and understanding their clinical applications. The existing RNA modification sequencing technologies can be categorized according to the pretreatment approach and sequencing principle as direct high-throughput sequencing, antibody-enrichment sequencing, enzyme-assisted sequencing, chemical labeling-assisted sequencing, metabolic labeling sequencing, and nanopore sequencing technologies. These methods, as well as studies on the functions of RNA modifications, have greatly expanded our understanding of epitranscriptomics. In this review, we summarize the recent progress in RNA modification detection technologies, focusing on the basic principles, advantages, and limitations of different methods. Direct high-throughput sequencing methods do not require complex RNA pretreatment and allow for the mapping of RNA modifications using conventional RNA sequencing methods. However, only a few RNA modifications can be analyzed by high-throughput sequencing. Antibody enrichment followed by high-throughput sequencing has emerged as a crucial approach for mapping RNA modifications, significantly advancing the understanding of RNA modifications and their regulatory functions in different species. However, the resolution of antibody-enrichment sequencing is limited to approximately 100-200 bp. Although chemical crosslinking techniques can achieve single-base resolution, these methods are often complex, and the specificity of the antibodies used in these methods has raised concerns. In particular, the issue of off-target binding by the antibodies requires urgent attention. Enzyme-assisted sequencing has improved the accuracy of the localization analysis of RNA modifications and enables stoichiometric detection with single-base resolution. However, the enzymes used in this technique show poor reactivity, specificity, and sequence preference. Chemical labeling sequencing has become a widely used approach for profiling RNA modifications, particularly by altering reverse transcription (RT) signatures such as RT stops, misincorporations, and deletions. Chemical-assisted sequencing provides a sequence-independent RNA modification detection strategy that enables the localization of multiple RNA modifications. Additionally, when combined with the biotin-streptavidin affinity method, low-abundance RNA modifications can be enriched and detected. Nevertheless, the specificity of many chemical reactions remains problematic, and the development of specific reaction probes for particular modifications should continue in the future to achieve the precise localization of RNA modifications. As an indirect localization method, metabolic labeling sequencing specifically localizes the sites at which modifying enzymes act, which is of great significance in the study of RNA modification functions. However, this method is limited by the intracellular labeling of RNA and cannot be applied to biological samples such as clinical tissues and blood samples. Nanopore sequencing is a direct RNA-sequencing method that does not require RT or the polymerase chain reaction (PCR). However, challenges in analyzing the data obtained from nanopore sequencing, such as the high rate of false positives, must be resolved. Discussing sequencing analysis methods for various types of RNA modifications is instructive for the future development of novel RNA modification mapping technologies, and will aid studies on the functions of RNA modifications across the entire transcriptome.