RNAvigate: efficient exploration of RNA chemical probing datasets

Patrick S Irving,Kevin M Weeks
DOI: https://doi.org/10.1093/nar/gkae089
IF: 14.9
2024-02-13
Nucleic Acids Research
Abstract:Abstract Chemical probing technologies enable high-throughput examination of diverse structural features of RNA, including local nucleotide flexibility, RNA secondary structure, protein and ligand binding, through-space interaction networks, and multistate structural ensembles. Deep understanding of RNA structure–function relationships typically requires evaluating a system under structure- and function-altering conditions, linking these data with additional information, and visualizing multilayered relationships. Current platforms lack the broad accessibility, flexibility and efficiency needed to iterate on integrative analyses of these diverse, complex data. Here, we share the RNA visualization and graphical analysis toolset RNAvigate, a straightforward and flexible Python library that automatically parses 21 standard file formats (primary sequence annotations, per- and internucleotide data, and secondary and tertiary structures) and outputs 18 plot types. RNAvigate enables efficient exploration of nuanced relationships between multiple layers of RNA structure information and across multiple experimental conditions. Compatibility with Jupyter notebooks enables nonburdensome, reproducible, transparent and organized sharing of multistep analyses and data visualization strategies. RNAvigate simplifies and accelerates discovery and characterization of RNA-centric functions in biology.
biochemistry & molecular biology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to efficiently explore and analyze RNA chemical probe data sets to promote in - depth understanding of the relationship between RNA structure and function. Specifically, the author has developed a tool named RNAvigate, aiming to address the following two main challenges: 1. **Data pre - processing and filtering**: In order to identify and highlight the key features of the data set, pre - processing steps such as background correction and normalization are required. These steps are crucial for cleaning the data, removing noise, and improving the accuracy of data analysis. 2. **Comprehensive visualization of multi - layer data**: In order to fully understand an RNA system, data at multiple levels (such as primary structure, secondary structure, tertiary structure, etc.) need to be combined for visualization, and these data need to be compared under different experimental conditions. Existing tools are often only able to handle a single type of data, lacking flexibility and ease of use. RNAvigate solves these problems by providing a flexible and easy - to - use Python library that can automatically parse 21 standard file formats and generate 18 different types of charts. This enables researchers to more efficiently explore the subtle relationships between RNA structural information and make comparisons under different experimental conditions. In addition, RNAvigate is compatible with Jupyter notebooks, supporting transparent, reproducible, and organized multi - step analysis and data visualization strategies, thereby simplifying the process of RNA - related function discovery and characterization.