rnaCrosslinkOO: an object-oriented R package for the analysis of RNA structural data generated by RNA crosslinking experiments

Jonathan L Price,Omer Ziv,Malte L Pinckert,Andrew Lim,Eric A Miska
DOI: https://doi.org/10.1093/bioinformatics/btae193
IF: 5.8
2024-03-29
Bioinformatics
Abstract:Abstract Summary RNA (ribonucleic acid) molecules have secondary and tertiary structures in vivo which play a crucial role in cellular processes such as the regulation of gene expression, RNA processing and localization. The ability to investigate these structures will enhance our understanding of their function and contribute to the diagnosis and treatment of diseases caused by RNA dysregulation. However, there are no mature pipelines or packages for processing and analyzing complex in vivo RNA structural data. Here, we present rnaCrosslinkOO (RNA Crosslink Object-Oriented), a novel software package for the comprehensive analysis of data derived from the COMRADES (Crosslinking of Matched RNA and Deep Sequencing) method. rnaCrosslinkOO offers a comprehensive pipeline from raw sequencing reads to the identification and comparison of RNA structural features. It includes read processing and alignment, clustering of duplexes, data exploration, folding and comparisons of RNA structures. rnaCrosslinkOO also enables comparisons between conditions, the identification of inter-RNA interactions, and the incorporation of reactivity data to improve structure prediction. Availability and implementation rnaCrosslinkOO is freely available to noncommercial users and implemented in R, with the source code and documentation accessible at https://CRAN.R-project.org/package=rnaCrosslinkOO. The software is supported on Linux, macOS, and Windows platforms.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the current lack of mature data processing and analysis pipelines or software packages for studying complex in - vivo RNA structure data. Specifically, the paper introduces **rnaCrosslinkOO**, a new R - language software package aimed at comprehensively analyzing data generated through RNA cross - linking experiments. These problems include: 1. **Complexity of data processing and analysis**: The data generated by RNA cross - linking experiments is complex and requires a comprehensive analysis process from raw sequencing reads to the identification and comparison of RNA structural features. 2. **Inadequacies of existing tools**: Existing analysis tools and methods have shortcomings in handling RNA cross - linking data, especially in terms of data visualization and ease of use. 3. **Multi - condition comparison and interaction identification**: There is a need to be able to make comparisons under different conditions, identify interactions between RNAs, and combine reactivity data to improve the accuracy of structure prediction. The main functions of **rnaCrosslinkOO** include: - **Read processing and alignment**: Starting from raw sequencing reads, perform processing and alignment. - **Double - strand clustering**: Cluster double - strands to reduce computing time and improve resolution. - **Data exploration**: Provide methods for exploring global interactions, high - abundance RNAs, and RNA - RNA interactions. - **RNA folding and structure comparison**: Support the folding and comparison of RNA structures, including comparisons under different conditions. - **Chemical probe data integration**: Although there is currently no method for simultaneously generating RNA cross - linking and chemical probe data in the same sample, an option for integrating chemical probe data is provided. Through these functions, **rnaCrosslinkOO** aims to provide a comprehensive and easy - to - use tool to help researchers better understand the structure and function of in - vivo RNA, thereby promoting the diagnosis and treatment of diseases.