TorRNA - Improved Prediction of Backbone Torsion Angles of RNA by Leveraging Large Language Models

Sriram Devata, Deva Priyakumar U.
DOI: https://doi.org/10.26434/chemrxiv-2024-cj4r0
2024-05-31
Abstract: RNA molecules play a significant role in many biological pathways and have diverse functional roles, a result of their structural flexibility to fold into diverse conformations. This same flexibility makes it challenging to obtain the structures of RNAs experimentally. Deep learning can be used to predict the secondary structures of RNA and other properties, such as the backbone torsion angles, which can then serve as restraints for the computational optimization of RNA tertiary structures. TorRNA is a transformer encoder-decoder model that takes an input RNA sequence and predicts the (pseudo)torsion angles of each nucleotide, with a pre-trained RNA-FM model as the encoder. TorRNA achieves a performance boost of 2% to 16% over the previous (pseudo)torsion angle prediction method for RNAs. We also demonstrate that TorRNA can be used as a tool for model quality assessment of candidate RNA structures.
Chemistry
What problem does this paper attempt to address?
The paper addresses the prediction of backbone (pseudo)torsion angles of RNA from sequence. These angles are a structural property that, alongside secondary structure, can be used as restraints when computationally optimizing an RNA's three-dimensional structure. Because RNA structures are difficult to determine experimentally, the authors propose a deep learning model called TorRNA, which uses a pre-trained RNA foundation model (RNA-FM) as the encoder and a transformer decoder to predict the (pseudo)torsion angles of each nucleotide. TorRNA improves on the previous (pseudo)torsion angle prediction method, SPOT-RNA-1D, by 2% to 16%, and can also be used to assess the quality of candidate RNA structure models. To train and test TorRNA, the authors construct a new dataset that includes RNA structures recently deposited in the PDB. The results show that TorRNA outperforms or performs comparably to SPOT-RNA-1D in predicting torsion angles across various structural regions, and that its performance is consistent across RNA sequences of different lengths. Furthermore, by comparing prediction errors with structural accuracy metrics, the paper demonstrates that the deviation between TorRNA's predicted angles and the angles of a candidate structure can serve as an effective proxy for the quality of that structure.
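The quality-assessment idea above can be illustrated with a minimal Python sketch. It assumes the score is a mean absolute angular difference in degrees, wrapped so that 170° and -170° count as 20° apart. This summary does not state the paper's exact metric, so that choice is an assumption, and the function names `angular_error`, `mean_angular_error`, and `rank_candidates` are hypothetical and not part of TorRNA.

```python
def angular_error(pred_deg, ref_deg):
    """Smallest absolute difference between two angles, in degrees (0 to 180)."""
    diff = abs(pred_deg - ref_deg) % 360.0
    return min(diff, 360.0 - diff)


def mean_angular_error(pred, ref):
    """Mean angular error over paired angles.

    Positions where the reference angle is None (for example, an angle
    that is undefined at a chain terminus) are skipped.
    """
    errors = [angular_error(p, r) for p, r in zip(pred, ref) if r is not None]
    if not errors:
        raise ValueError("no comparable angles")
    return sum(errors) / len(errors)


def rank_candidates(predicted, candidates):
    """Rank candidate structures by how closely their angles match the prediction.

    `predicted` is a list of per-nucleotide angles from the predictor;
    `candidates` maps a candidate name to its list of measured angles.
    Returns (name, score) pairs, lowest (best-agreeing) score first.
    """
    scores = {
        name: mean_angular_error(predicted, angles)
        for name, angles in candidates.items()
    }
    return sorted(scores.items(), key=lambda item: item[1])
```

Under this assumption, a candidate structure whose angles deviate less from the predicted angles ranks higher; whether that ranking tracks structural accuracy is the empirical claim the paper evaluates.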