Deep Learning Methods for RNA Torsion Angle Prediction

Xiujuan Ou,Yi Xiao,
DOI: https://doi.org/10.7498/aps.72.20231069
IF: 0.906
2023-01-01
Acta Physica Sinica
Abstract:Modeling RNA tertiary structures is one of the basic problems in molecular biophysics, which is crucial to understand RNA biological functions and design new structures. RNA tertiary structures are mainly determined by seven torsions of main-chain and side-chain backbone, the accurate prediction of these torsion angles is the basis of modeling RNA tertiary structures. At present, there are only a few methods using deep learning to predict RNA torsion angles, and the prediction accuracy needs to be further improved if it is used to model RNA tertiary structures. In this study, we also developed a deep learning method 1DRNA to predict RNA backbone torsions and pseudotorsion angles, including two different deep learning models, the convolution model (DRCNN) that considers the features of adjacent nucleotides and the Hyper-long-short-term memory model (DHLSTM) that considers the features of chain all the nucleotides. We then empirically show that DRCNN and DHLSTM outperforms existing state-of-the-art methods on the same datasets, the prediction accuracy of DRCNN model is improved by 5% to 28% for βδζχηθ angles, and the prediction accuracy of DHLSTM model is improved by 6% to 15% for βδζχηθ angles. The DRCNN model predicted better results than the DHLSTM model and the existing models in the δζχηθ angles, and the DHLSTM model predicted better results than the DRCNN model and the existing model in the βε angles, and the existing models predicted better results than the DRCNN model and DHLSTM model in the αγ angles. DRCNN and the existing models predicted a richer distribution of angles than the DHLSTM model. In terms of model stability, the DHLSTM model is much more stable than the DRCNN model and the existing models, with fewer outliers. The results also show that the αγ angles are the most difficult to predict, the angles of the ring region is more difficult to predict than the angles of the helix region, the model is also not sensitive to the change of the target sequence length, and the deviation of the model prediction angles from the decoys can also be used to evaluate the RNA tertiary structures quality.
physics, multidisciplinary
What problem does this paper attempt to address?