New algorithm for predicting ribonucleic acid secondary structure including pseudoknots

Hengwu Li,Daming Zhu

2004-01-01

Abstract:A new model to predict RNA secondary structure including pseudoknots and its corresponding dynamic programming algorithm are presented. This algorithm can compute random planar pseudoknots and one non-planar pseudoknots, and requires O(n5) time complexity and O(n4) space complexity.

What problem does this paper attempt to address?

A New Pseudoknots Folding Algorithm for RNA Structure Prediction.

Hengwu Li,Daming Zhu

DOI: https://doi.org/10.1007/11533719_12

2005-01-01

Abstract:A new dynamic programming algorithm with On4 time and On3 space is presented to predict the RNA secondary structure including nested pseudoknots and a subclass of crossed pseudoknots. Compared with the Jens algorithm of On4 time and On2 space, this algorithm can predict more complex pseudoknots. Compared with the Rivas algorithm of On6 time and On4 space, this algorithm has the same power for the planar pseudoknots prediction.
A dynamic programming algorithm for RNA structure prediction including pseudoknots

Elena Rivas,Sean R. Eddy

DOI: https://doi.org/10.48550/arXiv.physics/9807048

1998-07-28

Abstract:We describe a dynamic programming algorithm for predicting optimal RNA secondary structure, including pseudoknots. The algorithm has a worst case complexity of ${\cal O}(N^6)$ in time and ${\cal O}(N^4)$ in storage. The description of the algorithm is complex, which led us to adopt a useful graphical representation (Feynman diagrams) borrowed from quantum field theory. We present an implementation of the algorithm that generates the optimal minimum energy structure for a single RNA sequence, using standard RNA folding thermodynamic parameters augmented by a few parameters describing the thermodynamic stability of pseudoknots. We demonstrate the properties of the algorithm by using it to predict structures for several small pseudoknotted and non-pseudoknotted RNAs. Although the time and memory demands of the algorithm are steep, we believe this is the first algorithm to be able to fold optimal (minimum energy) pseudoknotted RNAs with the accepted RNA thermodynamic model.

Biological Physics,Quantitative Biology
A Hopfield Neural Network Based Algorithm for RNA Secondary Structure Prediction

Qi Liu,Xiuzi Ye,Yin Zhang

DOI: https://doi.org/10.1109/IMSCCS.2006.9

2006-01-01

Abstract:In this paper a Hopfield neural network (HNN) based parallel algorithm is presented for predicting the secondary structure of ribonucleic acids (RNA). The HNN here is used to find the near-maximum independent set of an adjacent graph made of RNA base pairs and then compute the stable secondary structure of RNA. We modified the motion equation proposed in paper to reflect more biological essence of RNA secondary structure in which the ther mo dynamic parameters of base pair is used in our algorithm to control the variation rate of inhibitory and encouragement terms in the equation. Comparisons with the algorithm presented in paper and other two classical prediction methods (Zuker 's and Nussinov 's) show that our method is more sensitive and specific. In addition, our algorithm can be very efficient and be applied to sequences up to several thousands of base long with more degree of parallelism
New Heuristic Algorithm of RNA Secondary Structure Prediction with Pseudoknots

Zhendong Liu,Daming Zhu

DOI: https://doi.org/10.1109/cis.2011.32

2013-01-01

Journal of Computers

Abstract:Based on the relative stability of the n-stems in RNA molecules, Minimum free energy method is adopted widely to predict RNA secondary structure, a heuristic algorithm is presented to predict RNA pseudoknotted structure, the algorithm takes O(n3) time and O(n2) space. This algorithm not only reduces the time complexity to O(n3), but also widens the maximum length of the sequence. The preliminary experimental test on the RNA sub-sequences in PseudoBase confirm that the algorithm outperforms other known algorithms in predicting accuracy, sensitivity and specificity.
Design and Implementation of an Algorithm for the RNA Secondary Structure Prediction

LI Heng-wu,ZHU Da-ming,JI Xiu-hua

2006-01-01

Abstract:A computational model and dynamic programming algorithm is presented to predict the RNA secondary structure. A combinatorial strategy of subsequences and the intrinsic characteristics of RNA secondary structure are adopted to compute the structures of planar multi-pseudoknots and a non-planar pseudoknot. Compared with the Rivas algorithm, 2n4 space is subtracted and the time complexity is reduced from O(n6) to O(n5). The experiments demonstrate that the proposed algorithm is effective.
Prediction for RNA Planar Pseudoknots

Hengwu Li,Daming Zhu,Zhendong Liu,Hong Li

DOI: https://doi.org/10.1080/10002007088537465

2007-01-01

Abstract:Based on m-stems and semi-extensible structure, a model is presented to represent RNA planar pseudoknots, and corresponding dynamic programming algorithm is designed and implemented to predict arbitrary planar pseudoknots and simple non-planar pseudoknots with O (n(4)) time and O (n(3)) space. The algorithm folds total 245 sequences in the Pseudobase database, and the test results indicate that the algorithm has good accuracy, sensitivity and specificity.
An Approximation Scheme for RNA Folding Structure Prediction Including Pseudoknots

Zhendong Liu,Daming Zhu,Wei Cui,Nan Liu

DOI: https://doi.org/10.1109/CIS.2013.9

2013-01-01

Abstract:The paper further investigates the computational problem and complexity of predicting Ribonucleic Acid structure. In order to find a way to optimize the Ribonucleic Acid pseudoknotted structure, we investigate the Ribonucleic Acid pseudoknotted structure based on thermal dynamic model, computational methods, minimum free energy are adopted to predict Ribonucleic Acid structure. The contribution of this paper is to obtain an efficient Approximation algorithm for finding RNA pseudoknotted structure, compared with other algorithms, the algorithm takes O(n3) time and O(n2) space. The experimental test in PseudoBase shows that the algorithm is more effective and exact than other algorithms, and the algorithm can predict arbitrary pseudoknots. And we also give a proof of existing 1+e (e0) Polynomial Time Approximation Scheme(PTAS) in Searching Maximum Number of Stackings.
Predicting Model And Algorithm In Rna Folding Structure Including Pseudoknots

Zhendong Liu,Daming Zhu,Qionghai Dai

DOI: https://doi.org/10.1142/S0218001418510059

IF: 1.261

2018-01-01

International Journal of Pattern Recognition and Artificial Intelligence

Abstract:The prediction of RNA structure with pseudoknots is a nondeterministic polynomial-time hard (NP-hard) problem; according to minimum free energy models and computational methods, we investigate the RNA-pseudoknotted structure. Our paper presents an efficient algorithm for predicting RNA structure with pseudoknots, and the algorithm takes O(n(3)) time and O(n(2)) space, the experimental tests in Rfam10.1 and PseudoBase indicate that the algorithm is more effective and precise. The predicting accuracy, the time complexity and space complexity outperform existing algorithms, such as Maximum Weight Matching (MWM) algorithm, PKNOTS algorithm and Inner Limiting Layer (ILM) algorithm, and the algorithm can predict arbitrary pseudoknots. And there exists a 1 + epsilon (epsilon > 0) polynomial time approximation scheme in searching maximum number of stackings, and we give the proof of the approximation scheme in RNA-pseudoknotted structure. We have improved several types of pseudoknots considered in RNA folding structure, and analyze their possible transitions between types of pseudoknots.
Improved Predicting Algorithm of RNA Pseudoknotted Structure

Zhendong Liu,Daming Zhu,Qionghai Dai

DOI: https://doi.org/10.1504/ijcse.2019.099641

2016-01-01

International Journal of Computational Science and Engineering

Abstract:The prediction of RNA structure with pseudoknots is NP-hard problem. According to minimum free energy models and computational methods, we investigate the RNA pseudoknotted structures and their characteristics. The paper presents an efficient algorithm for predicting RNA structures with pseudoknots, and the algorithm runs in O ( n 3 ) time and O ( n 2 ) space. The experimental tests in Rfam10.1 and PseudoBase indicate that the algorithm is more effective and precise, and the algorithm can predict arbitrary pseudoknots. And through our research, we can draw that there exists an 1 + ε ( ε > 0) polynomial time approximation scheme in searching maximum number of stackings, and we give the proof of the approximation scheme in RNA pseudoknotted structures.
Predicting Algorithm of RNA Folding Structure with Pseudoknots.

Liu Zhendong,Zhu Daming,Dai Qionghai

DOI: https://doi.org/10.1109/CIS.2015.17

2015-01-01

Abstract:It is NP-hard problem of predicting RNA structure with pseudoknots. According to minimum free energy models and computational methods, we investigate the RNA pseudoknotted structure. The paper present an efficient algorithm for predicting RNA structure with pseudoknots, and the algorithm takes O(n(3)) time and O(n(2)) space. The experimental tests in Rfam10.1 and PseudoBase indicate that the algorithm is more effective and precise, the algorithm can predict arbitrary pseudoknots. And the paper present a 2-approximation algorithm by analyzing the RNA folding structure, there exists 1+ t (t>0) polynomial time approximation scheme in searching maximum number of stackings, and we give the proof of the approximation scheme in RNA structure.
Predicting scheme of RNA folding structure including pseudoknots.

Zhendong Liu,Daming Zhu,Hongwei Ma

DOI: https://doi.org/10.1504/IJSNET.2014.067096

2014-01-01

International Journal of Sensor Networks

Abstract:The problem of predicting ribonucleic acid (RNA) structure with pseudoknots makes it NP-hard. To find optimal RNA pseudoknotted structure, we investigate the RNA pseudoknotted structure based on computational methods and models with minimum free energy (MFE). The contribution of this paper is to obtain an efficient algorithm for predicting RNA pseudoknotted structure with pseudoknots, and the algorithm takes O(n³) time and O(n²) space. The experimental test in PseudoBase and Rfam10.1 shows that the proposed algorithm is more effective and precise than other compared algorithms, and can predict arbitrary pseudoknots. Furthermore, we prove that there exists 1 + ε (ε > 0) polynomial time approximation scheme (PTAS) in searching maximum number of stackings. We also present a 2-approximation algorithm and analyse the approximation algorithm.
[Predicting RNA Secondary Structures Including Pseudoknots by Covariance with Stacking and Minimum Free Energy].

Jinwei Yang,Zhigang Luo,Xiaoyong Fang,Jinhua Wang,Kecheng Tang

DOI: https://doi.org/10.3321/j.issn:1000-3061.2008.04.022

2008-01-01

Abstract:Prediction of RNA secondary structures including pseudoknots is a difficult topic in RNA field. Current predicting methods usually have relatively low accuracy and high complexity. Considering that the stacking of adjacent base pairs is a common feature of RNA secondary structure, here we present a method for predicting pseudoknots based on covariance with stacking and minimum free energy. A new score scheme, which combined stacked covariance with free energy, was used to assess the evaluation of base pair in our method. Based on this score scheme, we utilized an iterative procedure to compute the optimized RNA secondary structure with minimum score approximately. In each interaction, helix of high covariance and low free energy was selected until the sequences didn't form helix, so two crossing helixes which were selected from different iterations could form a pseudoknot. We test our method on data sets of ClustalW alignments and structural alignments downloaded from RNA databases. Experimental results show that our method can correctly predict the major portion of pseudoknots. Our method has both higher average sensitivity and specificity than the reference algorithms, and performs much better for structural alignments than for ClustalW alignments. Finally, we discuss the influence on the performance by the factor of covariance weight, and conclude that the best performance is achieved when lambda1 : lambda2 = 5 : 1.
A predicting algorithm of RNA secondary structure based on stems

Zhendong Liu,Hengwu Li,Daming Zhu

DOI: https://doi.org/10.1108/03684921011046825

IF: 2.352

2013-01-01

Kybernetes

Abstract:Purpose - The purpose of this paper is to design an algorithm to predict RNA secondary structure, compared with other relevant algorithm, its time complexity and space complexity are reduced. Design/methodology/approach - The dynamic programming algorithm need more time and space; it is very difficult to predict the RNA secondary structure which have more 1,000 bases. The nested RNA secondary structure algorithms cannot predict the RNA secondary structure containing pseudoknots, so the fast algorithm is needed to predict the RNA secondary structure containing pseudoknots urgently. Based on the greedy principle, a model is designed to solve the problem. Findings - A greedy algorithm is presented to predict RNA secondary structure. Research limitations/implications The problem for predicting RNA secondary structure including pseudoknots is NP-complete. Practical implications - The paper presents a valuable and useful method for predicting the RNA secondary structure. Originality/value - The new algorithm needs O(n(3)) time and O(n) space; the experimental results indicate that the algorithm has good accuracy and sensitivity.
Approximation Scheme for Rna Structure Prediction Based on Base Pair Stacking

Hengwu Li,Daming Zhu,Zhenzhong Xu,Huijian Han

2007-01-01

Abstract:Pseudoknotted RNA secondary structure prediction is an important problem in computational biology. Existing polynomial time algorithms have no performance guarantee or can handle only limited types of pseudoknots. In this paper for the general problem of pseudoknotted RNA secondary structure prediction, a polynomial time approximation scheme is presented to predict pseudoknotted RNA secondary structure by dynamic programming and branch bound based on base pair stacking. Compared with existing polynomial time algorithm, it has exact approximation performance and can predict arbitrary pseudoknots.
A Helix-based Minimum Free Energy Algorithm for RNA Secondary Structure Prediction

夏培明,张岩

DOI: https://doi.org/10.3969/j.issn.1008-0570.2009.09.058

2009-01-01

Abstract:Predicting of RNA secondary structure is an important content in computational biology. Based on minimum free energy principle,a new prediction algorithm-helix-based dynamic programming is presented. The time complexity is O (n3) while the space complexity is O(n2). It has a good result in predicting RNA secondary structure including pseudoknots.
Improved Approximation Algorithm Of Rna Structure Prediction With Pseudoknots

Zhendong Liu,Daming Zhu

DOI: https://doi.org/10.1109/ICInfA.2012.6246911

2012-01-01

Abstract:Based on MFE principle and the relative stability of the n-stems in RNA molecules, Minimum free energy method is adopted widely to predict RNA secondary structure, an improved approximation algorithm is presented to predict RNA pseudoknotted structure, the algorithm can solve arbitrary nested or parallel pseudoknots the algorithm takes O(n(3)) time and O(n(2)) space. This algorithm not only reduces the time complexity to O(n(3)), but also widens the maximum length of the sequence. The preliminary experimental test on the RNA sub-sequences in PseudoBase confirm that the algorithm outperforms other known algorithms in predicting accuracy, sensitivity and specificity.
A Heuristic Algorithm of RNA Pseudoknotted Structure Prediction Based on Stem

Zhendong Liu,Daming Zhu

DOI: https://doi.org/10.1115/1.859971.paper135

2012-01-01

Abstract:Minimum free energy method is adopted widely to predict RNA secondary structure. Based on the relative stability of the stems in RNA molecules, a heuristic algorithm is presented to predict RNA pseudo-knotted structure, the algorithm takes O(n3) time and O(n2) space, this algorithm not only reduces the time complexity to O(n3), but also widens the maximum length of the sequence. The preliminary experimental test on the RNA subsequences in PseudoBase confirm that the algorithm outperforms other known algorithms in predicting accuracy, time complexity and space complexity.
Accurate prediction of RNA secondary structure including pseudoknots through solving minimum-cost flow with learned potentials

Tiansu Gong,Fusong Ju,Dongbo Bu

DOI: https://doi.org/10.1038/s42003-024-05952-w

IF: 6.548

2024-03-09

Communications Biology

Abstract:Abstract Pseudoknots are key structure motifs of RNA and pseudoknotted RNAs play important roles in a variety of biological processes. Here, we present KnotFold, an accurate approach to the prediction of RNA secondary structure including pseudoknots. The key elements of KnotFold include a learned potential function and a minimum-cost flow algorithm to find the secondary structure with the lowest potential. KnotFold learns the potential from the RNAs with known structures using an attention-based neural network, thus avoiding the inaccuracy of hand-crafted energy functions. The specially designed minimum-cost flow algorithm used by KnotFold considers all possible combinations of base pairs and selects from them the optimal combination. The algorithm breaks the restriction of nested base pairs required by the widely used dynamic programming algorithms, thus enabling the identification of pseudoknots. Using 1,009 pseudoknotted RNAs as representatives, we demonstrate the successful application of KnotFold in predicting RNA secondary structures including pseudoknots with accuracy higher than the state-of-the-art approaches. We anticipate that KnotFold, with its superior accuracy, will greatly facilitate the understanding of RNA structures and functionalities.

biology
Characteristics and prediction of RNA structure.

Hengwu Li,Daming Zhu,Caiming Zhang,Huijian Han,Keith A Crandall

DOI: https://doi.org/10.1155/2014/690340

2014-01-01

BioMed Research International

Abstract:RNA secondary structures with pseudoknots are often predicted by minimizing free energy, which is NP-hard. Most RNAs fold during transcription from DNA into RNA through a hierarchical pathway wherein secondary structures form prior to tertiary structures. Real RNA secondary structures often have local instead of global optimization because of kinetic reasons. The performance of RNA structure prediction may be improved by considering dynamic and hierarchical folding mechanisms. This study is a novel report on RNA folding that accords with the golden mean characteristic based on the statistical analysis of the real RNA secondary structures of all 480 sequences from RNA STRAND, which are validated by NMR or X-ray. The length ratios of domains in these sequences are approximately 0.382L, 0.5L, 0.618L, and L, where L is the sequence length. These points are just the important golden sections of sequence. With this characteristic, an algorithm is designed to predict RNA hierarchical structures and simulate RNA folding by dynamically folding RNA structures according to the above golden section points. The sensitivity and number of predicted pseudoknots of our algorithm are better than those of the Mfold, HotKnots, McQfold, ProbKnot, and Lhw-Zhu algorithms. Experimental results reflect the folding rules of RNA from a new angle that is close to natural folding.
DMfold: A Novel Method to Predict RNA Secondary Structure with Pseudoknots Based on Deep Learning and Improved Base Pair Maximization Principle

Linyu Wang,Yuanning Liu,Xiaodan Zhong,Haiming Liu,Chao Lu,Cong Li,Hao Zhang

DOI: https://doi.org/10.3389/fgene.2019.00143

IF: 3.7

2019-01-01

Frontiers in Genetics

Abstract:While predicting the secondary structure of RNA is vital for researching its function, determining RNA secondary structure is challenging, especially for that with pseudoknots. Typically, several excellent computational methods can be utilized to predict the secondary structure (with or without pseudoknots), but they have their own merits and demerits. These methods can be classified into two categories: the multi-sequence method and the single-sequence method. The main advantage of the multi-sequence method lies in its use of the auxiliary sequences to assist in predicting the secondary structure, but it can only successfully predict in the presence of multiple highly homologous sequences. The single-sequence method is associated with the major merit of easy operation (only need the target sequence to predict secondary structure), but its folding parameters are the common features of diversity RNA, which cannot describe the unique characteristics of RNA, thus potentially resulting in the low prediction accuracy in some RNA. In this paper, ‘DMfold’, a method based on the Deep Learning and Improved Base Pair Maximization Principle, is proposed to predict the secondary structure with pseudoknots, which fully absorbs the advantages and avoids some disadvantages of those two methods. Notably, DMfold could predict the secondary structure of RNA by learning similar RNA in the known structures, which uses the similar RNA sequences instead of the highly homogeneous sequences in the multi-sequence method, thereby reducing the requirement for auxiliary sequences. In DMfold, it only needs to input the target sequence to predict the secondary structure. Its folding parameters are fully extracted automatically by deep learning, which could avoid the lack of folding parameters in the single-sequence method. Experiments show that our method is not only simple to operate, but also improves the prediction accuracy compared to multiple excellent prediction methods. A repository containing our code can be found at https://github.com/linyuwangPHD/RNA-Secondary-Structure-Database.

New algorithm for predicting ribonucleic acid secondary structure including pseudoknots

A New Pseudoknots Folding Algorithm for RNA Structure Prediction.

A dynamic programming algorithm for RNA structure prediction including pseudoknots

A Hopfield Neural Network Based Algorithm for RNA Secondary Structure Prediction

New Heuristic Algorithm of RNA Secondary Structure Prediction with Pseudoknots

Design and Implementation of an Algorithm for the RNA Secondary Structure Prediction

Prediction for RNA Planar Pseudoknots

An Approximation Scheme for RNA Folding Structure Prediction Including Pseudoknots

Predicting Model And Algorithm In Rna Folding Structure Including Pseudoknots

Improved Predicting Algorithm of RNA Pseudoknotted Structure

Predicting Algorithm of RNA Folding Structure with Pseudoknots.

Predicting scheme of RNA folding structure including pseudoknots.

[Predicting RNA Secondary Structures Including Pseudoknots by Covariance with Stacking and Minimum Free Energy].

A predicting algorithm of RNA secondary structure based on stems

Approximation Scheme for Rna Structure Prediction Based on Base Pair Stacking

A Helix-based Minimum Free Energy Algorithm for RNA Secondary Structure Prediction

Improved Approximation Algorithm Of Rna Structure Prediction With Pseudoknots

A Heuristic Algorithm of RNA Pseudoknotted Structure Prediction Based on Stem

Accurate prediction of RNA secondary structure including pseudoknots through solving minimum-cost flow with learned potentials

Characteristics and prediction of RNA structure.

DMfold: A Novel Method to Predict RNA Secondary Structure with Pseudoknots Based on Deep Learning and Improved Base Pair Maximization Principle