On a barrier height problem for RNA branching

Christine Heitsch, Chi N. Y. Huynh, Greg Johnston
2023-03-22
Abstract: The branching of an RNA molecule is an important structural characteristic yet difficult to predict correctly, especially for longer sequences. Using plane trees as a combinatorial model for RNA folding, we consider the thermodynamic cost, known as the barrier height, of transitioning between branching configurations. Using branching skew as a coarse energy approximation, we characterize various types of paths in the discrete configuration landscape. In particular, we give sufficient conditions for a path to have both minimal length and minimal branching skew. The proofs offer some biological insights, notably the potential importance of both hairpin stability and domain architecture to higher resolution RNA barrier height analyses.
Biomolecules, Combinatorics
What problem does this paper attempt to address?
The paper studies the thermodynamic cost of rearranging the branching structure of an RNA molecule, a structural feature that is hard to predict correctly, especially for longer sequences. The authors use plane trees as a combinatorial model for RNA folding and study the cost of transitioning from one branching configuration to another, known as the "barrier height." Using "branching skew" as a coarse energy approximation, they characterize several types of paths in the discrete configuration landscape and give sufficient conditions for a path to have both minimal length and minimal branching skew.

The setting is RNA secondary structure, that is, a set of non-crossing, canonical base pairs. A given RNA sequence admits many possible secondary structures, but the most biologically relevant ones typically have low free energy under the nearest neighbor thermodynamic model (NNTM). The barrier height problem asks for the thermodynamic cost of moving from one low-energy configuration to another. Whereas previous studies focused on adding or removing single base pairs, this paper takes a complementary approach and considers larger structural rearrangements.

Plane trees are rooted trees whose subtrees are linearly ordered, and they are commonly used to represent the arrangement of helices and loops in an RNA secondary structure. The authors associate helices with edges and loops with vertices, with the external loop as the unique root vertex. By focusing on the overall arrangement of edges/helices and vertices/loops, the mathematical results offer insights for designing RNA sequences with specific branching structures and help in understanding RNA prediction accuracy.
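The helix-to-edge, loop-to-vertex correspondence described above can be illustrated with a short sketch. This is not the authors' code. It assumes the secondary structure is given in dot-bracket notation (an input format not mentioned in the source), and the names `Loop`, `pair_table`, `plane_tree`, and `shape` are hypothetical. It also makes a simplifying assumption: only directly stacked base pairs are merged into one helix, so bulges and internal loops remain as vertices with a single child, which the paper may treat differently.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Loop:
    """A vertex of the plane tree: one loop of the secondary structure."""
    children: list[Loop] = field(default_factory=list)  # ordered 5' to 3'


def pair_table(dot_bracket: str) -> list[int]:
    """Return pairs[i] = index of the partner of base i, or -1 if unpaired."""
    pairs = [-1] * len(dot_bracket)
    open_positions = []
    for i, ch in enumerate(dot_bracket):
        if ch == "(":
            open_positions.append(i)
        elif ch == ")":
            if not open_positions:
                raise ValueError("unbalanced structure")
            j = open_positions.pop()
            pairs[i], pairs[j] = j, i
    if open_positions:
        raise ValueError("unbalanced structure")
    return pairs


def _fill(vertex: Loop, start: int, end: int, pairs: list[int]) -> None:
    """Attach one child (edge = helix) per helix opening in [start, end)."""
    k = start
    while k < end:
        if pairs[k] > k:
            a, b = k, pairs[k]
            # Walk inward while the next pair is directly stacked (assumption).
            while a + 1 < b - 1 and pairs[a + 1] == b - 1:
                a, b = a + 1, b - 1
            child = Loop()
            vertex.children.append(child)
            _fill(child, a + 1, b, pairs)  # the loop closed by this helix
            k = pairs[k] + 1               # skip past the whole helix
        else:
            k += 1


def plane_tree(dot_bracket: str) -> Loop:
    """Build the plane tree; the returned root is the external loop."""
    root = Loop()
    _fill(root, 0, len(dot_bracket), pair_table(dot_bracket))
    return root


def shape(vertex: Loop) -> str:
    """Encode the tree below `vertex` as balanced parentheses, one pair per edge."""
    return "".join("(" + shape(child) + ")" for child in vertex.children)
```

For example, `shape(plane_tree("((((...))..((...))))"))` gives `"(()())"`: one helix leaves the external loop and leads to a multiloop with two hairpin branches, so the tree has three edges.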
The paper further extends the theoretical branching analysis by considering folding paths between plane trees. By defining "pairing exchange" operations, it identifies several types of transition paths that are amenable to combinatorial analysis in a model based on branching skew. The proofs offer some biological insights, notably the potential importance of hairpin stability and domain architecture to higher-resolution analyses of RNA barrier heights. Overall, the paper aims to deepen the understanding of RNA folding through combinatorial methods, with a particular focus on the thermodynamic cost of transitions between branching configurations.