Structural comparison of homologous protein-RNA interfaces reveals widespread overall conservation contrasted with versatility in polar contacts
Ikram Mahmoudi, Chloé Quignot, Carla Martins, Jessica Andreani
DOI: https://doi.org/10.1371/journal.pcbi.1012650
2024-12-04
PLoS Computational Biology
Abstract: Protein-RNA interactions play a critical role in many cellular processes and pathologies. However, experimental determination of protein-RNA structures remains challenging, so computational tools are needed to predict protein-RNA interfaces. Although evolutionary pressures can be exploited for structural prediction of protein-protein interfaces, and recent deep learning methods using protein multiple sequence alignments have radically improved the performance of protein-protein interface structural prediction, protein-RNA structural prediction is lagging behind because of the scarcity of structural data and the flexibility involved in these complexes. To study the evolution of protein-RNA interface structures, we first identified a large and diverse dataset of 2,022 pairs of structurally homologous interfaces (termed structural interologs). We leveraged this unique dataset to analyze the conservation of interface contacts among structural interologs based on the properties of the amino acids and nucleotides involved. We found that 73% of distance-based contacts and 68% of apolar contacts are conserved on average, and that this strong conservation holds even in distant homologs with sequence identity below 20%. Distance-based contacts are also much more conserved than what we found in a previous study of homologous protein-protein interfaces. In contrast, hydrogen bonds, salt bridges, and π-stacking interactions are very versatile in pairs of protein-RNA interologs, even for close homologs with high interface sequence identity. We found that almost half of the non-conserved distance-based contacts are linked to a small proportion of interface residues that no longer make interface contacts in the interolog, a phenomenon we term "interface switching out".
We also examined possible recovery mechanisms for non-conserved hydrogen bonds and salt bridges, uncovering diverse scenarios: switching out, changes in amino acid chemical nature, and intermolecular and intramolecular compensation. Our findings provide insights for integrating evolutionary signals into predictive protein-RNA structural modeling methods.
Author summary: Protein-RNA interactions are crucial to many biological functions and can play a role in diseases. We adopted a computational strategy to analyze and compare experimental 3D structures of protein-RNA interfaces. We first built a diverse dataset of 2,022 pairs of structurally similar protein-RNA interfaces, called structural interologs, and confirmed the existence of an evolutionary relationship in most of these interface pairs. We analyzed spatially close amino acid-nucleotide pairs across the interface, revealing that they are most often similar between interologs, even when the interfaces have strongly diverged. However, polar contacts such as hydrogen bonds are most often distributed differently between interologs, even in closely related interfaces. This finding highlights that spatial arrangement is more conserved than sequence in protein-RNA interactions and suggests principles guiding the evolution of these molecular associations. Our study has important implications for predicting protein-RNA interactions, both by providing useful rules for transferring contacts from a template with known structure to an interface of interest, and by paving the way for applying machine-learning techniques to integrate these patterns of contact conservation. This holds the promise of accelerating the identification of potential therapeutic targets and improving our molecular understanding of disease mechanisms mediated by protein-RNA interactions.
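To make the idea of contact conservation between two interologs concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' pipeline: the function name `conserved_fraction`, the residue labels, the toy contact sets, and the choice to normalize by the number of contacts in the first interface are all hypothetical. It assumes contacts have already been detected (for example with a distance cutoff) and that a residue-to-residue and nucleotide-to-nucleotide correspondence between the two interfaces is available from a structural alignment.

```python
from typing import Dict, Set, Tuple

# A contact is an (amino-acid id, nucleotide id) pair; ids are arbitrary labels.
Contact = Tuple[str, str]


def conserved_fraction(
    contacts_a: Set[Contact],
    contacts_b: Set[Contact],
    aa_map: Dict[str, str],
    nt_map: Dict[str, str],
) -> float:
    """Fraction of contacts in interface A whose aligned counterpart
    is also a contact in interface B (assumed normalization: size of A)."""
    if not contacts_a:
        return 0.0
    conserved = 0
    for aa, nt in contacts_a:
        # Translate both partners into interface B numbering, if aligned.
        mapped = (aa_map.get(aa), nt_map.get(nt))
        if None not in mapped and mapped in contacts_b:
            conserved += 1
    return conserved / len(contacts_a)


# Toy data (hypothetical): four contacts in A, three in B.
contacts_a = {("K12", "G5"), ("R45", "A6"), ("F50", "U7"), ("S60", "C8")}
contacts_b = {("K14", "G3"), ("R47", "A4"), ("Y52", "U5")}
# S60 has no aligned residue in B, so its contact cannot be conserved.
aa_map = {"K12": "K14", "R45": "R47", "F50": "Y52"}
nt_map = {"G5": "G3", "A6": "A4", "U7": "U5", "C8": "C6"}

fraction = conserved_fraction(contacts_a, contacts_b, aa_map, nt_map)
print(f"Conserved contacts: {fraction:.0%}")
```

In this toy case, the contact made by the unaligned residue `S60` is the only one lost. A residue whose contacts all disappear in the other interface loosely mirrors the "interface switching out" scenario described in the abstract, although the study's actual criteria may differ.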
biochemical research methods, mathematical & computational biology