Evolutionary analyses of intrinsically disordered regions reveal widespread signals of conservation
Marc D. Singleton, Michael B. Eisen
DOI: https://doi.org/10.1371/journal.pcbi.1012028
2024-04-26
PLoS Computational Biology
Abstract: Intrinsically disordered regions (IDRs) are segments of proteins without stable three-dimensional structures. As this flexibility allows them to interact with diverse binding partners, IDRs play key roles in cell signaling and gene expression. Despite the prevalence and importance of IDRs in eukaryotic proteomes and various biological processes, associating them with specific molecular functions remains a significant challenge due to their high rates of sequence evolution. However, by comparing the observed values of various IDR-associated properties against those generated under a simulated model of evolution, a recent study found that most IDRs across the entire yeast proteome contain conserved features. Furthermore, it showed that clusters of IDRs with common "evolutionary signatures," i.e., patterns of conserved features, were associated with specific biological functions. To determine whether similar patterns of conservation are found in the IDRs of other systems, in this work we applied a series of phylogenetic models to over 7,500 orthologous IDRs identified in the Drosophila genome to dissect the forces driving their evolution. By comparing models of unconstrained and constrained continuous trait evolution using the Brownian motion and Ornstein-Uhlenbeck models, respectively, we identified signals of widespread constraint, indicating that conservation of distributed features is a mechanism of IDR evolution common to multiple biological systems. In contrast to the previous study in yeast, however, we observed limited evidence of IDR clusters with specific biological functions, which suggests a more complex relationship between evolutionary constraints and function in the IDRs of multicellular organisms.
Proteins are the molecular machines that carry out, at an atomic level, many of the processes required for life. Though many proteins use fixed structures to perform their functions, proteins with flexible segments are widespread, especially in multicellular organisms. Furthermore, these intrinsically disordered regions (IDRs) are often involved in essential cellular functions. However, the sequences of IDRs evolve quickly, which challenges traditional bioinformatics methods that depend on sequence conservation to predict function. Several studies have demonstrated that it is the distributed biophysical features of IDRs, rather than their exact sequences, that are constrained, and a recent study in yeast found that IDRs with common patterns of conserved features were associated with specific functions. Therefore, in this work we ask whether IDRs in fruit flies, another common laboratory organism, also have patterns of conservation with associated functions. We build on the previous study by integrating its approach into a fully statistical framework based on mathematical models of trait evolution. Though we identify widespread signals of conservation in the IDRs of fruit flies, we find less evidence of a simple relationship between features and function. These methods and results will provide a valuable resource that can guide future experimental analyses of IDRs in fruit flies and other organisms.
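The comparison between Brownian motion (BM) and Ornstein-Uhlenbeck (OU) models can be illustrated with a minimal sketch. It is not the authors' implementation: instead of a full phylogeny it assumes independent pairs of sister lineages, where the difference in a trait is zero-mean Gaussian with variance 2σ²t under BM and (σ²/α)(1 − e^(−2αt)) under OU, so that constraint shows up as divergence that saturates with time. The data values, the grid of α values, and all function names are hypothetical and chosen only for illustration.

```python
import math

def bm_diff_variance(sigma2, t):
    # Under BM each lineage accumulates variance sigma2 * t after the split,
    # so the difference between two sister lineages has variance 2 * sigma2 * t.
    return 2.0 * sigma2 * t

def ou_diff_variance(sigma2, alpha, t):
    # Under OU each lineage's variance saturates at sigma2 / (2 * alpha),
    # so the difference saturates at sigma2 / alpha. expm1 keeps small alpha stable.
    return (sigma2 / alpha) * (-math.expm1(-2.0 * alpha * t))

def profile_fit(diffs, times, scale_fn):
    # Maximum-likelihood sigma2 and log-likelihood when Var(diff_i) = sigma2 * scale_fn(t_i).
    n = len(diffs)
    scales = [scale_fn(t) for t in times]
    sigma2 = sum(d * d / s for d, s in zip(diffs, scales)) / n
    log_lik = -0.5 * (n * (math.log(2.0 * math.pi * sigma2) + 1.0)
                      + sum(math.log(s) for s in scales))
    return sigma2, log_lik

def fit_ou(diffs, times, alphas):
    # Crude grid search over alpha (an assumption; a real analysis would optimize numerically).
    best = None
    for alpha in alphas:
        sigma2, log_lik = profile_fit(
            diffs, times, lambda t: ou_diff_variance(1.0, alpha, t))
        if best is None or log_lik > best[2]:
            best = (alpha, sigma2, log_lik)
    return best

def aic(log_lik, n_params):
    # Akaike information criterion: lower values indicate a better trade-off.
    return 2.0 * n_params - 2.0 * log_lik

# Hypothetical divergence times and trait differences for five sister pairs.
times = [0.1, 0.5, 1.0, 2.0, 4.0]
diffs = [0.4, -0.9, 1.0, -1.1, 0.9]

bm_sigma2, bm_ll = profile_fit(diffs, times, lambda t: bm_diff_variance(1.0, t))
alphas = [0.01 * (1.25 ** k) for k in range(40)]
ou_alpha, ou_sigma2, ou_ll = fit_ou(diffs, times, alphas)

print("BM: sigma2 =", bm_sigma2, "logL =", bm_ll, "AIC =", aic(bm_ll, 1))
print("OU: alpha =", ou_alpha, "sigma2 =", ou_sigma2, "logL =", ou_ll, "AIC =", aic(ou_ll, 2))
```

Because OU approaches BM as α approaches zero, the OU fit can only match or improve the log-likelihood; the AIC penalty for the extra parameter is what decides whether the added constraint is supported. With so few hypothetical data points the comparison is purely illustrative.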
biochemical research methods, mathematical & computational biology