BEAN and HABAS: Polyphyletic insertions in RNAP that point to deep-time evolutionary divergence of bacteria

Claudia Alvarez-Carreño, Angela T. Huynh, Anton S. Petrov, Christine Orengo, Loren Dean Williams
DOI: https://doi.org/10.1101/2024.04.02.587612
2024-05-06
Abstract: The β and β’ subunits of the RNA polymerase (RNAP) are large proteins with complex multi-domain architectures that include several insertional domains. Here, we analyze the multi-domain organizations of bacterial RNAP-β and RNAP-β’ using sequence, experimentally determined structures and AlphaFold structure predictions. We observe that bacterial lineage-specific domains in RNAP-β belong to a group of domains that we call BEAN (Broadly Embedded ANnex) and that in RNAP-β’, bacterial lineage-specific domains are HAmmerhead/BArrel-Sandwich Hybrid (HABAS) domains. The BEAN domain has a characteristic three-dimensional structure composed of two square bracket-like elements that are antiparallel relative to each other. The HABAS domain contains a four-stranded open β-sheet with a GD-box-like motif in one of the β-strands and the adjoining loop. The BEAN domain is identified not only in the bacterial RNAP-β’, but also in the archaeal version of universal ribosomal protein L10. The HABAS domain is observed as an insertional domain in several metabolic proteins. The phylogenetic distributions of bacterial lineage-specific insertional domains of β and β’ subunits of RNAP follow the Tree of Life. The presence of insertional domains can help establish a relative timeline of events in the evolution of a protein because insertion is inferred to post-date the base domain. We discuss mechanisms that might account for the discovery of homologous insertional domains in non-equivalent locations in bacteria and archaea.
Evolutionary Biology