The protein structurome of Orthornavirae and its dark matter

Pascal Mutz,Antoni Pedro Camargo,Harutyun Sahakyan,Uri Neri,Anamarija Butkovic,Yuri I. Wolf,Mart Krupovic,Valerian V Dolja,Eugene V. Koonin
DOI: https://doi.org/10.1101/2024.09.25.615016
2024-09-26
Abstract:Metatranscriptomics is uncovering more and more diverse families of viruses with RNA genomes comprising the viral kingdom Orthornavirae in the realm Riboviria. Thorough protein annotation and comparison are essential to get insights into the functions of viral proteins and virus evolution. In addition to sequence- and hmm profile based methods, protein structure comparison adds a powerful tool to uncover protein functions and relationships. We constructed an Orthornavirae structurome consisting of already annotated as well as unannotated (dark matter) proteins and domains encoded in viral genomes. We used protein structure modeling and similarity searches to illuminate the remaining dark matter in hundreds of thousands of orthornavirus genomes. The vast majority of the dark matter domains showed either generic folds, such as single α-helices, or no high confidence structure predictions. Nevertheless, a variety of lineage-specific globular domains that were new either to orthornaviruses in general or to particular virus families were identified within the proteomic dark matter of orthornaviruses, including several predicted nucleic acid-binding domains and nucleases. In addition, we identified a case of exaptation of a cellular nucleoside monophosphate kinase as an RNA-binding protein in several virus families. Notwithstanding the continuing discovery of numerous orthornaviruses, it appears that all the protein domains conserved in large groups of viruses have already been identified. The rest of the viral proteome seems to be dominated by poorly structured domains including intrinsically disordered ones that likely mediate specific virus-host interactions.
Evolutionary Biology
What problem does this paper attempt to address?