Compositional diversity and evolutionary pattern of coronavirus accessory proteins

Jingzhe Shang,Na Han,Ziyi Chen,Yousong Peng,Liang Li,Hangyu Zhou,Chengyang Ji,Jing Meng,Taijiao Jiang,Aiping Wu
DOI: https://doi.org/10.1093/bib/bbaa262
IF: 9.5
2020-10-30
Briefings in Bioinformatics
Abstract:Abstract Accessory proteins play important roles in the interaction between coronaviruses and their hosts. Accordingly, a comprehensive study of the compositional diversity and evolutionary patterns of accessory proteins is critical to understanding the host adaptation and epidemic variation of coronaviruses. Here, we developed a standardized genome annotation tool for coronavirus (CoroAnnoter) by combining open reading frame prediction, transcription regulatory sequence recognition and homologous alignment. Using CoroAnnoter, we annotated 39 representative coronavirus strains to form a compositional profile for all of the accessary proteins. Large variations were observed in the number of accessory proteins of 1–10 for different coronaviruses, with SARS-CoV-2 and SARS-CoV having the most (9 and 10, respectively). The variation between SARS-CoV and SARS-CoV-2 accessory proteins could be traced back to related coronaviruses in other hosts. The genomic distribution of accessory proteins had significant intra-genus conservation and inter-genus diversity and could be grouped into 1, 4, 2 and 1 types for alpha-, beta-, gamma-, and delta-coronaviruses, respectively. Evolutionary analysis suggested that accessory proteins are more conservative locating before the N-terminal of proteins E and M (E-M), while they are more diverse after these proteins. Furthermore, comparison of virus-host interaction networks of SARS-CoV-2 and SARS-CoV accessory proteins showed that they share multiple antiviral signaling pathways, those involved in the apoptotic process, viral life cycle and response to oxidative stress. In summary, our study provides a tool for coronavirus genome annotation and builds a comprehensive profile for coronavirus accessory proteins covering their composition, classification, evolutionary pattern and host interaction.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on the compositional diversity of coronavirus accessory proteins and their evolutionary patterns. Specifically, the paper aims to comprehensively study the compositional characteristics, classification, evolutionary patterns, and interactions with hosts of coronavirus accessory proteins by developing a standardized coronavirus genome annotation tool (CoroAnnoter). These problems are of great significance for understanding how coronaviruses adapt to different hosts and their epidemic variations. The specific objectives of the paper include: 1. **Develop a standardized genome annotation tool**: Combine open reading frame prediction, transcriptional regulatory sequence identification, and homology alignment methods to develop a semi - automated coronavirus genome annotation tool, CoroAnnoter. 2. **Annotate the genomes of representative coronavirus strains**: Use CoroAnnoter to annotate 39 representative coronavirus strains to form the compositional profiles of these viruses' accessory proteins. 3. **Analyze the compositional diversity of accessory proteins**: Observe the variation in the number of accessory proteins in different coronaviruses (1 - 10), especially that SARS - CoV - 2 and SARS - CoV have the largest number of accessory proteins (9 and 10 respectively). 4. **Explore the evolutionary patterns of accessory proteins**: Analyze the distribution patterns of accessory proteins in the genome and find the conservation within the same genus and the diversity between different genera. 5. **Study the functions of accessory proteins**: Compare the interaction networks between SARS - CoV - 2 and SARS - CoV accessory proteins and host proteins, and find that they share multiple antiviral signaling pathways, involving apoptotic processes, virus life cycles, and oxidative stress responses. Through these studies, the paper provides a tool for coronavirus genome annotation and establishes a comprehensive profile of coronavirus accessory proteins, covering information on their composition, classification, evolutionary patterns, and host interactions. This provides an important basis for in - depth understanding of the biological characteristics of coronaviruses and potential therapeutic targets.