Classification and characterization of multigene family proteins of African swine fever viruses

Zhaozhong Zhu,Huiting Chen,Li Liu,Yang Cao,Taijiao Jiang,Yuanqiang Zou,Yousong Peng
DOI: https://doi.org/10.1093/bib/bbaa380
IF: 9.5
2020-12-18
Briefings in Bioinformatics
Abstract: African swine fever virus (ASFV) poses serious threats to the pig industry. The multigene family (MGF) proteins are extensively distributed in ASFVs and are generally classified into five families: MGF-100, MGF-110, MGF-300, MGF-360 and MGF-505. Most MGF proteins, however, have not been well characterized or classified within each family. To bridge this gap, this study first classified MGF proteins into 31 groups based on protein sequence homology and network clustering. A web server for classifying MGF proteins was established and is freely available at http://www.computationalbiology.cn/MGF/home.html. Results showed that MGF groups of the same family were most similar to each other and had conserved sequence motifs; the genetic diversity of MGF groups varied widely, mainly due to the occurrence of indels. In addition, the MGF proteins were predicted to have large structural and functional diversity, and MGF proteins of the same MGF family tended to have similar structure, location and function. Reconstruction of the ancestral states of MGF groups along the ASFV phylogeny showed that most MGF groups experienced either copy number variations or gain-or-loss changes, and most of these changes happened within strains of the same genotype. The copy number decrease and the loss of MGF groups were found to be much larger than the copy number increase and the gain of MGF groups, respectively, suggesting that ASFV tended to lose MGF proteins during its evolution. Overall, the work provides a detailed classification of MGF proteins and should facilitate further research on them.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?
The paper addresses the classification and characterization of multigene family (MGF) proteins in African swine fever virus (ASFV). Specifically:

1. **Classification of MGF Proteins**: MGF proteins are widely distributed in ASFVs, yet most have not been well characterized or classified within their families. The researchers classified them into 31 groups based on protein sequence homology and network clustering.
2. **Establishment of a Classification System**: To facilitate the study of MGF proteins, the authors established a web server (http://www.computationalbiology.cn/MGF/home.html) for free online classification of these proteins.
3. **Genetic Diversity Analysis**: The genetic diversity of the MGF groups varied widely, mainly because of insertions and deletions (indels). Groups of the same family were most similar to each other and shared conserved sequence motifs.
4. **Structure and Function Prediction**: MGF proteins were predicted to have large structural and functional diversity, while proteins within the same MGF family tended to have similar structures, locations, and functions.
5. **Evolutionary Analysis**: Reconstructing the ancestral states of MGF groups along the ASFV phylogeny showed that most groups experienced copy number variations or gain-or-loss changes, and that most of these changes happened within strains of the same genotype. Copy number decreases and losses were much larger than increases and gains, suggesting that ASFV tended to lose MGF proteins during its evolution.

In summary, the paper provides a detailed classification and characterization of MGF proteins in ASFV, which lays a foundation for functional studies of these proteins and for understanding how they evolve.
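The grouping step in point 1 can be illustrated with a small sketch. The abstract does not say which clustering algorithm or similarity measure the authors used, so the code below is only a stand-in: it links proteins whose pairwise similarity meets a threshold and reports the connected components of that graph as groups. The function name `cluster_by_similarity`, the protein labels, the scores, and the threshold are all hypothetical.

```python
from collections import defaultdict


def cluster_by_similarity(proteins, pairwise_scores, threshold):
    """Group proteins into connected components of a similarity graph.

    Assumption: an edge joins two proteins when their similarity score
    is >= threshold. This is an illustrative stand-in, not the paper's
    actual clustering method.
    """
    # Union-find: each protein starts as its own group representative.
    parent = {p: p for p in proteins}

    def find(x):
        # Follow parent links to the representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the groups of every sufficiently similar pair.
    for (a, b), score in pairwise_scores.items():
        if score >= threshold:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    # Collect members under their final representative.
    groups = defaultdict(list)
    for p in proteins:
        groups[find(p)].append(p)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical toy data: five proteins and made-up similarity scores.
proteins = ["p1", "p2", "p3", "p4", "p5"]
scores = {
    ("p1", "p2"): 0.92,
    ("p2", "p3"): 0.81,
    ("p3", "p4"): 0.20,
    ("p4", "p5"): 0.88,
}
groups = cluster_by_similarity(proteins, scores, threshold=0.5)
print(groups)  # [['p1', 'p2', 'p3'], ['p4', 'p5']]
```

In this toy run, p1 and p3 land in the same group even though they were never compared directly, because both are similar to p2. Raising the threshold splits groups apart, which is one reason the choice of cutoff matters in any homology-based grouping.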