Bio-Chemoinformatics-Driven Analysis of nsp7 and nsp8 Mutations and Their Effects on Viral Replication Protein Complex Stability

Bryan John J. Subong,Takeaki Ozawa
DOI: https://doi.org/10.3390/cimb46030165
2024-03-18
Current Issues in Molecular Biology
Abstract:The nonstructural proteins 7 and 8 (nsp7 and nsp8) of SARS-CoV-2 are highly important proteins involved in the RNA-dependent polymerase (RdRp) protein replication complex. In this study, we analyzed the global mutation of nsp7 and nsp8 in 2022 and 2023 and analyzed the effects of mutation on the viral replication protein complex using bio-chemoinformatics. Frequently occurring variants are found to be single amino acid mutations for both nsp7 and nsp8. The most frequently occurring mutations for nsp7 which include L56F, L71F, S25L, M3I, D77N, V33I and T83I are predicted to cause destabilizing effects, whereas those in nsp8 are predicted to cause stabilizing effects, with the threonine to isoleucine mutation (T89I, T145I, T123I, T148I, T187I) being a frequent mutation. A conserved domain database analysis generated critical interaction residues for nsp7 (Lys-7, His-36 and Asn-37) and nsp8 (Lys-58, Pro-183 and Arg-190), which, according to thermodynamic calculations, are prone to destabilization. Trp-29, Phe-49 of nsp7 and Trp-154, Tyr-135 and Phe-15 of nsp8 cause greater destabilizing effects to the protein complex based on a computational alanine scan suggesting them as possible new target sites. This study provides an intensive analysis of the mutations of nsp7 and nsp8 and their possible implications for viral complex stability.
biochemistry & molecular biology
What problem does this paper attempt to address?
This paper aims to explore the mutation situations of non - structural protein 7 (nsp7) and non - structural protein 8 (nsp8) in the SARS - CoV - 2 virus and their impacts on the stability of the viral replication protein complex. Specifically, through a combination of bioinformatics and chemoinformatics methods, the researchers analyzed the mutation situations in the global nsp7 and nsp8 protein sequence data in 2022 and 2023, and evaluated the impacts of these mutations on the stability of the viral replication protein complex. ### Research Background SARS - CoV - 2 is the pathogen causing the COVID - 19 pandemic. Its viral replication complex mainly consists of three non - structural proteins: nsp12 (RNA - dependent RNA polymerase), nsp7 and nsp8. These proteins play a crucial role in the process of viral RNA synthesis. Therefore, understanding their structures and functions is of great significance for the development of antiviral drugs. ### Research Methods 1. **Sequence Mining and Alignment**: - Obtain the protein sequence data of SARS - CoV - 2 human isolates in 2022 and 2023 from the NCBI database. - Use Geneious Prime software for sequence alignment to identify variant sequences. 2. **Conserved Domain Analysis**: - Use the NCBI Conserved Domain Database (CDD) to analyze the conserved amino acid residues of nsp7 and nsp8. 3. **Protein Structure Modeling and Stability Analysis**: - Use the Robetta server to construct a model of the viral replication protein complex containing nsp12, nsp7 and nsp8. - Predict the impact of mutations on the stability of the protein complex by calculating the change in Gibbs free energy after mutation (\(\Delta\Delta G\)). ### Main Findings 1. **Mutation Frequency**: - The data in 2022 and 2023 show that most of the sequences of nsp7 and nsp8 are still wild - type, but there are a small number of variations. - Single - amino - acid mutations are the most common mutation types. 2. **Impact of Mutations on Stability**: - For nsp7, the most common mutations such as L56F and L71F lead to the instability of the protein complex (\(\Delta\Delta G < 0\)), while S26F and M75I lead to stability (\(\Delta\Delta G > 0\)). - For nsp8, most mutations (such as T89I, T145I, T148I) lead to the stability of the protein complex, but the P133S mutation leads to instability. 3. **Impact of Mutations of Key Amino Acid Residues**: - Through CDD analysis, the key amino acid residues of nsp7 are determined as Lys - 7, His - 36 and Asn - 37, and the key amino acid residues of nsp8 are determined as Lys - 58, Pro - 183 and Arg - 190. - Mutations of these key residues mostly lead to the instability of the protein complex. ### Conclusion This study provides a detailed analysis of nsp7 and nsp8 mutations and their impacts on the stability of the viral replication protein complex. These findings are helpful for understanding the evolutionary process of viral proteins and provide potential targets for the development of antiviral drugs against these proteins. In particular, the key amino acid residues identified in the study may become the focus of future drug design.