Discovering SARS-CoV-2 Genes and Mutations Adapted for Humans in 2594 Genomes
Weitao Sun
DOI: https://doi.org/10.1109/bibm52615.2021.9669705
2021-01-01
Abstract:Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a positive-sense single-stranded virus approximately 30 kb in length, is the cause of the ongoing global life-threatening novel coronavirus disease-2019 (COVID-19) outbreak. Studies confirmed significant genome differences between SARS-CoV-2 and SARS-CoV, suggesting that the distinctions in pathogenicity and virulence might be related to genomic diversity. However, the relationship between genomic differences and SARS-CoV-2 fitness has not been fully explained, especially for open reading frame (ORF)-encoded accessory proteins. RNA viruses have a high mutation rate, but how SARS-CoV-2 mutations accelerate host adaptation is not clear. This study shows that the host-genome similarity (HGS) of SARS-CoV-2 is significantly higher than that of SARS-CoV, especially in the ORF6 and ORFS genes that encode proteins antagonizing innate immunity in vivo. A power law relationship was discovered between the HGS of ORF3b, ORF6, and N and the expression of interferon (IFN)-sensitive response element (ISRE)-containing promoters. This finding implies that the increase in HGS in the SARS-CoV-2 genome may further inhibit FN I synthesis and cause delayed host innate immunity. An ORF1ab mutation, 1081SG>T, which occurred in virus populations with high HGS but rarely in low-HGS populations, was identified in 2594 genomes with geolocations of China, the USA and Europe. The genomic mutation caused the amino acid mutation M37F in the transmembrane protein nsp6. The results suggest that the ORF6 and ORFS genes and the residue mutation M37F may play important roles in SARS-CoV-2 adaptation to humans. However, the underlying basis by which the mutations mediate adaptation to humans is still unknown. The findings demonstrate that HGS analysis is a reliable way to identify important genes and mutations in adaptive strains, which may help in the search for potential targets for pharmaceutical agents.
What problem does this paper attempt to address?