Rapid spread of mutant alleles in worldwide COVID-19 strains revealed by genome-wide SNP analysis

Zhenglin Zhu, Gexin Liu, Kaiwen Meng, Liuqing Yang, Geng Meng
DOI: https://doi.org/10.21203/rs.3.rs-23205/v1
2020-01-01
Abstract: The novel coronavirus (COVID-19) has become a pandemic and is threatening human health globally. Here, we report 14 newly evolved COVID-19 single nucleotide polymorphism (SNP) alleles that underwent a rapid increase (12 cases) or decrease (2 cases) in frequency, by between 10% and 50%, over the last three months. The 14 SNPs are mostly (13/14) located in the coding region and are mainly (9/14) nonsynonymous substitutions. Of the 14 SNPs, 12 showed complete linkage in SNP pairs and clustered into 4 linkage groups, named LG_1 to LG_4. The SNPs at positions 514 and 27046 are independent events. Population genetic analyses show that the increases in the new alleles result from genetic differentiation between Europe and America. We found that the mutants in LG_1 are driven by balancing selection and arose rapidly in Europe but not in America. The mutants in LG_2 and LG_3, also driven by balancing selection, arose rapidly in American but not in European strains. Based on an analysis of geographic COVID-19 case data worldwide, we found that the mutants in LG_1 correlate positively with the fatality rate of COVID-19, while those in LG_2 and LG_3 correlate negatively with it. The correlations are statistically significant, suggesting that virus strains carrying the LG_1 mutants are more aggressive, while those carrying the LG_2 and LG_3 mutants are less so. Further analysis revealed that mutants in LG_1 have been identified more frequently in European strains than in American strains, while mutants in LG_2 and LG_3 have been found more frequently in American strains. This may partially explain the higher fatality rates of COVID-19 infection in Italy, England and France compared with the United States. These findings should be instructive for future epidemiological surveys and disease control of COVID-19.
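The grouping of SNPs by complete pairwise linkage can be illustrated with a minimal sketch. The abstract does not state how linkage was measured, so the sketch assumes the standard r² statistic on alleles coded 0 (reference) or 1 (mutant), treats r² = 1 as complete linkage, and merges linked SNPs with a union-find. The function names, the SNP labels and the toy genotypes are all hypothetical and are not data from the paper.

```python
from itertools import combinations


def allele_frequency(column):
    """Fraction of genomes carrying the mutant allele (coded 1)."""
    return sum(column) / len(column)


def r_squared(a, b):
    """Linkage disequilibrium r^2 between two SNPs coded as 0/1 lists."""
    n = len(a)
    pa, pb = sum(a) / n, sum(b) / n
    pab = sum(x & y for x, y in zip(a, b)) / n  # both mutant alleles together
    d = pab - pa * pb
    denom = pa * (1 - pa) * pb * (1 - pb)
    return d * d / denom if denom > 0 else 0.0


def linkage_groups(snps, tol=1e-9):
    """Group SNPs whose pairwise r^2 is 1 (assumed meaning of complete linkage).

    snps maps a SNP label to its 0/1 allele list across the same genomes.
    """
    parent = {label: label for label in snps}

    def find(x):
        # Follow parent links to the group representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for p, q in combinations(snps, 2):
        if r_squared(snps[p], snps[q]) >= 1 - tol:
            parent[find(p)] = find(q)

    groups = {}
    for label in snps:
        groups.setdefault(find(label), []).append(label)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical toy genotypes: six genomes, three SNPs.
toy = {
    "snp_a": [1, 1, 0, 0, 1, 0],
    "snp_b": [1, 1, 0, 0, 1, 0],  # always co-occurs with snp_a
    "snp_d": [1, 0, 1, 0, 0, 0],  # unrelated pattern
}

print(linkage_groups(toy))
```

In this toy input, snp_a and snp_b fall into one group while snp_d stays on its own, which mirrors the distinction the abstract draws between linkage groups and independent SNPs.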