Abstract:BACKGROUNDSARS-CoV is the causative agent of severe acute respiratory syndrome (SARS) which has been associated with outbreaks of SARS in Guangdong, Hong Kong and Beijing of China, and other regions worldwide. SARS-CoV from human has shown some variations but its origin is still unknown. The genotyping and phylogeny of SARS-CoV were analyzed and reported in this paper.METHODSFull or partial genomes of 44 SARS-CoV strains were collected from GenBank. The genotype, single nucleotide polymorphism and phylogeny of these SARS-CoV strains were analyzed by molecular biological, bioinformatic and epidemiological methods.RESULTSThere were 188 point mutations in the 33 virus full genomes with the counts of mutation mounting to 297. Further analysis was carried out among 36 of 188 loci with more than two times of mutation. All the 36 mutation loci occurred in coding sequences and 22 loci were non-synonymous. The gene mutation rates of replicase 1AB, S2 domain of spike glycoprotein and nucleocapsid protein were lower (0.079% - 0.103%). There were 4 mutation loci in S1 domain of spike glycoprotein. The gene mutation rate of ORF10 was the highest (3.333%) with 4 mutation loci in this small domain (120 bp) and 3 of 4 loci related to deletion mutation. By bioinformatics processing and analysis, the nucleotides at 7 loci of genome (T:T:A:G:T:C:T/C:G:G:A:C:T:C) can classify SARS-CoV into two types. Therefore a novel definition is put forward that according to these 7 loci of mutation, 40 strains of SARS-CoV in GenBank can be grouped into two genotypes, T:T:A:G:T:C:T and C:G:G:A:C:T:C, and named as SARS-CoV Yexin genotype and Xiaohong genotype. The two genotypes can be further divided into some sub-genotypes. These genotypes can also be approved by phylogenetic tree of three levels of 44 loci of mutation, spike glycoprotein gene and complete genome sequence. Compared to various strains among SARS-CoV Yexin genotype and Xiaohong genotype, GD01 strain of Yexin genotype is more closely related to SARS-CoV like-virus from animals.CONCLUSIONThe results mentioned above suggest that SARS-CoV is responding to host immunological pressures and experiencing variation which provide clues, information and evidence of molecular biology for the clinical pathology, vaccine developing and epidemic investigation.

Relationship of SARS-CoV to Other Pathogenic RNA Viruses Explored by Tetranucleotide Usage Profiling

A Complete Sequence and Comparative Analysis of a SARS-associated Virus (isolate BJ01).

Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales

Molecular Biological Analysis of Genotyping and Phylogeny of Severe Acute Respiratory Syndrome Associated Coronavirus.

Molecular Evolution and Multilocus Sequence Typing of 145 Strains of Sars-Cov

Analysis of synonymous codon usage patterns in torque teno sus virus 1 (TTSuV1)

Characterization of the Substitution Hotspots in SARS-CoV-2 Genome Using BioAider and Detection of a SR-rich Region in N Protein Providing Further Evidence of Its Animal Origin

[The Genome Comparison of SARS-CoV and Other Coronaviruses].

A Preliminary Phylogenetic Analysis of 14 Coding Sequences from SARS Virus and Other Coronaviruses

Genomic Feature Analysis of Betacoronavirus Provides Insights Into SARS and COVID-19 Pandemics

Analysis of the genetic diversity in RNA-directed RNA polymerase sequences: implications for an automated RNA virus classification system

Testing the hypothesis of a recombinant origin of the SARS-associated coronavirus

Evolution and Variation of the SARS-CoV Genome

On the verge of life: Distribution of nucleotide sequences in viral RNAs

RNA Barcode Segments for SARS-CoV-2 Identification from HCoVs and SARSr-CoV-2 Lineages

Site Discrepancy of Synonymous Codon Usage in SARS Coronavirus and Other Viruses in Coronaviridae

CpG Usage in RNA Viruses: Data and Hypotheses.

Nucleotide and dinucleotide preference of segmented viruses are shaped more by segment: In case study of tomato spotted wilt virus

Structural Genomic Analysis of SARS-CoV-2 and Other Coronaviruses

Codon usage bias analysis of the spike protein of human coronavirus 229E and its host adaptability

Evolution of codon usage in 2019-new coronavirus causing human infection