Literature DB >> 16112670

Molecular evolution and multilocus sequence typing of 145 strains of SARS-CoV.

Zhi-Gang Wang1, Zhi-Hua Zheng, Lei Shang, Lan-Juan Li, Li-Ming Cong, Ming-Guang Feng, Yun Luo, Su-Yun Cheng, Yan-Jun Zhang, Miao-Gui Ru, Zan-Xin Wang, Qi-Yu Bao.   

Abstract

In this study, we have identified 876 polymorphism sites in 145 complete or partial genomes of SARS-CoV available in the NCBI GenBank. One hundred and seventy-four of these sites existed in two or more SARS-CoV genome sequences. According to the sequence polymorphism, all SARS-CoVs can be divided into three groups: (I) group 1, animal-origin viruses (such as SARS-CoV SZ1, SZ3, SZ13 and SZ16); (II) group 2, all viruses with clinical origin during first epidemic; and (III) group 3, SARS-CoV GD03T0013. According to 10 special loci, group 2 again can be divided into genotypes C and T, which can be further divided into sub-genotypes C1-C4 and T1-T4. Positive Darwinian selections were identified between any pair of these three groups. Genotype C gives neutral selection. Genotype T, however, shows negative selection. By comparing the death rates of SARS patients in the different regions, it was found that the death rate caused by the viruses of the genotype C was lower than that of the genotype T. SARS-CoVs might originate from an unknown ancestor.

Entities:  

Mesh:

Year:  2005        PMID: 16112670      PMCID: PMC7118731          DOI: 10.1016/j.febslet.2005.07.075

Source DB:  PubMed          Journal:  FEBS Lett        ISSN: 0014-5793            Impact factor:   4.124


Introduction

In 2002, a new infectious disease was found in the southern part of China which had high mortality and morbidity and was termed “severe acute respiratory syndrome” (SARS). It affected 8096 people and led 774 patients to death around the world, within a very short span of time [1]. Several studies have shown that SARS is caused by a new member of the coronavirus family. The SARS coronavirus (SARS‐CoV) was supposed to be transmitted from wild animals [2, 3, 4]. This hypothesis was then supported by the identification of a SARS‐CoV‐like virus in civet cats, sold in markets in south China. This isolate had more than 99% sequence identity with SARS‐CoV, indicating that the virus was recently transferred from animals to human beings [5]. However, according to a recent report there is no direct evidence that the civet cat virus is the origin of the SARS‐CoV [6]. Although unlikely, the possibility that SARS‐CoV positive animals were infected from humans cannot be formally excluded, and it was indeed reported that SARS‐CoV was transmitted from man to pig [7]. The mechanism how the pathogen broke the barrier between its natural reservoir and man is still unclear [8]. After the first SARS epidemic, there were three incidences of laboratory infections in Singapore, Taiwan and Beijing [9, 10, 11]. Furthermore, in Guangdong, China, SARS patients were observed to suffer from “mild clinical symptoms” [12]. It is therefore important to further explore the characteristics of the SARS‐CoV genome and trace the source of the epidemic. There were several studies on SARS‐CoV phylogeny and genotyping published immediately after SARS was identified. According to 4 polymorphism sites, Ruan et al. [13] first genotyped 14 previously published SARS‐CoV genomes at the beginning of May, 2003. Subsequently, Li et al. [14] proposed genotypes of C and T in 17 SARS‐CoV genomes based on 5 polymorphism sites. Regional and international transmission types were also proposed [15, 16]. SARS‐CoV Su‐10 and CUHK‐W1 were genotyped into two different groups even though they were all isolated in Hongkong [17]. Based on the 7 loci of 44 SARS‐CoV genomes, C and T genotypes were further modified and named as the Yexin and Xiaohong genotypes [18]. According to 5 polymorphism sites, 63 SARS‐CoV genomes were divided into early, middle and late phase genotype groups [19]. The polymorphism sites analyzed in these papers were almost the same. Three successful examples have been reported where infection chains have been traced by comparing variations in SARS‐CoV genomes. The first SARS‐CoV laboratory infection incidence was confirmed by identifying 13 variant loci in the SARS‐CoV genomes [9]. Secondly, it was found that the SARS‐CoV isolated from the patient with “mild clinical symptoms” and the SARS‐CoV (or SARS‐CoV‐like virus) isolated from animals resembled each other in the sequence encoding the spike protein [19]. This study indicates that virus‐carrying animals may be a risk for the investigator. Thirdly, sequence analysis confirmed the laboratory infection of a technician who was working with SARS‐CoV [20]. Up to date, there are already 295 complete or partial SARS‐CoV genome sequences available in GenBank. A new method of genotyping is proposed in this study. The results show that all SARS‐CoV genomes are clustered into three groups and that SARS‐CoV genomes in the second group belong to genotypes C and T. These two genotypes can be further divided into 8 sub‐genotypes. In this study we also analyzed the SARS death rate, SARS‐CoV origin, and population genetics.

Materials and methods

SARS‐CoV genome data

SARS‐CoV ZJ01 (Accession No. AY297028) was isolated in Zhejiang province, China [14]. Other SARS‐CoV genome sequences were downloaded from NCBI GenBank database. Two hundred and ninety‐five complete or partial SARS‐CoV genomes were searched. After removing shorter and repeated records, 101 complete genomes and 44 partial genome sequences were collected for final analysis (by September 20, 2004). The detailed information of sequences of the earliest 20 records in GenBank is shown in Table 1 and the other sequences are listed in Table S1 (Supplementary materials). The newest information about domestic and international SARS fatality rates were downloaded from Health Ministry of China (http://168.160.224.167/) and WHO websites (http://www.who.int/csr/sars/country/Table2004_04_21/en/).
Table 1

The earliest 20 SARS‐CoV genomes available in GenBank

IsolateAccession No.Length of sequence (bp)Date of submissionDate of first releaseInstitute of submission
Tor2 AY274119.3 29 75113‐4‐200314‐4‐2003BCCA Genome Sciences Centre, Canada
BJ01 AY278488.2 29 72517‐4‐200321‐4‐2003 a Academy of Military Medical Sciences, China
BJ02 AY278487.3 29 74517‐4‐200321‐4‐2003 a Academy of Military Medical Sciences, China
BJ03 AY278490.3 29 74017‐4‐200321‐4‐2003 a Academy of Military Medical Sciences, China
Urbani AY278741.1 29 72717‐4‐200321‐4‐2003CDC, Atlanta, USA
HKU‐398849 AY278491.2 29 74217‐4‐200318‐4‐2003University of Hong Kong
CUHK‐W1 AY278554.2 29 73617‐4‐200318‐4‐2003Chinese University of Hong Kong
GD01 AY278489.2 29 75717‐4‐200321‐4‐2003 a Academy of Military Medical Sciences, China
BJ04 AY279354.2 29 73219‐4‐200323‐4‐2003 a Academy of Military Medical Sciences, China
CUHK‐Su10 AY282752.1 29 73624‐4‐20037‐5‐2003Chinese University of Hong Kong
Sin2500 AY283794.1 29 71127‐4‐20039‐5‐2003Genome Institute of Singapore
Sin2677 AY283795.1 29 70527‐4‐20039‐5‐2003Genome Institute of Singapore
Sin2679 AY283796.1 29 71127‐4‐20039‐5‐2003Genome Institute of Singapore
Sin2748 AY283797.1 29 70627‐4‐20039‐5‐2003Genome Institute of Singapore
Sin2774 AY283798.1 29 71127‐4‐20039‐5‐2003Genome Institute of Singapore
TW1 AY291451.1 29 7296‐5‐200314‐5‐2003Taiwan University
Frankfurt1 AY291315.1 29 7276‐5‐200316‐3‐2004 b University of Wuerzburg, Germany
ZJ01 AY297028.1 29 71512‐5‐200319‐5‐2003Zhejiang CDC, China
TWC AY321118.1 29 72511‐6‐200326‐6‐2003Taiwan CDC, China
HSR 1 AY323977.2 29 75116‐6‐200324‐6‐2003Scientific Research Institute, Italy

Partial sequence at the date of first release. BJ01 genome became complete in May, 2003. BJ02, BJ03, BJ04 and GZ01 became complete in June, 2003.

Date of first release not available.

The earliest 20 SARS‐CoV genomes available in GenBank Partial sequence at the date of first release. BJ01 genome became complete in May, 2003. BJ02, BJ03, BJ04 and GZ01 became complete in June, 2003. Date of first release not available.

Analysis platform tools and methods

The analysis platform was PC server based on P4 ultra thread technological CPU and the operating system was Windows XP. ClustalW version 1.83 was used for multiple sequence alignments. TreeView (Win32) version 1.6.6 and MEGA 2.0 were used to draw phylogenetic tree. DNASP 4.0 was used to analyze single nucleoside polymorphism. The SARS‐CoV sequences were arrayed with ClustalW program, and alignment was further manually examined and adjusted. Only those variant sequence loci that were present in at least two independent sequences were selected for further analyses [13, 18, 19]. Deletion and insertion in SARS‐CoV genomes and the neutral mutation in the spike protein gene, etc., were also estimated. The latest version (NC_004718) of the first submitted SARS‐CoV genome sequence (SARS‐CoV TOR2, AY274119) was used as the control reference.

Results

The distribution of single nucleotide polymorphism loci of SARS‐CoV genome

Eight hundred and seventy‐six mutation loci were identified among the 145 complete or partial SARS‐CoV genomes (the over all mutation rate was 2.94%, 876/29 751), of which 174 loci were identified in more than two genome sequences. To avoid errors that might be introduced by sequencing and cell culture passages, only the latter group of 174 loci was analyzed. Different characteristics of the polymorphism sites were found. The loci related to T mutation accounted for 60.9% (106/174), and C, A and G mutation loci accounted for 39.6% (69/174), 39.0% (68/174) and 27.0% (47/174), respectively. It was obvious that T mutation had the priority. Most of the loci, except for 7 sites, were limited to one pair of nucleotide variations. One hundred and twenty loci were transition mutations (78 CT, 42 AG), and 48 loci were transversion mutations. The ratio of transition mutations to transversion mutations was 2.5 (120/48), and 6 loci were of deletion mutation.

The distribution of deleted regions in SARS‐CoV genome

The sequence deletion in SARS‐CoV genome mainly occurred in the regions between sars7a to N protein by losing 12, 29, 30, 31, 32, 39, 49, 57, 82, 137 or 415 nt, mostly losing a 29 nt fragment in sars8b (Table 2 ).
Table 2

The deletion and insertion locations of SARS‐CoV genomes

No. of strainNameLocation of genome
17......20371251972528526156270662716627660277022774427754277812778227806278082784127865278662786728145......29727
1GD69tacc...+10..............29..tgac
1SinP2tacc....6..............29..tgac
4SinP5 …tacc.....2.............29..tgac
1Sino1‐11tacc......3............29..tgac
1PUMC03tacc......3............29..tgac
1TW11tacc.......2...........29..tgac
1TWJtacc........2..........29..tgac
1Sin846tacc.........137.....29..tgac
4LC2 …tacc..........415....tgac
1Sin849tacc...........49.....29.
1GZ‐B...........39.....29..tgac
1GZ‐C...........39...3212.tgac
1Sin852............57...29..tgac
1Sin2677tacc..............6....29..tgac
2WHU …tacc...............2...29..tgac
1Sin2748tacc................5..29..tgac
4ZS‐A …tacc.................82....tgac
1GZ60.................30....tgac
2PUMC02 …tacc...................31...tgac
83Tor2 …tacc....................29..tgac
12GD01 …tacc....................+29..tgac
The deletion and insertion locations of SARS‐CoV genomes

SARS‐CoV genotype and sub‐genotype

Key mutation loci are shown in Table 3 (identified by using nucleotide–nucleotide BLAST program at NEBI and Clustalw 1.83), and the phylogenetic tree of 174 loci (Fig. 1 ) showed that, according to the polymorphism sites of C9404T, C9479T, G17564T, G19838A, A21721G, C22222T, G22517A, G23823T, T27243C and C27827T, the SARS‐CoV genomes of the first epidemic can be divided into two genotypes, genotype C and T. The characteristic of the genotype C was that the genomes had at least one position with the same nucleotide as 10 loci listed above (C:C:G:G:A:C:G:G:T:C). These genomes included GD01, ZS‐C, CUHK‐W1, BJ01, BJ302‐1 and GZ‐C, etc. The characteristic of the genotype T was that the genomes must have all the 10 nucleotides of T:T:T:A:G:T:A:T:C:T. This genotype included GZ‐B, ZJ01, Sin2679, Urbani, TC1, CUHK‐AG01, Sino1‐11, TOR2 and AS, etc. These polymorphism sites were located in 5 different regions of the replicase 1A, 1B, spike protein gene, sars6 and sars8a. Except the 4th and the 7th loci that were synonymous mutations, the others were non‐synonymous mutations with amino acid changes of A/V, A/V, E/D, G/D, T/I, D/Y, P/L and R/C, respectively. It revealed the strongly biased characteristic of changes in genotypes associated with changes in phenotypes. Genotype C can be further divided into 4 sub‐genotypes. Sub‐genotype C1 had, besides the 10 sites mentioned above (Table 3), C3626T, C8559T, T22207C and G22522A loci that would distinguish them from the others. In the same way, sub‐genotype C2 had G3962A, G9448T and T19882C and sub‐genotype C4 had T9854C site. As shown in Fig. 1, the sub‐genotype C3 is a transitional one. Genotype T could also be divided into 4 sub‐genotypes, sub‐genotype T1, T2, T3 and T4. Besides the 10 common polymorphism sites shown above, they had 3, 4, 4 and 3 variant loci, respectively (Table 3). ZJ01, HSR1 and Sin2679, belonging to genotype T and with more polymorphism sites identified in their genomes, located in the midst of the phylogenetic tree (Fig. 1) and were not sub‐genotyped in this paper (Table 3). Only some genomes in both sub‐types C1 and C2 had the 29 nt fragments. SARS‐CoV genomes in sub‐type T3 lacked a region of 415 nt (Fig 1, Table 2).
Table 3

The distribution characteristics of the key mutation loci of SARS‐CoV genomes

Figure 1

Neighbor‐joining tree for the 174 mutation sites of SARS‐CoV genomes. The tree was reconstructed based on the 174 variant sites among the genomes. Bootstrap = 1000. The length indicated number of variant sites. ○: genomes with 29 nt insert. ▵: genomes with 415 nt deletion. C: genotype C; C1–C4: sub‐genotype C1–C4; T: genotype T; T1–T4: sub‐genotype T1–T4.

Neighbor‐joining tree for the 174 mutation sites of SARS‐CoV genomes. The tree was reconstructed based on the 174 variant sites among the genomes. Bootstrap = 1000. The length indicated number of variant sites. ○: genomes with 29 nt insert. ▵: genomes with 415 nt deletion. C: genotype C; C1–C4: sub‐genotype C1–C4; T: genotype T; T1–T4: sub‐genotype T1–T4. The distribution characteristics of the key mutation loci of SARS‐CoV genomes

Spike protein gene and complete genome phylogeny analysis

To verify the genotyping results of the 174 loci shown above, we reconstructed the phylogenetic trees of 122 spike protein genes and 101 complete genomes (Fig. 2A and B ). As can be seen in Fig. 2A, most SARS‐CoV were clustered in accordance with their genotypes. Some partial genome sequences were situated in this tree, such as CUHK‐L2, SZ1, GZ43, HSZ‐A, GD03T0013, BJ302‐1, HKU‐65806 and LY, could not be properly located in the phylogenetic tree based on 174 loci. The genomes of GZ‐C and QXC1 showed intercrossing features between genotype C and T. Some sub‐genotypes, such as C1, C2, T2 and T3, were easily recognized in the spike protein gene tree, but some sub‐genotypes were intercrossed. In the phylogenetic tree of complete genomes, the clusters of genotype C and T and 8 sub‐types are obvious (Fig. 2B). One of the reasons for this might be that the spike protein genes were less informative than the complete genomes.
Figure 2

Neighbor‐joining trees of spike gene and complete SARS‐CoV genomes. The trees were constructed using the Kimura 2‐parameter. Bootstrap = 1000. ○: with 29 nt segments. ♦: T genotype. (A) The tree of 122 spike genes of SARS‐CoV. The length indicated number of nucleotide difference per site of spike gene. (B) The topological tree of 101 complete genomes of SARS‐CoV.

Neighbor‐joining trees of spike gene and complete SARS‐CoV genomes. The trees were constructed using the Kimura 2‐parameter. Bootstrap = 1000. ○: with 29 nt segments. ♦: T genotype. (A) The tree of 122 spike genes of SARS‐CoV. The length indicated number of nucleotide difference per site of spike gene. (B) The topological tree of 101 complete genomes of SARS‐CoV.

The evolutionary relationship among GD03T0013, the animal source viruses and the first epidemic SARS‐CoVs

Fig. 3 shows that the genetic distance of recent SARS‐CoV genome of GD03T0013 was closer to the first epidemic SARS‐CoV GZ02 than to the animal‐origin SARS‐CoV SZ3.
Figure 3

Neighbor‐joining trees of newly occurred, animal‐origin and first epidemic SARS‐CoVs. (A) The tree was constructed using the Kimura 2‐parameter of nucleotide difference. Bootstrap = 1000. The length indicated number of nucleotide difference per site of the spike gene. The number of cluster point indicated bootstrap replication percent. The genetic distance of the newly occurred GD03T0013 was closer to the first epidemic GZ02 than the animal‐origin SZ3. (B) The tree was constructed using the p‐distance of nucleotide difference. Bootstrap = 1000. The number of cluster point indicated bootstrap replication percent. The FIPV‐X06170 (feline infectious peritonitis virus) was used as an out‐group within the spike gene data set. A bootscan recombination analysis of the spike gene revealed high nucleotide identity between the SARS virus and a feline infectious peritonitis virus throughout the gene [21].

Neighbor‐joining trees of newly occurred, animal‐origin and first epidemic SARS‐CoVs. (A) The tree was constructed using the Kimura 2‐parameter of nucleotide difference. Bootstrap = 1000. The length indicated number of nucleotide difference per site of the spike gene. The number of cluster point indicated bootstrap replication percent. The genetic distance of the newly occurred GD03T0013 was closer to the first epidemic GZ02 than the animal‐origin SZ3. (B) The tree was constructed using the p‐distance of nucleotide difference. Bootstrap = 1000. The number of cluster point indicated bootstrap replication percent. The FIPV‐X06170 (feline infectious peritonitis virus) was used as an out‐group within the spike gene data set. A bootscan recombination analysis of the spike gene revealed high nucleotide identity between the SARS virus and a feline infectious peritonitis virus throughout the gene [21].

Polymorphism and diversity analysis of the spike gene among the genotypes and the groups

The genetic characteristics of the genotypes were further analyzed based on the sequence polymorphism of the spike protein genes (Table 4 ). The P a/P s value was 1.047 for the polymorphism sites among the genomes of genotype C that showed neutrality or fine selection. In genotype T, non‐synonymous mutation loci decreased, and the P a/P s value was 0.230 (P < 0.05, with significant differences between the genomes), which showed negative selection in this genotype. The P s% value of synonymous mutation, which was less influenced by the general environment selection, had no significance between the two genotypes. The P s% value was one of the characteristic indices identifying the genetic relationship of the populations, and the P s% value of group 2 (C + T) was approximately half of that of the group 1 (animal source viruses, Table 4). The animal source viruses (SARS‐CoV SZ1, SZ3, SZ13 and SZ16) also showed negative selection among themselves.
Table 4

Polymorphism and diversity analysis of the spike gene among the genotypes and the groups

Genotype or groupNo. of genomesPolymorphism in genotype or groupDiversity
Animal sourceGD03T0013ZS‐B a
No. of genomes with non‐synonymous substitution P a%No. of genomes with synonymous substitution P s% P a/P s P‐value K a% K s% K a/K s K a% K s% K a/K s K a% K s% K a/K s
C43330.17770.1691.0470.4550.6020.4671.2890.6150.4561.3490.2060.2340.880
T73150.028120.1220.2300.0190.6450.5251.2290.6450.5241.2310.2600.2970.875
C + T116460.110190.1510.7280.3010.6290.5031.2500.6340.4991.2710.2400.2740.876
Animal source420.03560.3400.1030.0030.7000.5131.3650.5250.2851.842

ZS‐B was excluded from genotype C. Diversity between genotypes C and T: K a% = 0.160, K s% = 0.169, and K a/K s = 0.947. Diversity between GD03T0013 and ZS‐B: K a% = 0.595, K s% = 0.228, and K a/K s = 2.609.

Polymorphism and diversity analysis of the spike gene among the genotypes and the groups ZS‐B was excluded from genotype C. Diversity between genotypes C and T: K a% = 0.160, K s% = 0.169, and K a/K s = 0.947. Diversity between GD03T0013 and ZS‐B: K a% = 0.595, K s% = 0.228, and K a/K s = 2.609. According to the phylogenetic trees based on the different data sets described above, the genetic diversities between any pair of 3 groups (the animal source viruses, the first epidemic viruses including C and T genotypes, and GD03T0013) were evaluated. The K a/K s values of each two groups were all greater than 1, showing positive Darwinian selections between these groups. On the other hand, the K a/K s values between genotype C and ZS‐B (one of the first epidemic viruses and the closest to the animal origin viruses) or between genotype T and ZS‐B were all less than 1 (1, 2A, Table 4). All analyses described above indicated that the animal source virus and GD03T0013 virus could be classified as two relatively independent populations. 145 SARS‐CoV genome sequences could be classified into three groups, and the group 2 could further be divided into two genotypes and eight sub‐genotypes (Fig. 4 ).
Figure 4

Distribution of SARS‐CoVs in three groups, two genotypes and eight sub‐genotypes.

Distribution of SARS‐CoVs in three groups, two genotypes and eight sub‐genotypes.

Estimated timing of SARS‐CoV origin

Based on the new concept of dividing all the 145 SARS‐CoV genome sequences into three groups, the origin of group 2 and group 3 (assuming group 1 as outer source group) was estimated. Eight candidates including four animal origin genomes of group 1 identified in April to May, 2003, three early genomes of HGZ8L1‐A, ZS‐B and GZ02 in sub‐genotype C1 of group 2 in January to February, 2003, and the genome of GD03T0013 of group 3 in December, 2003, were selected to analyze the date when their most recent common ancestor was existed. Based on analysis of the diversities of spike protein genes, the synonymous mutation rate K s between group 1 and 2 was 0.00321 ± 0.00135, and the synonymous mutation rate K s between group 1 and 3 was 0.00509 ± 0.00199. Assuming that the synonymous mutation rate was constant, a linear regression analysis was evaluated. The linear equation of K s and occurrence time T is K s = 0.000171T + 0.00321 (from January, 2003, with one month intervals), and the date for the most recent common ancestor was estimated to lie in May 2002 (range between early March, 2002 and late July, 2002).

Relation between genotypes and SARS fatality rate

Information about SARS fatality rate and genotypes from all regions are shown in Table 5 . Comparing SARS fatality rates of different regions, it was found that SARS‐CoV genomes of genotype T had the highest case fatality rate.
Table 5

Comparison of the case fatality rates and genotypes of SARS‐CoV in different regions

RegionCumulative number of caseNumber of deathCase fatality ratio (%)C genotypeT genotype
Mainland China532734974211
Beijing25211938145
Guangdong1512584263
Shanghai822521
Zhejiang412501
Hubei711401
Hong Kong175529917312
Singapore2383314024
Taiwan3463711022
Canada251431701
Germany90002
Italy40002
Russian10001
Thailand922201
Other15611700
Total8096774104576
Comparison of the case fatality rates and genotypes of SARS‐CoV in different regions

Discussion

Several studies on SARS‐CoV genome genotyping and grouping have been published [13, 14, 15, 16, 17, 18, 19]. Some presented the regional and international transmission types according to the epidemic regions and some divided the viruses into early, middle, and late phase groups according to the time of the SARS epidemic in a certain region (Guangdong of China). Based on genome characteristics of 145 SARS‐CoVs, a new concept of three groups, two genotypes and eight sub‐types was established. The genotype C of group 2 in this work was almost compatible with the regional transmission type and the early, middle phase groups, while genotype T of group 2 was compatible with the international transmission type and the late phase group as well [16, 19]. One of the advantages of the new classification is that it fits global and regional epidemic feature analysis, such as for SARS‐CoVs in Taiwan and Singapore. Up to date, all the genomes from clinical samples in the early, middle, and late phase of the SARS epidemic in these regions belonged to genotype T. The concept of dividing all SARS‐CoVs into three groups was mainly based on the phylogenetic analysis and the positive Darwinian selection. Because of the limited virus number in group 1 and 3, this grouping has to be further confirmed. The source of SARS‐CoVs was traced. It was clear that genotype T was derived from genotype C [14, 18, 19]. The origins of each sub‐genotype were also elucidated [22, 23, 24] and are consistent with our results. But the origin of the SARS‐CoV remains a question and needs further investigation. Zhao et al. [25] analyzed 16 SARS‐CoV genomes and proposed that the closest common ancestor might have existed during the spring of 2002 which is consistent with the date, May 2002, estimated by the spike gene synonymous mutation rate of the three groups in this work. It is very probable that the three groups originated from an unknown common ancestor. The genotype classification of SARS‐CoV plays an important role in tracing and controlling SARS prevalence. According to genome sequence variation analysis, genotype shifts of SARS‐CoVs in the main epidemic regions during the first epidemic were obvious. The sub‐genotypes C1 and C2, that have a closer relationship to the virus of animal origin, first appeared in Guangdong, and then appeared the sub‐genotype C3. The dominant viruses in HongKong were of genotype T, but genotype C also appeared there. Similarly, although sub‐genotype C4 dominated in Beijing, genotype T also existed there. In Singapore, they had all the genomes of genotypes T, mostly of sub‐genotypes T1 and T2. The viruses in Taiwan were all of genotype T, mostly of sub‐genotype T4. With more SARS‐CoV genome sequences and more complete epidemiological data released, the characteristics of the SARS‐CoV's variations in the genotypes, as well as the features about the epidemic phases, the epidemic regions, the clinical symptoms, and the viruses’ adaptation to the human host, will become clearer and clearer. The genotype was apparently correlated with virulence of the SARS‐CoV. SARS‐CoV GD03T0013 was from the patient with “mild clinical symptoms”, and its genome sequence was similar to that of the sub‐genotype C1. Experiments have also demonstrated that SARS‐CoVs of the genotype T had a stronger ability to cause cyto‐pathogenic effects (CPE) than those of the genotype C. The CPE of the genotype T remained stable during passages in cell cultures [26]. In the first epidemic, SARS patients in Guangzhou seldom suffered from diarrhea, and the SARS‐CoV genomes were mainly of the genotype C [27]. The majority of SARS patients in Hongkong, however, suffered from diarrhea and it turned out that SARS‐CoVs of the genotype T dominated [28]. SARS‐CoVs of different genotypes might have different virulence. Moreover, the serum from mice immunized with inactivated SARS‐CoV BJ01 of genotype C is able to neutralize the invasiveness of SARS‐CoV BJ01 itself into Vero cells [29]. Further investigations are required to demonstrate whether this serum could prevent the cells from the invasiveness of the SARS‐CoV PUMC01 or others of the genotype T. If, in the future, a SARS‐CoV epidemic appears again, after rounds of random mutations, it might cause slight clinical symptoms at the initial stage of the infection. Through positive Darwinian selection, a virus with greater virulence might cause more severe clinical symptoms. A period of neutrality selection and negative selection might follow, and different symptoms and case fatality rates might appear. On the other hand, different epidemic features might occur if there are stable intermediate hosts for SARS‐CoVs of the group 2 (such as SARS‐CoV of genotype T) after the first epidemic, and these viruses might cause epidemics in man again. Table S1. Supplementary table. Click here for additional data file.
  24 in total

1.  Molecular biological analysis of genotyping and phylogeny of severe acute respiratory syndrome associated coronavirus.

Authors:  Zhi-gang Wang; Lan-juan Li; Yun Luo; Jun-yan Zhang; Min-ya Wang; Su-yun Cheng; Yan-jun Zhang; Xiao-meng Wang; Yi-yu Lu; Nan-ping Wu; Ling-ling Mei; Zan-xin Wang
Journal:  Chin Med J (Engl)       Date:  2004-01       Impact factor: 2.628

2.  Inactivated SARS-CoV vaccine prepared from whole virus induces a high level of neutralizing antibodies in BALB/c mice.

Authors:  Lin Tang; Qingyu Zhu; Ede Qin; Man Yu; Zhifen Ding; Huiying Shi; Xiaojie Cheng; Caiping Wang; Guohui Chang; Qiang Zhu; Fang Fang; Haiyan Chang; Shuangli Li; Xumin Zhang; Xishu Chen; Jun Yu; Jian Wang; Ze Chen
Journal:  DNA Cell Biol       Date:  2004-06       Impact factor: 3.311

3.  Molecular phylogeny of coronaviruses including human SARS-CoV.

Authors:  Lei Gao; Ji Qi; Haibin Wei; Yigang Sun; Bailin Hao
Journal:  Chin Sci Bull       Date:  2003

4.  Severe acute respiratory syndrome-associated coronavirus genotype and its characterization.

Authors:  Lanjuan Li; Zhigang Wang; Yiyu Lu; Qiyu Bao; Suhong Chen; Nanping Wu; Suyun Cheng; Jingqing Weng; Yanjun Zhang; Juying Yan; Lingling Mei; Xiaomeng Wang; Hanping Zhu; Yingpu Yu; Minli Zhang; Minhong Li; Jun Yao; Qunying Lu; Pingping Yao; Xiaochen Bo; Jianer Wo; Shengqi Wang; Songnian Hu
Journal:  Chin Med J (Engl)       Date:  2003-09       Impact factor: 2.628

5.  Phylogeny of SARS-CoV as inferred from complete genome comparison.

Authors:  Zhen Qi; Yu Hu; Wei Li; Yanjun Chen; Zhihua Zhang; Shiwei Sun; Hongchao Lu; Jingfen Zhang; Dongbo Bu; Lunjiang Ling; Runsheng Chen
Journal:  Chin Sci Bull       Date:  2003

6.  Coronavirus as a possible cause of severe acute respiratory syndrome.

Authors:  J S M Peiris; S T Lai; L L M Poon; Y Guan; L Y C Yam; W Lim; J Nicholls; W K S Yee; W W Yan; M T Cheung; V C C Cheng; K H Chan; D N C Tsang; R W H Yung; T K Ng; K Y Yuen
Journal:  Lancet       Date:  2003-04-19       Impact factor: 79.321

7.  Molecular evolution and multilocus sequence typing of 145 strains of SARS-CoV.

Authors:  Zhi-Gang Wang; Zhi-Hua Zheng; Lei Shang; Lan-Juan Li; Li-Ming Cong; Ming-Guang Feng; Yun Luo; Su-Yun Cheng; Yan-Jun Zhang; Miao-Gui Ru; Zan-Xin Wang; Qi-Yu Bao
Journal:  FEBS Lett       Date:  2005-09-12       Impact factor: 4.124

8.  SARS-associated coronavirus transmitted from human to pig.

Authors:  Weijun Chen; Minghua Yan; Ling Yang; Boliang Ding; Bo He; Yingzhen Wang; Xiuli Liu; Chenhui Liu; Hui Zhu; Bo You; Shengyong Huang; Jiangguo Zhang; Feng Mu; Zhao Xiang; Xiaoli Feng; Jie Wen; Jianqiu Fang; Jun Yu; Huanming Yang; Jian Wang
Journal:  Emerg Infect Dis       Date:  2005-03       Impact factor: 6.883

9.  Enteric involvement of severe acute respiratory syndrome-associated coronavirus infection.

Authors:  Wai K Leung; Ka-Fai To; Paul K S Chan; Henry L Y Chan; Alan K L Wu; Nelson Lee; Kwok Y Yuen; Joseph J Y Sung
Journal:  Gastroenterology       Date:  2003-10       Impact factor: 22.682

10.  The role of evolution in the emergence of infectious diseases.

Authors:  Rustom Antia; Roland R Regoes; Jacob C Koella; Carl T Bergstrom
Journal:  Nature       Date:  2003-12-11       Impact factor: 49.962

View more
  9 in total

1.  Emerging SARS-CoV-2 Diversity Revealed by Rapid Whole-Genome Sequence Typing.

Authors:  Ahmed M Moustafa; Paul J Planet
Journal:  Genome Biol Evol       Date:  2021-09-01       Impact factor: 4.065

Review 2.  Detection of SARS-CoV-2 B.1.351 (Beta) Variant through Wastewater Surveillance before Case Detection in a Community, Oregon, USA.

Authors:  Melissa Sutton; Tyler S Radniecki; Devrim Kaya; Dana Alegre; Matthew Geniza; Anne-Marie Girard; Katherine Carter; Mark Dasenko; Justin L Sanders; Paul R Cieslak; Christine Kelly; Brett M Tyler
Journal:  Emerg Infect Dis       Date:  2022-04-22       Impact factor: 16.126

3.  The human severe acute respiratory syndrome coronavirus (SARS-CoV) 8b protein is distinct from its counterpart in animal SARS-CoV and down-regulates the expression of the envelope protein in infected cells.

Authors:  Choong-Tat Keng; Yook-Wah Choi; Matthijs R A Welkers; Daphne Z L Chan; Shuo Shen; Seng Gee Lim; Wanjin Hong; Yee-Joo Tan
Journal:  Virology       Date:  2006-07-31       Impact factor: 3.616

4.  Characterizing 56 complete SARS-CoV S-gene sequences from Hong Kong.

Authors:  Julian W Tang; Jo L K Cheung; Ida M T Chu; Margaret Ip; Mamie Hui; Malik Peiris; Paul K S Chan
Journal:  J Clin Virol       Date:  2006-11-16       Impact factor: 3.168

5.  Molecular evolution and multilocus sequence typing of 145 strains of SARS-CoV.

Authors:  Zhi-Gang Wang; Zhi-Hua Zheng; Lei Shang; Lan-Juan Li; Li-Ming Cong; Ming-Guang Feng; Yun Luo; Su-Yun Cheng; Yan-Jun Zhang; Miao-Gui Ru; Zan-Xin Wang; Qi-Yu Bao
Journal:  FEBS Lett       Date:  2005-09-12       Impact factor: 4.124

6.  Evolution of genomes, host shifts and the geographic spread of SARS-CoV and related coronaviruses.

Authors:  Daniel Janies; Farhat Habib; Boyan Alexandrov; Andrew Hill; Diego Pol
Journal:  Cladistics       Date:  2008-02-16       Impact factor: 5.254

7.  Bat SARS-Like WIV1 coronavirus uses the ACE2 of multiple animal species as receptor and evades IFITM3 restriction via TMPRSS2 activation of membrane fusion.

Authors:  Mei Zheng; Xuesen Zhao; Shuangli Zheng; Danying Chen; Pengcheng Du; Xinglin Li; Dong Jiang; Ju-Tao Guo; Hui Zeng; Hanxin Lin
Journal:  Emerg Microbes Infect       Date:  2020-12       Impact factor: 7.163

Review 8.  Phylogenetic perspectives on the epidemiology and origins of SARS and SARS-like coronaviruses.

Authors:  Chi Wai Yip; Chung Chau Hon; Mang Shi; Tommy Tsan-Yuk Lam; Ken Yan-Ching Chow; Fanya Zeng; Frederick Chi-Ching Leung
Journal:  Infect Genet Evol       Date:  2009-09-30       Impact factor: 3.342

9.  The global population of SARS-CoV-2 is composed of six major subtypes.

Authors:  Ivair José Morais; Richard Costa Polveiro; Gabriel Medeiros Souza; Daniel Inserra Bortolin; Flávio Tetsuo Sassaki; Alison Talis Martins Lima
Journal:  Sci Rep       Date:  2020-10-26       Impact factor: 4.379

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.