| Literature DB >> 15629030 |
Shengli Bi1, E'de Qin, Zuyuan Xu, Wei Li, Jing Wang, Yongwu Hu, Yong Liu, Shumin Duan, Jianfei Hu, Yujun Han, Jing Xu, Yan Li, Yao Yi, Yongdong Zhou, Wei Lin, Hong Xu, Ruan Li, Zizhang Zhang, Haiyan Sun, Jingui Zhu, Man Yu, Baochang Fan, Qingfa Wu, Wei Lin, Lin Tang, Baoan Yang, Guoqing Li, Wenming Peng, Wenjie Li, Tao Jiang, Yajun Deng, Bohua Liu, Jianping Shi, Yongqiang Deng, Wei Wei, Hong Liu, Zongzhong Tong, Feng Zhang, Yu Zhang, Cui'e Wang, Yuquan Li, Jia Ye, Yonghua Gan, Jia Ji, Xiaoyu Li, Xiangjun Tian, Fushuang Lu, Gang Tan, Ruifu Yang, Bin Liu, Siqi Liu, Songgang Li, Jun Wang, Jian Wang, Wuchun Cao, Jun Yu, Xiaoping Dong, Huanming Yang.
Abstract
Beijing has been one of the epicenters attacked most severely by the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) since the first patient was diagnosed in one of the city's hospitals. We now report complete genome sequences of the BJ Group, including four isolates (Isolates BJ01, BJ02, BJ03, and BJ04) of the SARS-CoV. It is remarkable that all members of the BJ Group share a common haplotype, consisting of seven loci that differentiate the group from other isolates published to date. Among 42 substitutions uniquely identified from the BJ group, 32 are non-synonymous changes at the amino acid level. Rooted phylogenetic trees, proposed on the basis of haplotypes and other sequence variations of SARS-CoV isolates from Canada, USA, Singapore, and China, gave rise to different paradigms but positioned the BJ Group, together with the newly discovered GD01 (GD-Ins29) in the same clade, followed by the H-U Group (from Hong Kong to USA) and the H-T Group (from Hong Kong to Toronto), leaving the SP Group (Singapore) more distant. This result appears to suggest a possible transmission path from Guangdong to Beijing/Hong Kong, then to other countries and regions.Entities:
Mesh:
Year: 2003 PMID: 15629030 PMCID: PMC5172409 DOI: 10.1016/s1672-0229(03)01023-4
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Fig. 1Accumulated number of probable cases and deaths of SARS in Beijing, China, and the world. Data sources: http://www.moh.gov.cn/zhgl/yqfb/index.htm; http://www.who.int/csr/sars/country/en/.
Samples and Clinical Data for the BJ Group (Isolates BJ01-BJ04)
| Isolate | GenBank accession number | Tissue | Sample | Clinical outcome |
|---|---|---|---|---|
| BJ01 | AY278488 | Lung | Autopsy | Deceased |
| BJ02 | AY278487 | Nose & throat | Swabs (mixed patients infected by BJ01) | Recovered |
| BJ03 | AY278490 | Liver & lymph nodes | Autopsy (same as BJ01) | Deceased |
| BJ04 | AY279354 | Lung | Autopsy | Deceased |
Information Summary of the Complete Genome Sequences of the BJ Group and Other 13 Isolates of SARS-CoV*
| Isolate | Genome size (nt) | Accession number | Modification date |
|---|---|---|---|
| GD01 | 29,757 | AY278489 | 29-May-03 |
| ZJ01 | 29,715 | AY297028.1 | 19-May-03 |
| TW1 | 29,729 | AY291451.1 | 14-May-03 |
| CUHK-W1 | 29,736 | AY278554.2 | 14-May-03 |
| CUHK-Su10 | 29,736 | AY282752.1 | 7-May-03 |
| Urbani | 29,727 | AY278741.1 | 21-Apr-03 |
| HKU-39849 | 29,742 | AY278491.2 | 18-Apr-03 |
| TOR2 | 29,751 | NC_004718.3 | 22-May-03 |
| SIN2500 | 29,711 | AY283794.1 | 9-May-03 |
| SIN2677 | 29,705 | AY283795.1 | 9-May-03 |
| SIN2679 | 29,711 | AY283796.1 | 9-May-03 |
| SIN2748 | 29,706 | AY283797.1 | 9-May-03 |
| SIN2774 | 29,711 | AY283798.1 | 9-May-03 |
Data were retrieved from GenBank: http://www.ncbi.nlm.nih.gov.
The Group-Specific Haplotypes of SARS-CoV
Fig. 2Distribution of the non-synonymous substitutions in the SARS-CoV genome from the known cases. The tall vertical bars represent the substitutions detected from the BJ Group and the low bars denote those from non-BJ Groups. The scale marks the nucleotide positions in reference to BJ01.
Summarized Substitutions Identified in the BJ Group and other 13 Isolates of SARS-CoV
| ORF | Size(nt) | No.of S | Percentage of substitutions (%) | No. of N-Syn | Percentage of N-Syn (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| BJ Group | All | BJ Group | All | BJ Group | All | BJ Group | All | ||
| R | 21,222 | 25 | 92 | 0.12 | 0.43 | 21 | 65 | 84 | 71 |
| S | 3,768 | 9 | 22 | 0.24 | 0.58 | 6 | 13 | 67 | 59 |
| BGI-PUP1 | 825 | 4 | 9 | 0.48 | 1.09 | 2 | 6 | 50 | 67 |
| BGI-PUP2 | 465 | 2 | 5 | 0.43 | 1.08 | 2 | 4 | 100 | 80 |
| E | 231 | 0 | 1 | 0 | 0.43 | 0 | 1 | 100 | |
| M | 666 | 0 | 4 | 0 | 0.60 | 0 | 4 | 100 | |
| BGI-PUP3 | 192 | 0 | 2 | 0 | 1.04 | 0 | 2 | 100 | |
| BGI-PUP5 | 120 | 0 | 1 | 0 | 0.83 | 0 | 1 | 100 | |
| BGI-PUP6 | 255 | 0 | 1 | 0 | 0.39 | 0 | 1 | 100 | |
| N | 1,269 | 1 | 4 | 0.08 | 0.32 | 1 | 3 | 100 | 75 |
| Non-ORF | 1 | 1 | |||||||
| Total | 29,725 | 42 | 142 | 0.13 | 0.46 | 32 | 100 | 76 | 70 |
S and N-Sys stand for synonymous and non-synonymous substitutions, respectively.
A single substitution at the same position in a region overlapping with two ORFs was counted as 2. The total number is 137 when such a substitution event was counted only once so the total number of substitutions contributed by the BJ Group is reduced to 40.
Comparison of the Mutation Rates in SARS-CoV, Influenza Virus, and HIV*
| Virus | ORF | Size (nt) | No. of substitutions | Substitute rate (%) | No. of non-synonymous substitution | Non-synonymous substitute rate (%) | dN/dS | Ka | Ks | Ka/Ks |
|---|---|---|---|---|---|---|---|---|---|---|
| SARS-CoV | R | 21,222 | 92 | 0.43 | 65 | 0.31 | 2.38 | 0.075 | 0.111 | 0.67 |
| S | 3,768 | 22 | 0.58 | 13 | 0.35 | 2.00 | 0.108 | 0.188 | 0.57 | |
| PUP1 | 825 | 9 | 1.09 | 6 | 0.73 | 1.77 | 0.171 | 0.340 | 0.50 | |
| PUP2 | 465 | 5 | 1.08 | 4 | 0.86 | 4.88 | 0.217 | 0.152 | 1.43 | |
| E | 231 | 1 | 0.43 | 1 | 0.43 | 0.093 | 0.000 | |||
| M | 666 | 4 | 0.60 | 4 | 0.60 | 0.126 | 0.000 | |||
| PUP3 | 192 | 2 | 1.04 | 2 | 1.04 | 0.492 | 0.000 | |||
| PUP4 | 369 | 0 | 0.00 | 0 | 0.00 | 0.000 | 0.000 | |||
| N | 1,269 | 4 | 0.32 | 3 | 0.24 | 3.00 | 0.049 | 0.054 | 0.91 | |
| PUP5 | 297 | 0 | 0.00 | 0 | 0.00 | 0.000 | 0.000 | |||
| Total | 29,725 | 137(142) | 0.46 | 98 | 0.33 | 2.52 | 0.085 | 0.119 | 0.72 | |
| Influenza Virus A | HA | 1,701 | 698 | 41.03 | 323 | 18.99 | 0.62 | 64.5 | 382.8 | 0.17 |
| M2 | 294 | 98 | 33.33 | 60 | 20.41 | 0.83 | 178.0 | 815.9 | 0.22 | |
| M1 | 759 | 243 | 32.02 | 79 | 10.41 | 0.24 | 79.9 | 1,112 | 0.07 | |
| NA | 1,413 | 418 | 29.58 | 181 | 12.81 | 0.67 | 12.3 | 69.4 | 0.18 | |
| NP | 1,497 | 573 | 38.28 | 186 | 12.42 | 0.31 | 107.0 | 1,231 | 0.09 | |
| NS | 366 | 128 | 34.97 | 56 | 15.30 | 0.34 | 90.6 | 1,087 | 0.08 | |
| PA | 2,151 | 689 | 32.03 | 181 | 8.41 | 0.20 | 31.4 | 596.4 | 0.05 | |
| PB1 | 2,274 | 851 | 37.42 | 204 | 8.97 | 0.13 | 27.5 | 805.7 | 0.03 | |
| PB2 | 2,280 | 805 | 35.31 | 203 | 8.90 | 0.18 | 36.2 | 719.3 | 0.05 | |
| Total | 13,638 | 4,784(4,803) | 35.08 | 1,473 | 10.80 | 0.27 | 48.1 | 659.1 | 0.07 | |
| HIV-1 | Gag | 1,476 | 979 | 66.33 | 676 | 45.80 | 0.88 | 5,917 | 25,290 | 0.23 |
| Pol | 3,000 | 1,894 | 63.13 | 1,232 | 41.07 | 0.61 | 3,974 | 24,510 | 0.16 | |
| Vif | 579 | 392 | 67.70 | 300 | 51.81 | 1.42 | 7,613 | 21,014 | 0.36 | |
| Vpr | 291 | 202 | 69.42 | 140 | 48.11 | 0.76 | 5,924 | 28,054 | 0.21 | |
| Tat | 306 | 216 | 70.59 | 172 | 56.21 | 2.64 | 11,101 | 16,108 | 0.69 | |
| Rev | 303 | 237 | 78.22 | 189 | 62.38 | 1.70 | 10,107 | 19,080 | 0.53 | |
| Vpu | 249 | 215 | 86.35 | 182 | 73.09 | 2.23 | 14,947 | 25,620 | 0.58 | |
| Env | 2,574 | 1,992 | 77.39 | 1,531 | 59.48 | 1.70 | 10,765 | 23,342 | 0.46 | |
| Nef | 624 | 473 | 75.80 | 370 | 59.29 | 1.12 | 8,632 | 29,346 | 0.29 | |
| Total | 9,680 | 6,536(7,141) | 67.52 | 4,792 | 49.50 | 1.15 | 7,445 | 24,094 | 0.31 | |
SARS-CoV data are from 17 isolates and referred to the notes of Table 2. Influenza Virus data are from 50 strains of HA segment, 100 strains of M segment, 24 strains of NA segment, 88 strains of NP segment, 89 strains of NS segment, 53 strains of PA segment, 58 strains of PB1 segment, and 58 strains of PB2 segment, downloaded from http://www.flu.lanl.gov. HIV data are from 405 strains, downloaded from http://hiv-web.lanl.gov/. Only those minor alleles that are present in at least two sequences were considered as real substitutions.
Fig. 3Two-scale substitution rates of the SARS-CoV in the first and second round of the transmission of the BJ Group. The numbers above the lines that connect each isolate are nonsynonymous substitution counts believed as results of the first round and second round of the transmission. The numbers in the parentheses are synonymous substitution counts between the connected isolates.
Predicted Subregional Changes by the Non-Synonymous Substitutions in the 17 SARS-CoV Genomes
Fig. 4A rooted phylogenetic tree (GD01 as the postulated root) indicates the defined haplotypes and possible transmission path of the SARS-CoV based on the complete genome sequences of 17 SARS-CoV isolates. The neighbor-joining trees are generated by using the program Clustalw 1.81. The sources and abbreviations of the sequences are referred to Table 2.