| Literature DB >> 15626340 |
E'de Qin1, Xionglei He, Wei Tian, Yong Liu, Wei Li, Jie Wen, Jingqiang Wang, Baochang Fan, Qingfa Wu, Guohui Chang, Wuchun Cao, Zuyuan Xu, Ruifu Yang, Jing Wang, Man Yu, Yan Li, Jing Xu, Bingyin Si, Yongwu Hu, Wenming Peng, Lin Tang, Tao Jiang, Jianping Shi, Jia Ji, Yu Zhang, Jia Ye, Cui'e Wang, Yujun Han, Jun Zhou, Yajun Deng, Xiaoyu Li, Jianfei Hu, Caiping Wang, Chunxia Yan, Qingrun Zhang, Jingyue Bao, Guoqing Li, Weijun Chen, Lin Fang, Changfeng Li, Meng Lei, Dawei Li, Wei Tong, Xiangjun Tian, Jin Wang, Bo Zhang, Haiqing Zhang, Yilin Zhang, Hui Zhao, Xiaowei Zhang, Shuangli Li, Xiaojie Cheng, Xiuqing Zhang, Bin Liu, Changqing Zeng, Songgang Li, Xuehai Tan, Siqi Liu, Wei Dong, Jun Wang, Gane Ka-Shu Wong, Jun Yu, Jian Wang, Qingyu Zhu, Huanming Yang.
Abstract
We report a complete genomic sequence of rare isolates (minor genotype) of the SARS-CoV from SARS patients in Guangdong, China, where the first few cases emerged. The most striking discovery from the isolate is an extra 29-nucleotide sequence located at the nucleotide positions between 27,863 and 27,864 (referred to the complete sequence of BJ01) within an overlapped region composed of BGI-PUP5 (BGI-postulated uncharacterized protein 5) and BGI-PUP6 upstream of the N (nucleocapsid) protein. The discovery of this minor genotype, GD-Ins29, suggests a significant genetic event and differentiates it from the previously reported genotype, the dominant form among all sequenced SARS-CoV isolates. A 17-nt segment of this extra sequence is identical to a segment of the same size in two human mRNA sequences that may interfere with viral replication and transcription in the cytosol of the infected cells. It provides a new avenue for the exploration of the virus-host interaction in viral evolution, host pathogenesis, and vaccine development.Entities:
Mesh:
Year: 2003 PMID: 15626340 PMCID: PMC5172408 DOI: 10.1016/s1672-0229(03)01014-3
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
The Complete Genome Sequences of 17 Isolates of SARS-CoV
| Isolate | Genome size (nt) | Accession number | Modification date |
|---|---|---|---|
| SIN2500 | 29,711 | AY283794.1 | 9-May-03 |
| SIN2677 | 29,705 | AY283795.1 | 9-May-03 |
| SIN2679 | 29,711 | AY283796.1 | 9-May-03 |
| SIN2748 | 29,706 | AY283797.1 | 9-May-03 |
| SIN2774 | 29,711 | AY283798.1 | 9-May-03 |
| TOR2 | 29,751 | NC_004718.3 | 22-May-03 |
| Urbani | 29,727 | AY278741.1 | 21-Apr-03 |
| CUHK-W1 | 29,736 | AY278554.2 | 14-May-03 |
| CUHK-Su10 | 29,736 | AY282752.1 | 7-May-03 |
| HKU-39849 | 29,742 | AY278491.2 | 18-Apr-03 |
| TW1 | 29,729 | AY291451.1 | 14-May-03 |
| ZJ01 | 29,715 | AY297028.1 | 19-May-03 |
| BJ01 | 29,725 | AY278488.2 | 1-May-03 |
| BJ02 | 29,740 | AY278487 | 29-May-03 |
| BJ03 | 29,738 | AY278490 | 29-May-03 |
| BJ04 | 29,732 | AY279354 | 29-May-03 |
| GD01 | 29,757 | AY278489 | 29-May-03 |
The Predicted ORFs in the GD01 SARS-CoV Genome
| ORF | Position | Size (a.a.) | TRS position | TRS sequence |
|---|---|---|---|---|
| R | 246 – 13,379 | 7,073 | 107 | A G U A UAAAC - AA UAA UAAA U U U U A |
| S | 21,473 - 25,240 | 1,255 | 21,463 | CAA CUAAACGAAC |
| BGI-PUP1 | 25,249 - 26,073 | 274 | 25,237 | CACA UAAACGAACUU |
| BGI-PUP2 | 25,670 - 26,134 | 154 | 25,600 | U GCA U C AACG C A - U G UA G AAU UAU |
| E | 26,098 - 26,328 | 76 | 26,086 | AG U GAGU ACGAACUU |
| M | 26,379 - 27,044 | 221 | 26,325 | GG UCUAAACGAACU AACU A U U A U U |
| BGI-PUP3 | 27,055 - 27,246 | 63 | 26,974 | A C CG UA UUG GAA AC U AU AAAU UAA |
| BGI-PUP4 | 27,254 - 27,622 | 122 | 27,184 | C CUCUAA - C U AA —G AA G A AU U A |
| AG UCUAAACGAAC | ||||
| AG UCUAAACGAAC A U G AAA -C U U C | ||||
| BGI-PUP | 27,760 - 27,863 | 122 | 27,750 | AG UCUAAACGAAC |
| (GD- | -(29 bp insertion)- | |||
| N | 28,101 - 29,369 | 422 | 28,083 | U AAA UAAACGAAC AAAU U AA A |
| BGI - PUP7 | 28,111 - 28,407 | 98 | 28,083 | U AAA UAAACGAAC AAAUU AA AA UG |
The nucleotide position is in reference to the complete sequence of BJ01. An ORF includes both start codon and stop codon. An actual position in GD01 is calculated by adding 3 nucleotides.
The position is in reference to the first nucleotide in the consensus leader core sequence (CUAAACGAAC) of the TRS.
The PUPs are equivalent to ORF 10 and ORF 11 in Tor2 (NC_004718) (.
BGI-PUP (GD-Ins29) is present only in the minor genotype of GD-Ins29.
Fig. 1The GD-Ins29 in the genome of Isolate GD01. Solid arrows indicate ORFs and the gray arrows denote the BGI-PUPs. Open arrows highlight the region of interest. The position of the insertion and related annotations are in the parentheses.
Summarized Substitutions in 17 Isolates of SARS-CoV
| ORF | Size (nt) | No. of S | Percentage of substitute (%) | No. of N-Syn | Percentage of N-Syn (%) |
|---|---|---|---|---|---|
| R | 21,222 | 92 (26) | 0.43 | 65 (16) | 71 |
| S | 3,768 | 22 (7) | 0.58 | 13 (5) | 59 |
| BGI-PUP1 | 825 | 9 (3) | 1.09 | 6 (2) | 67 |
| BGI-PUP2 | 465 | 5 (3) | 1.08 | 4 (2) | 80 |
| E | 231 | 1 (1) | 0.43 | 1 (1) | 100 |
| M | 666 | 4 | 0.60 | 4 | 100 |
| BGI-PUP3 | 192 | 2 | 1.04 | 2 | 100 |
| N | 1,269 | 4 | 0.32 | 3 | 75 |
| BGI-PUP | |||||
| (GD- | 369 | 2 (1) | 0.54 | 2 (1) | 100 |
| Non-ORF | 1 | ||||
| Total | 29,725 | 142 (41) | 0.46 | 100 |
S: substitution; N-Syn: non-synonymous substitution.
Number in the parenthesis indicates the substitutions contributed solely by Isolate GD01.
A single substitution at the same position in a region overlapped with two ORFs is counted as 2. The total number would be 137 if such a substitution event were calculated as 1, and the total number of substitutions contributed by Isolate GD01 would be 38, accordingly.
Fig. 2Phylogenetic analysis of the 17 SARS-CoV isolates based on completed genomes. The proposed rectangular cladogram was generated by Clustalw 1.81 and the bootstrap values were deducted from 1,000 replicates. The sources and abbreviations of the sequences are referred to the text of Table 1.