| Literature DB >> 33486372 |
Reham M Dawood1, Mai A El-Meguid1, Ghada M Salum1, Khaled El-Wakeel2, Mohamed Shemis3, Mostafa K El Awady4.
Abstract
BACKGROUND: The striking difference in severity of SARS CoV2 infection among global population is partly attributed to viral factors. With the spike (S) and nucleocapsid (N) are the most immunogenic subunits, genetic diversity and antigenicity of S and N are key players in virulence and in vaccine development. AIM: This paper aims at identifying immunogenic targets for better vaccine development and/or immunotherapy of COVID 19 pandemic.Entities:
Keywords: Furin cleavage site; Genetic distance; NH2 Terminal Domain; Phylogenetic tree; SARS CoV2
Mesh:
Substances:
Year: 2020 PMID: 33486372 PMCID: PMC7737509 DOI: 10.1016/j.jiph.2020.12.006
Source DB: PubMed Journal: J Infect Public Health ISSN: 1876-0341 Impact factor: 3.718
Designations, lengths and accession IDs of SARS CoV2, SARS CoV and MERS CoV complete genomes. Genome sequences of SARS CoV2 were derived from isolates of diverse populations covering four geographical areas worldwide; Asia, Middle East, Europe and North America.
| Accession number | S protein | N protein | Genome sequence | |
|---|---|---|---|---|
| 1 | NC_045512 | 21,563.25384 | 28,274.29533 | Severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1, complete genome |
| 2 | MT077125.1 | 21,507.25328 | 28,218.29477 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/ITA/INMI1/2020, complete genome |
| 3 | MT066156.1 | 21,563.25384 | 28,274.29533 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/ITA/INMI1/2020, complete genome |
| 4 | LC534418.1 | 21,559.25380 | 28,270.29529 | Severe acute respiratory syndrome coronavirus 2 SARS-CoV-2/Hu/DP/Kng/19-031 RNA, complete genome |
| 5 | MN996531.1 | 21,550.25371 | 28,261.29520 | Severe acute respiratory syndrome coronavirus 2 isolate WIV07, complete genome |
| 6 | MT012098.1 | 21,550.25368 | 28,258.29517 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/IND/29/2020, complete genome |
| 7 | MT020880.1 | 21,563.25384 | 28,274.29533 | Severe acute respiratory syndrome coronavirus 2 isolate 2019-nCoV/USA-WA1-A12/2020, complete genome |
| 8 | MN938384.1 | 21,531.25352 | 28,242.29501 | Severe acute respiratory syndrome coronavirus 2 isolate 2019-nCoV_HKU-SZ-002a_2020, complete genome. |
| 9 | MT027064.1 | 21,563.25384 | 28,274.29533 | Severe acute respiratory syndrome coronavirus 2 isolate 2019-nCoV/USA-CA5/2020, complete genome |
| 10 | MT292569.1 | 21,509.25330 | 28,220.29479 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/ESP/Valencia13/2020, complete genome |
| 11 | MT304491.1 | 21,563.25384 | 28,274.29533 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/USA/TX_2967/2020, complete genome |
| 12 | MT320538.1 | 21,562.25380 | 28,270.29529 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/FRA/KRA-ROB/2020, complete genome |
| 13 | MT320891.1 | 21,525.25346 | 28,236.29495 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/IRN/HGRC-1.1-IPI-8206/2020, complete genome |
| 14 | MT276598.1 | 21,567.25388 | 28,278.29537 | Severe acute respiratory syndrome coronavirus 2 isolate SARS-CoV-2/human/ISR/ISR_IT0320/2020, complete genome |
| 15 | KT225476.2 | 21450.25511 | 28,560.29801 | Middle East respiratory syndrome coronavirus isolate |
| 16 | KJ556336.1 | 21,240.25301 | 28,350.29591 | Middle East respiratory syndrome coronavirus isolate |
| 17 | AY274119.3 | 21,492.25259 | 28,120.29388 | |
| 18 | NC_006577.2 | 22,942.27012 | 28,320.29645 |
Fig. 3Location of SARS CoV2-S (a) and N (b) proteins derived B cell and T cell epitopes (MCH1 and MCH2) on the protein structure. Functional domains in the S protein are marked as follows: NH2-Terminal Domain (a.a. 1–333), Receptor Binding Domain (a.a. 438–506) and S1/S2 cleavage site (a.a. 683–686).
Figure 1Radiation Phylogram of the phylogenetic tree showing comparison of the similarity, among diverse populations, of complete genomes (a) Spike (b) and nucleocapsid (c) of SARS CoV2. The tree was constructed based on the available diverse sequences using MEGA program (see Methods section).
Genetic distance of SARS CoV2, SARS CoV and MERS CoV spike proteins between different populations. The numbers reflect the relative inter-relation between strains on the X-axis with 14 strains on the Y axis. The smaller the number indicates the greater sequence identity between any 2 isolates.
| LC534418.1, CV2 Japan | KJ556336.1_MERS_Jeddah | MN996531.1_CV2 Wuhan | MT012098.1_CV2_India | MT020880.1_CV2_USA WA | MT027064.1_CV2_USA CA | MT066156.1_CV2_Italy | MT077125.1_CV2_Italy | MT276598.1_CV2_Israel | MT292569.1_CV2_Espagne | MT320538.1_CV2_France | MT320891.1_CV2_Iran | NC_045512.2_CV2_Wuhan | AY274119.3_CV_Tor2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KJ556336.1_MERS_Jeddah | 1.080 | |||||||||||||
| MN996531.1_CV2 China | 0.000 | 1.080 | ||||||||||||
| MT012098.1_CV2_India | 0.000 | 1.081 | 0.000 | |||||||||||
| MT020880.1_CV2_USA WA | 0.000 | 1.080 | 0.000 | 0.000 | ||||||||||
| MT027064.1_CV2_USA CA | 0.000 | 1.081 | 0.000 | 0.001 | 0.000 | |||||||||
| MT066156.1_CV2_Italy | 0.000 | 1.080 | 0.000 | 0.000 | 0.000 | 0.000 | ||||||||
| MT077125.1_CV2_Italy | 0.000 | 1.080 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||||||
| MT276598.1_CV2_Israel | 0.000 | 1.080 | 0.000 | 0.001 | 0.000 | 0.001 | 0.000 | 0.000 | ||||||
| MT292569.1_CV2_Espagne | 0.000 | 1.080 | 0.000 | 0.001 | 0.000 | 0.001 | 0.000 | 0.000 | 0.000 | |||||
| MT320538.1_CV2_France | 0.001 | 1.080 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.000 | 0.000 | ||||
| MT320891.1_CV2_Iran | 0.000 | 1.080 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.001 | |||
| NC_045512.2_CV2_Wuhan | 0.000 | 1.080 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | ||
| AY274119.3_CV_Tor2 | 0.470 | 1.342 | 0.470 | 0.469 | 0.470 | 0.469 | 0.470 | 0.470 | 0.470 | 0.470 | 0.470 | 0.470 | 0.470 | |
| NC_006577.2_CV_HKU1 | 1.115 | 1.394 | 1.115 | 1.116 | 1.115 | 1.114 | 1.115 | 1.115 | 1.116 | 1.116 | 1.116 | 1.115 | 1.115 | 1.353 |
Genetic distance of SARS CoV2, SARS CoV and MERS CoV nucleocapsid proteins between different populations. The numbers reflect the relative inter-relation between strains on the X-axis with 14 strains on the Y axis. The smaller the number indicates the greater sequence identity between any 2 isolates.
| LC534418.1_CV2 Japan | KJ556336.1_MERS Jeddah | MN996531.1_CV2 Wuhan | MT012098.1_CV2_India | MT020880.1_CV2_USA WA | MT027064.1_CV2_USA CA | MT066156.1_CV2_Italy | MT077125.1_CV2_Italy | MT276598.1_CV2_Israel | MT292569.1_CV2_Espagne | MT320538.1_CV2_France | MT320891.1_CV2_Iran | NC_045512.2_CV2_Wuhan | AY274119.3_CV_Tor2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KJ556336.1_MERS_Jeddah | 8.850 | |||||||||||||
| MN996531.1_CV2 Wuhan | 0.000 | 8.850 | ||||||||||||
| MT012098.1_CV2_India | 0.000 | 8.850 | 0.000 | |||||||||||
| MT020880.1_CV2_USA WA | 0.000 | 8.850 | 0.000 | 0.000 | ||||||||||
| MT027064.1_CV2_USA CA | 0.000 | 8.850 | 0.000 | 0.000 | 0.000 | |||||||||
| MT066156.1_CV2_Italy | 0.000 | 8.850 | 0.000 | 0.000 | 0.000 | 0.000 | ||||||||
| MT077125.1_CV2_Italy | 0.000 | 8.850 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |||||||
| MT276598.1_CV2_Israel | 0.002 | 8.887 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | ||||||
| MT292569.1_CV2_Espagne | 0.000 | 8.850 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 | |||||
| MT320538.1_CV2_France | 0.000 | 8.850 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 | 0.000 | ||||
| MT320891.1_CV2_Iran | 0.001 | 8.850 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.003 | 0.001 | 0.001 | |||
| NC_045512.2_CV2_Wuhan | 0.000 | 8.850 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 | 0.000 | 0.000 | 0.001 | ||
| AY274119.3_CV_Tor2 | 0.192 | 9.113 | 0.192 | 0.192 | 0.192 | 0.192 | 0.192 | 0.192 | 0.199 | 0.192 | 0.192 | 0.194 | 0.192 | |
| NC_006577.2_CV_HKU1 | 11.335 | 13.175 | 11.335 | 11.335 | 11.335 | 11.335 | 11.335 | 11.335 | 11.358 | 11.335 | 11.335 | 11.335 | 11.335 | 12.393 |
Fig. 2(a) Multiple amino acid alignment between NH2-Terminal Domain of SARS CoV2 S proteins derived from 6 diverse populations covering the most stricken 3 continents Asia, Europe and North America. Homologous sequences are shown in red while mutations are indicated in black. Multiple alignments were performed using Cobalt software. (b) Pocket pattern of The NH2-Terminal Domain of the S protein. Pattern of amino acid pockets that make up the largest pocket at the farther end of the NTD derived from Wuhan reference strain as compared with the pocket pattern resulted from the Y144 and G107 deletions in the NTD of the India (accession ID # MT012098.1) and France (accession ID # MT320538.1) SARS CoV2 isolates respectively. The amino acid pockets were identified using Phyre2 program (see Methods section). (c) The 3 D models of S proteins in the reference strain derived from Wuhan China. Complete sequence of the S protein (blue) derived from the reference strain Wuhan (accession ID # NC_045512), was developed using Phyr2 program. The largest pocket (neutralizing antibody binding loop) is located at the end of the NH2-Terminal Domain (red).