| Literature DB >> 34924802 |
Abdallah A Hassanin1, Sayed Haidar Abbas Raza2, Javed Ahmed Ujjan3, Ayshah Aysh ALrashidi4, Basel M Sitohy5, Ameena A Al-Surhanee6, Ahmed M Saad7, Tahani Mohamed Al-Hazani8, Osama Osman Atallah9, Khalid M Al Syaad10,11, Ahmed Ezzat Ahmed10,12, Ayman A Swelum13,14, Mohamed T El-Saadony15, Mahmoud Z Sitohy7.
Abstract
The emergence of coronavirus disease 2019 (COVID-19) pandemic in Wuhan city, China at the end of 2019 made it urgent to identify the origin of the causal pathogen and its molecular evolution, to appropriately design an effective vaccine. This study analyzes the evolutionary background of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2 or SARS-2) in accordance with its close relative SARS-CoV (SARS-1), which was emerged in 2002. A comparative genomic and proteomic study was conducted on SARS-2, SARS-1, and Middle East respiratory syndrome coronavirus (MERS), which was emerged in 2012. In silico analysis inferred the genetic variability among the tested viruses. The SARS-1 genome harbored 11 genes encoding 12 proteins, while SARS-2 genome contained only 10 genes encoding for 10 proteins. MERS genome contained 11 genes encoding 11 proteins. The analysis also revealed a slight variation in the whole genome size of SARS-2 comparing to its siblings resulting from sequential insertions and deletions (indels) throughout the viral genome particularly ORF1AB, spike, ORF10 and ORF8. The effective indels were observed in the gene encoding the spike protein that is responsible for viral attachment to the angiotensin-converting enzyme 2 (ACE2) cell receptor and initiating infection. These indels are responsible for the newly emerging COVID-19 variants αCoV, βCoV, γCoV and δCoV. Nowadays, few effective COVID-19 vaccines developed based on spike (S) glycoprotein were approved and become available worldwide. Currently available vaccines can relatively prevent the spread of COVID-19 and suppress the disease. The traditional (killed or attenuated virus vaccine and antibody-based vaccine) and innovated vaccine production technologies (RNA- and DNA-based vaccines and viral vectors) are summarized in this review. We finally highlight the most common questions related to COVID-19 disease and the benefits of getting vaccinated.Entities:
Keywords: (E), envelope; (M), membrane; (N), nucleocapsid; (S), Spike; ACE2, Angiotensin Converting Enzyme 2; BLAST, Basic Local Alignment Search Tool; CD4, Helper T lymphocytes express cluster determinant 4; CD8, cytotoxic T cells express cluster determinant 8; CDC, Centers of Disease Control; COVID-19; COVID-19, Corona Virus Disease 2019; Coronavirus; Del, Deletion; Genomics; Ins, Insertion; MDCK, Madin-Darby Canine Kidney; MERS-CoV, Middle East Respiratory Syndrome Coronavirus; NCBI, National Center for Biotechnology Information; NJ, neighbor-joining; NSP, nonstructural protein; NTD, N-Terminal Domain; ORF, Open Reading Frame; PCR, polymerase chain reaction; PID, percentage identity; Proteomics; RBD, receptor binding domain; SARS-COV-2; SARS-CoV-1, Severe Acute Respiratory Syndrome Coronavirus 1; SARS-CoV-2, Severe Acute Respiratory Syndrome Coronavirus 2; UTR, Untranslated region; VOC, variants of concern; Vaccines; WHO, World Health Organization; mAbs, monoclonal antibodies
Year: 2021 PMID: 34924802 PMCID: PMC8667566 DOI: 10.1016/j.sjbs.2021.12.020
Source DB: PubMed Journal: Saudi J Biol Sci ISSN: 2213-7106 Impact factor: 4.052
Comparative genomic and proteomic analyses of SARS-1, SARS-2 and MERS viruses. Gene size is measured in nucleotides (nt) and protein size is estimated in amino acids (aa).
| Gene no. | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene name | Gene | Gene | Protein | Gene | Genomic location | Gene length | Protein | Gene name | Gene | Gene length | Protein | |
| 1–248 | 248 | 1–265 | 265 | 1–278 | 278 | |||||||
| 1 | 249–13397 | 13,148 | 4382 | 266–13483 | 13,217 | 4405 | 279–13454 | 13,175 | 4391 | |||
| 2 | 249–21469 | 21,220 | 7073 | 266–21555 | 21,289 | 7096 | 279–21514 | 21,235 | 7078 | |||
| 3 | 21476–25243 | 3767 | 1255 | 21563–25384 | 3821 | 1273 | 21456–25517 | 4061 | 1353 | |||
| 4 | 25252–26076 | 824 | 274 | 25393–26220 | 827 | 275 | 25532–25843 | 311 | 103 | |||
| 5 | 25673–26137 | 500 | 154 | 25852–26181 | 329 | 109 | ||||||
| 6 | 26101–26331 | 230 | 76 | 26245–26472 | 227 | 75 | 26093.0.26833 | 740 | 246 | |||
| 7 | 26382–27047 | 665 | 221 | 26523–27191 | 668 | 222 | 26840–27514 | 674 | 224 | |||
| 8 | 27058–27249 | 191 | 63 | 27202–27387 | 185 | 61 | 27590–27838 | 248 | 82 | |||
| 9 | 27257–27625 | 368 | 122 | 27394–27759 | 365 | 121 | 27853–28512 | 659 | 219 | |||
| 10 | 27763–28131 | 368 | 122 | 27894–28259 | 365 | 121 | 28566–29807 | 1241 | 413 | |||
| 11 | 28133–29401 | 1268 | 422 | 28274–29533 | 1259 | 419 | 28762–29100 | 338 | 112 | |||
| 12 | 28143–28439 | 296 | 98 | 29558–29674 | 116 | 38 | ||||||
| 28440–29757 | 1317 | 29675–29903 | 228 | 29101–30119 | 1018 | |||||||
| Genome size: | 29,757 nt | 29,903 nt | 30,119 nt | |||||||||
Genomic data were obtained from the NCBI GenBank database and analyzed using the proper bioinformatics tools.
Positions and potential roles of SARS-2 nonstructural proteins (nsp) in ORF1ab as predicted by bioinformatics tools.
| NSPs | Protein IDs for SARS-2 | amino acid (aa) position | aa identity % with | Putativerole | |
|---|---|---|---|---|---|
| SARS-1 | MERS | ||||
| Nsp1 | YP-009725297.1 | M1 – G180 | 84 | 21 | Suppress antiviral host response |
| Nsp2 | YP-009725298.1 | A181 – G818 | 68 | 22 | proofreading viral replication |
| Nsp3 | YP-009725299.1 | A819 – G2763 | 76 | 33 | putative papain-like-protease domain |
| Nsp4 | YP-009725300.1 | K2764 – Q3263 | 80 | 40 | complex with nsp3 and 6: contains transmembrane domain 2 (TM2) |
| Nsp5 | YP-009725301.1 | S3264 – Q3569 | 96 | 51 | 3CL-pro domain and main protease |
| Nsp6 | YP-009725302.1 | S3570 – Q3859 | 88 | 35 | complex with nsp3 and 4: putative transmembrane domain |
| Nsp7 | YP-009725303.1 | S3860 – Q3942 | 99 | 56 | complex with nsp8: RNA-dependent RNA polymerase |
| Nsp8 | YP-009725304.1 | A3943 – Q4140 | 97 | 53 | complex with nsp7: Multimeric RNA polymerase; replicase |
| Nsp9 | YP-009725305.1 | N4141 – Q4253 | 97 | 54 | ssRNA binding activity |
| Nsp10 | YP-009725306.1 | A4254 – Q4392 | 97 | 59 | complex with nsp14: proofreading, contains two zincbinding motifs |
| Nsp11 | YP-009725312.1 | S4393 – V4405 | 85 | 34 | short peptide at the end of orf1a |
| Nsp12 | YP-009725307.1 | S4393 – Q5324 | 96 | 72 | RNA-dependent RNA polymerase |
| Nsp13 | YP-009725308.1 | A5325 – Q5925 | 100 | 72 | Helicase, Zinc-binding domain |
| Nsp14 | YP-009725309.1 | A5926 – Q6452 | 95 | 63 | ExoN: 3′–5′ exonuclease |
| Nsp15 | YP-009725310.1 | S6453 – Q6798 | 89 | 52 | EndoU: poly(U)-specific endoribonuclease |
| Nsp16 | YP-009725311.1 | S6799 – N7096 | 93 | 66 | 2′-O-ribose methyltransferase |
| S | QHD43416.1 | 1 – 1273 | 76 | 33 | Spike protein |
| orf3a | QHD43417.1 | 1 – 275 | 72 | 21 | Accessory protein |
| E | QHD43418.1 | 1 – 75 | 95 | 37 | Envelop protein |
| M | QHD43419.1 | 1 – 222 | 91 | 40 | Membrane protein |
| orf6 | QHD43420.1 | 1 – 61 | 69 | – | Accessory protein |
| orf7a | QHD43421.1 | 1 – 121 | 85 | – | Accessory protein |
| orf8 | QHD43422.1 | 1 – 121 | 40 | 30 | Accessory protein |
| N | QHD43423.2 | 1 –419 | 94 | 49 | Nucleocapsid phosphoprotein |
| Orf10 | QHI42199.1 | 1 – 38 | – | – | Accessory protein |
Fig. 1Schematic representation of the SARS-2 virion and RNA genome. A) Structure of SARS-2 virion particle, and the function of its structural proteins. B) Genome organization gene expression strategies of SARS-2. The 5′-proximal two-thirds of the genome contains two overlapping open reading frames (ORF1a and ORF1b) that encode two long polyprotein precursors, namely pp1a and pp1ab, respectively.
Fig.2Phylogenetic analysis revealing the descendant subspecies of genus Betacoronavirus. Full-length genomic RNA sequences representing members from every subspecies were included along with the genomes of SARS-CoV, SARS-CoV-2 and MERS-CoV (in red). The phylogenetic trees were calculated from distance mattresses determined from percentage identity (PID) using neighbor-joining (NJ) algorithm. Genomic sequences were retracted from the GenBank and GISAID database under accession numbers shown in. Multiple sequence alignment was done using Clustal Omega online tool(Madeira et al., 2019), data were curated manually using Jalview2(Waterhouse et al., 2009), and phylogeny was done using MEGA X software with bootstrap values calculated from 1000 replicates(Kumar et al., 2018). Subspecies and corresponding lineages are shown on branches.
Fig. 3Taxonomic lineages of genus Four genetically distinct lineages are designated A, B, C and D. representative example for each lineage is shown. The variable accessory ORFs at the 3′-third of the viral genomes are shown. Boxes colors match those of each corresponding subspecies shown in (Fig. 2). Open boxes represent consensus viral ORFs.
List of coronavirus isolates used to study the phylogenetic relationship. Accession numbers for the isolates studied in the current research are shown in bold and underlined font. The year of discovery and identification is also shown.
| Coronavirus isolate: | GenBank ID: | Year | |
|---|---|---|---|
| 1 | SARS-CoV GZ02 | AY390556 | 2003 |
| 2 | SARS-CoV SZ3 | AY304486.1 | 2003 |
| 3 | SARS-CoV Tor2 | AY274119 | 2003 |
| 4 | SARS-CoV BJ01 | AY278488 | 2003 |
| 5 | AY278489.2 | 2003 | |
| 6 | Bat SARSr-CoV Rs4231 | KY417146 | 2016 |
| 7 | Bat SARSr-CoV SHC014 | MT308984.1 | 2013 |
| 8 | Bat SARSr-CoV WIV1 | KF367457 | 2013 |
| 9 | Bat SARSr-CoV YN2018C | MK211377 | 2018 |
| 10 | Bat SARSr-CoV Rp3 | DQ071615.1 | 2004 |
| 11 | Bat SARSr-CoV HKU3-1 | DQ022305 | 2004 |
| 12 | Bat SARSr-CoV BM48-31 | NC014470 | 2008 |
| 13 | Pangolin coronavirus Guangxi | MT072864 | 2020 |
| 14 | Bat coronavirus ZXC21 | MG772934 | 2018 |
| 15 | Bat coronavirus ZC45 | MG772933 | 2018 |
| 16 | Pangolin coronavirus Guangdong | MT121216 | 2019 |
| 17 | Bat coronavirus RmYN02 | EPI_ISL_412977 | 2020 |
| 18 | Bat coronavirus RaTG13 | MN996532 | 2013 |
| 19 | MN908947.3 | 2020 | |
| 20 | SARS-CoV-2_Egy_CCHE57357 | MZ380257.1 | 2021 |
| 21 | SARS-CoV-2_ USA/CA-CHLA-PLM73818046 | MZ722702.1 | 2020 |
| 22 | SARS-CoV-2_ /France/10009EE | MT470142.1 | 2020 |
| 23 | SARS-CoV-2_ CHN/Hefei-300 | MZ824630.1 | 2020 |
| 24 | Bat Hp-betacoronavirus Zhejiang2013 | KF636752.1 | 2013 |
| 25 | Rousettus bat coronavirus HKU9 | EF065513 | 2006 |
| 26 | Rousettus bat coronavirus GCCDC1 | MT350598.1 | 2016 |
| 27 | Pipistrellus bat coronavirus HKU5 | NC009020 | 2006 |
| 28 | Tylonycteris bat coronavirus HKU4 | NC009019 | 2006 |
| 29 | NC_019843.3 | 2012 | |
| 30 | Human coronavirus HKU1 | NC006577 | 2005 |
| 31 | Murine hepatitis virus_A59 | NC_048217.1 | 2005 |
| 32 | Human coronavirus OC43 | AY391777 | 2003 |
The distance identity mattresses used to draw the phylogenetic relationship among SARS coronaviruses. Percentages of identity were calculated by Clustal 12.1.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 3 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 100 | ||||||||||||||||||||||||||||||||
| 2 | 72.4 | 100 | |||||||||||||||||||||||||||||||
| 3 | 72.5 | 72.4 | 100 | ||||||||||||||||||||||||||||||
| 4 | 50.3 | 50.4 | 49.4 | 100 | |||||||||||||||||||||||||||||
| 5 | 52.0 | 52.8 | 50.7 | 72.4 | 100 | ||||||||||||||||||||||||||||
| 6 | 51.1 | 51.5 | 50.2 | 70.0 | 70.2 | 100 | |||||||||||||||||||||||||||
| 7 | 51.5 | 52.4 | 50.5 | 50.6 | 52.2 | 51.1 | 100 | ||||||||||||||||||||||||||
| 8 | 49.4 | 49.9 | 49.3 | 50.3 | 50.8 | 50.4 | 66.5 | 100 | |||||||||||||||||||||||||
| 9 | 49.5 | 50.1 | 49.2 | 51.1 | 51.7 | 51.7 | 52.0 | 51.0 | 100 | ||||||||||||||||||||||||
| 10 | 50.9 | 51.6 | 50.7 | 51.8 | 52.9 | 52.5 | 52.7 | 51.6 | 57.0 | 100 | |||||||||||||||||||||||
| 11 | 50.7 | 51.1 | 50.2 | 52.0 | 52.7 | 52.6 | 52.0 | 51.3 | 56.9 | 78.5 | 100 | ||||||||||||||||||||||
| 12 | 50.7 | 51.0 | 50.1 | 52.0 | 52.6 | 52.5 | 52.0 | 51.3 | 56.7 | 78.6 | 87.9 | 100 | |||||||||||||||||||||
| 13 | 50.8 | 51.0 | 50.1 | 52.0 | 52.7 | 52.6 | 52.1 | 51.3 | 56.8 | 78.7 | 87.9 | 98.5 | 100 | ||||||||||||||||||||
| 14 | 50.8 | 51.0 | 50.1 | 52.0 | 52.7 | 52.6 | 52.1 | 51.3 | 56.8 | 78.7 | 87.9 | 98.4 | 100 | 100 | |||||||||||||||||||
| 15 | 50.8 | 51.0 | 50.1 | 52.0 | 52.7 | 52.6 | 52.1 | 51.3 | 56.7 | 78.7 | 87.9 | 98.4 | 99.8 | 99.8 | 100 | ||||||||||||||||||
| 16 | 50.8 | 51.0 | 50.1 | 52.0 | 52.7 | 52.6 | 52.1 | 51.3 | 56.8 | 78.7 | 87.9 | 98.4 | 99.9 | 99.9 | 99.9 | 100 | |||||||||||||||||
| 17 | 50.8 | 50.9 | 50.1 | 52.0 | 52.7 | 52.6 | 52.1 | 51.3 | 56.7 | 78.7 | 87.8 | 98.4 | 99.8 | 99.9 | 99.8 | 99.9 | 100 | ||||||||||||||||
| 18 | 50.7 | 50.8 | 50.0 | 51.9 | 52.6 | 52.6 | 52.1 | 51.5 | 56.8 | 78.8 | 88.3 | 95.6 | 95.7 | 95.7 | 95.7 | 95.7 | 95.7 | 100 | |||||||||||||||
| 19 | 50.8 | 51.1 | 50.2 | 52.0 | 52.7 | 52.7 | 52.1 | 51.5 | 57.0 | 78.7 | 88.3 | 96.4 | 95.6 | 95.6 | 95.5 | 95.6 | 95.5 | 96.8 | 100 | ||||||||||||||
| 20 | 50.8 | 51.0 | 50.1 | 51.9 | 52.6 | 52.5 | 52.1 | 51.4 | 56.9 | 78.8 | 89.7 | 93.9 | 93.9 | 93.9 | 93.8 | 93.8 | 93.8 | 95.5 | 95.6 | 100 | |||||||||||||
| 21 | 50.9 | 51.0 | 50.2 | 52.0 | 52.8 | 52.6 | 52.0 | 51.4 | 57.0 | 78.7 | 89.7 | 92.5 | 92.5 | 92.5 | 92.5 | 92.5 | 92.4 | 93.4 | 93.5 | 95.3 | 100 | ||||||||||||
| 22 | 51.2 | 52.2 | 50.4 | 52.3 | 53.4 | 52.8 | 52.5 | 51.6 | 57.2 | 76.1 | 78.7 | 78.9 | 78.8 | 78.8 | 78.9 | 78.9 | 78.8 | 79.1 | 79.2 | 79.1 | 79.0 | 100 | |||||||||||
| 23 | 51.4 | 52.4 | 50.8 | 52.2 | 53.4 | 52.8 | 52.9 | 51.7 | 57.4 | 76.2 | 79.1 | 79.3 | 79.4 | 79.4 | 79.4 | 79.5 | 79.4 | 79.1 | o | 79.3 | 79.3 | 83.5 | 100 | ||||||||||
| 24 | 51.5 | 52.5 | 50.7 | 52.2 | 53.5 | 52.9 | 52.8 | 51.6 | 57.5 | 76.4 | 79.3 | 79.3 | 79.3 | 79.3 | 79.3 | 79.3 | 79.3 | 79.5 | 79.5 | 79.6 | 79.4 | 85.2 | 93.2 | 100 | |||||||||
| 25 | 51.4 | 52.6 | 50.9 | 52.4 | 53.6 | 52.8 | 53.0 | 51.8 | 57.5 | 76.5 | 79.3 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.4 | 79.6 | 79.6 | 79.6 | 79.5 | 85.4 | 93.9 | 96.2 | 100 | ||||||||
| 26 | 51.4 | 52.6 | 50.9 | 52.4 | 53.6 | 52.8 | 53.0 | 51.8 | 57.5 | 76.5 | 79.4 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.6 | 79.7 | 79.6 | 79.5 | 85.4 | 93.9 | 96.2 | 100 | 100 | |||||||
| 27 | 51.4 | 52.6 | 50.9 | 52.4 | 53.6 | 52.8 | 53.0 | 51.8 | 57.5 | 76.5 | 79.3 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.4 | 79.6 | 79.6 | 79.6 | 79.5 | 85.4 | 93.9 | 96.2 | 100 | 100 | 100 | ||||||
| 28 | 51.4 | 52.6 | 50.9 | 52.4 | 53.6 | 52.8 | 53.0 | 51.8 | 57.5 | 76.5 | 79.4 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.6 | 79.6 | 79.6 | 79.5 | 85.4 | 93.9 | 96.2 | 100 | 100 | 100 | 100 | |||||
| 29 | 51.4 | 52.6 | 50.9 | 52.4 | 53.6 | 52.8 | 53.0 | 51.8 | 57.5 | 76.5 | 79.4 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.6 | 79.6 | 79.6 | 79.5 | 85.4 | 93.9 | 96.2 | 100 | 100 | 100 | 100 | 100 | ||||
| 30 | 51.4 | 52.6 | 50.9 | 52.4 | 53.6 | 52.8 | 53.0 | 51.8 | 57.5 | 76.5 | 79.4 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.5 | 79.6 | 79.6 | 79.6 | 79.5 | 85.4 | 93.9 | 96.2 | 100 | 100 | 100 | 100 | 100 | 100 | |||
| 31 | 51.3 | 52.1 | 50.8 | 52.3 | 53.3 | 52.8 | 52.6 | 51.4 | 57.2 | 76.1 | 78.8 | 78.9 | 78.9 | 78.9 | 78.9 | 78.9 | 78.9 | 79.0 | 79.1 | 79.2 | 78.9 | 85.1 | 88.3 | 90.2 | 90.3 | 90.3 | 90.3 | 90.3 | 90.3 | 90.3 | 100 | ||
| 32 | 51.2 | 52.1 | 50.4 | 52.1 | 53.3 | 52.6 | 52.8 | 51.5 | 57.3 | 76.8 | 82.3 | 81.0 | 81.0 | 81.0 | 81.0 | 81.0 | 80.9 | 81.1 | 81.2 | 81.5 | 81.5 | 83.3 | 87.3 | 88.0 | 88.0 | 88.0 | 88.0 | 88.0 | 88.0 | 88.0 | 87.1 | 100 | |
| 33 | 51.1 | 52.2 | 50.5 | 52.2 | 53.3 | 52.6 | 52.8 | 51.5 | 57.4 | 76.7 | 82.7 | 80.9 | 80.9 | 80.9 | 80.9 | 80.9 | 80.8 | 81.0 | 81.2 | 81.4 | 81.4 | 83.3 | 87.2 | 88.0 | 88.0 | 88.1 | 88.0 | 88.0 | 88.0 | 88.0 | 87.1 | 97.5 | 100 |
Fig. 4Naturally occurring indels throughout SARS-CoV-2 genome. Boxes indicate the genomic size, number of inserted and deleted nucleotides of the viral genes. Del: Deletion, Ins: Insertion.
Cumulative mutations within conserved ORFs throughout SARS-2 genome. Genetic variations were among SARS-1 and SARS-2 genomes are presented as number of nucleotide insertions, deletions and number of loci.
| no. of nucleotides | no. of loci | no. of nucleotides | no. of loci | total loci | |
|---|---|---|---|---|---|
| Orf1ab | 78 | 2 | 9 | 3 | 5 |
| S | 66 | 2 | 12 | 5 | 7 |
| ORF3a | 3 | 1 | – | – | 1 |
| E | – | – | 3 | 1 | 1 |
| M | 3 | 1 | – | – | 1 |
| ORF6 | – | – | 6 | 1 | 1 |
| ORF7a | – | – | 3 | 1 | 1 |
| ORF8 | 15 | 3 | 18 | 2 | 5 |
| N | – | – | 9 | 2 | 2 |
| ORF10 | 3 | 1 | 181 | 3 | 4 |
| – | |||||
Fig. 5CLUSTAL O (1.2.4) multiple proteins sequences alignment of SARS-CoV-2 (SARS-2) and SARS-COV (SARS-1) proteome. (A) Alignment of the variable regions of Orf1ab protein. (B) Alignment of the variable regions of Spike protein (S). Del: Deletion, Ins: Insertion.
Fig. 7CLUSTAL O (1.2.4) multiple proteins sequences alignment of SARS-CoV-2 (SARS-2) and SARS-COV (SARS-1) proteome. (A) Alignment of the variable regions of ORF7a protein. (B) Alignment of the variable regions of ORF8 protein. (C) Alignment of the variable regions of nucleocapsid protein. (D) Alignment of the variable regions of ORF10 protein. Del: Deletion, Ins: Insertion.
Fig. 6CLUSTAL O (1.2.4) multiple proteins sequences alignment of SARS-CoV-2 (SARS-2) and SARS-COV (SARS-1) proteome. (A) Alignment of the variable regions of ORF3a protein. (B) Alignment of the variable regions of envelope protein (E). (C) Alignment of the variable regions of membrane glycoprotein (M). (D) Alignment of the variable regions of ORF6 protein. Del: Deletion, Ins: Insertion.
Fig. 8Initial and severe clinical symptoms of COVID-19 (Control and Prevention, 2020).
Fig. 9SARS-2 variants and their essential mutations within the spike protein. The variants are referred as alpha, beta, gamma and delta based on WHO labeling system and classified into lineages according to Pang nomenclature. Country and time where and when the variants were firstly characterized are shown. The viral variants as well as their characterizing mutations are color coded.
Most studied amino acid mutations that characterize SARS-2 variants and their potential roles in viral fitness.
| L18F | Leucine-to-phenylalanine | Reduce neutralization by some antibodies. |
| ΔH69–70 | Histidine deletion | Change the conformation of an exposed NTD loop and associated with increasing infectivity. |
| ΔY144 | Tyrosine deletion | Seem to change the conformation of the N3 NTD loop (amino acid positions 140–156) and reduces antibody binding affinity. |
| K417N | Lysine-to- asparagine | Reduces virus sensitivity to antibodies and increases binding affinity to human ACE2 receptor. |
| N439K | Asparagine-to-lysine | Increase the binding affinity for the ACE2 receptor and reduces the neutralizing activity. |
| L452R | Leucine-to-arginine | Confer stronger affinity of the S protein for the ACE2 receptor. |
| T478K | Threonine-to-lysine | May enhances viral virulence as it was found in the rapidly rising SARS in Mexico and South America. |
| E484K | Glutamic acid-to-lysine | An escape mutation: immune dominant spike protein residue with various substitutions; facilitating escape from several mAbs and evade the immune system. |
| E484Q | Glutamic acid-to-glutamine | May be functionally similar to the antibody-escape mutation E484K. |
| N501Y | Asparagine to tyrosine | Increases in human ACE2 binding affinity conferred by a single RBD mutation. |
| D614G | Aspartic acid-to-glycine | Found in highly transmissible lineages like B.1.1.7, B.1.351 and P.1. it reduces the S1 shedding and increases the infectivity. |
| P681R | Proline-to-arginine | Boost cell-level infectivity of the variant and thus helps virus entry and abolish phospho-inhibition at S1/S2 site. |
| P681H | Proline-to-histidine | May increase spike cleavage by furin-like proteases. |
| D950N | Aspartic acid-to- asparagine | May participate in the regulation of s protein dynamics. |
Fig. 10Traditional and novel approaches of COVID-19 vaccines production.