| Literature DB >> 33519278 |
Ayman M Al-Qaaneh1,2, Thamer Alshammari1, Razan Aldahhan1, Hanan Aldossary1,3, Zahra Abduljaleel Alkhalifah1, J Francis Borgio1,3.
Abstract
SARS-CoV-2 is a type of Betacoronaviruses responsible for COVID-19 pandemic disease, with more than 1.745 million fatalities globally as of December-2020. Genetically, it is considered the second largest genome of all RNA viruses with a 5' cap and 3' poly-A tail. Phylogenetic analyses of coronaviruses reveal that SARS-CoV-2 is genetically closely related to the Bat-SARS Like-Corona virus (Bat-SL-Cov) with 96% whole-genome identity. SARS-CoV-2 genome consists of 15 ORFs coded into 29 proteins. At the 5' terminal of the genome, we have ORF1ab and ORF1a, which encode the 1ab and 1a polypeptides that are proteolytically cleaved into 16 different nonstructural proteins (NSPs). The 3' terminal of the genome represents four structural (spike, envelope, matrix, and nucleocapsid) and nine accessory (3a, 3b, 6, 7a, 7b, 8b, 9a, 9b, and orf10) proteins. As the number of COVID-19 patients increases dramatically worldwide, there is an urgent need to find a quick and sensitive diagnostic tool for controlling the outbreak of SARS-CoV-2 in the community. Today, molecular testing methods utilizing viral genetic material (e.g., PCR) represent the crucial diagnostic tool for the SARS-CoV-2 virus despite its low sensitivity in the early stage of viral infection. This review summarizes the genome composition and genetic characterization of the SARS-CoV-2.Entities:
Keywords: COVID-19; Coronaviruses; Diagnosis; Genome; SARS-CoV-2
Year: 2021 PMID: 33519278 PMCID: PMC7834485 DOI: 10.1016/j.sjbs.2020.12.053
Source DB: PubMed Journal: Saudi J Biol Sci ISSN: 2213-7106 Impact factor: 4.219
Fig. 1Sequence-region of SARS-CoV-2. GenBank accession: NC_045512.2. Resources: The 2019 Novel Coronavirus Resource (Zhou et al., 2020) and National Center for Biotechnology Information ().
Different genomic structures for human coronavirus genera.
| Genera | ||||
|---|---|---|---|---|
| Genomic Structure | ||||
| Spike Glycoproteins | Present | Present | Present | Present |
| ORFs | 7–10 additional ORFs | 7–10 additional ORFs | 7–10 additional ORFs | 7–10 additional ORFs |
| NSP1 | ~110 amino acids size | ~180–250 amino acids size | Absent | Absent |
| Papain-like protease (PLpro) | two PLpro domains | two PLpro domains | Single PLpro Domains | Single PLpro domains |
| Hemagglutinin Esterase (HE) | Absent | Present | Absent | Absent |
| Envelope proteins | Present | Present | Present | Present |
| Nucleocapsid protein | Present | Present | Present | Present |
| Membrane Glycoproteins: | Present | Present | Present | Present |
Classification of different type of human coronavirus.
| Genus | Coronavirus Type | Animal reservoir | Intermediary host | First identified | Symptoms | Cellular Receptor | Epidemiology | References |
|---|---|---|---|---|---|---|---|---|
| Alpha-coronavirus | HCoV-229E | Bats | Bats | 1966 | Common cold, sneezing, fever, dry cough, sore throat, Myalgia, diarrhea, and pneumonia | APN receptor Human Aminopeptidase N (CD13) | Endemic | ( |
| HCoV-NL63 | Bats | palm civets | 2004 | Sneezing, fever, dry cough, sore throat, Myalgia, diarrhea, febrile, and convulsion | ACE2 receptor | Endemic | ( | |
| Beta- coronaviruses | HCoV-OC43 | Mice | Cattle | 1967 | Common cold, sneezing, fever, dry cough, sore throat, Myalgia diarrhea, and pneumonia | 9-O-acetylsialic acids as receptor | Endemic | ( |
| HCoV-HKU1 | Mice | Mice | 2005 | Common cold, sneezing, fever, dry cough, sore throat, Myalgia, febrile convulsion diarrhea and pneumonia | 9-O-acetylsialic acids as receptor | Endemic | ( | |
| SARS-CoV | Bats | civet cat | 2003 | Common respiratory symptoms, fever, cough, shortness of breath, breathing difficulties, and pneumonia | ACE2 | Epidemic | ( | |
| MERS-CoV | Bats | Dromedary camels | 2012 | Cough, shortness of breath, and, on occasion, pneumonia. Gastrointestinal symptoms (diarrhea). | Dipeptidyl peptidase-4 (DPP4) receptor | Epidemic | ( | |
| SARS-CoV-2 | Bats | Bats | 2019 | Common respiratory symptoms, fever, cough, shortness of breath, and breathing Myalgia Dyspnea Vomiting pneumonia, respiratory system failure and even death | ACE2 | Pandemic | ( | |
Coding region, base length, amino acid length, and functions of different ORFs in SARS-CoV-2.
| Gene name | Coding region (nt) | Base length | Amino acid length (aa) | Function | Reference | |
|---|---|---|---|---|---|---|
| UTR | 265 | |||||
| Nonstructural protein | orf1a | 266–13483 | 13,217 | 4405 | Encoded nonstructural proteins (NSP1 to NSP11), essential for viral replication, viral assembly, immune response modulation, etc. | ( |
| Non-structural protein | orf1ab | 266–13468, 13468–21555 | 21,289 | 7096 | Encoded nonstructural proteins (NSP12 to NSP16), essential for viral replication | ( |
| Structural protein | S (Spike protein) | 21563–25384 | 3822 | 1273 | Spike protein, binding to cell receptor and mediate virus-cell fusion | ( |
| Accessory protein | ORF 3a | 25393–26220 | 828 | 275 | ion-channel activity, cell cycle arrest, apoptosis | ( |
| Accessory protein | ORF 3b | 25814–25882 | 68 | 22 | IFN (Type I) production and signaling inhibition, cell cycle arrest, apoptosis | ( |
| Structural protein | E (envelope protein) | 26245–26472 | 228 | 75 | Morphogenesis, assembly and of virions | ( |
| Structural protein | M (matrix protein) | 26523–27191 | 669 | 222 | Membrane protein, virus assembly | ( |
| Accessory protein | ORF 6 | 27202–27387 | 186 | 61 | IFN (Type I) production and signaling inhibition | ( |
| Accessory protein | ORF 7a | 27394–27759 | 366 | 121 | inhibiting the host translation, cell cycle arrest, apoptosis | ( |
| Accessory protein | ORF 7b | 27756–27887 | 132 | 43 | still unknown | ( |
| Accessory protein | ORF 8b | 27894–28259 | 366 | 121 | still unknown | ( |
| Structural protein | N(nucleocapsid) | 28274–29533 | 1260 | 419 | Replication, transcription, virion structure, and viral assembly | ( |
| Accessory protein | ORF 9a | 28284–28577 | 294 | 97 | still unknown | ( |
| Accessory protein | ORF 9b | 28,734–28,955 | 222 | 73 | still unknown | ( |
| Accessory protein | ORF 10 | 29558–29674 | 117 | 38 | still unknown | ( |
| UTR | 299 |
Coding region, amino acid length, and functions of different NSPs in SARS-CoV-2 virus genome.
| NSP | Coding region (aa) | Amino acid length (aa) | Function | Reference |
|---|---|---|---|---|
| NSP1 (leader protein) | 1–180 | 180 | mRNA degradation, interferon (IFN) signaling inhibition | ( |
| NSP2 | 181–818 | 638 | still unknown | ( |
| NSP3 | 819–2763 | 1945 | ADRP, cleaving Polypeptide, blocking the innate immunity of the host, enhancing expression of cytokines | ( |
| NSP4 | 2764–3263 | 500 | double-membrane vesicles (DMVs) formation | ( |
| NSP5 (3C-like proteinase) | 3264–3569 | 306 | Mpro, 3CLpro, polypeptides cleaving, and IFN signaling inhibiting | ( |
| NSP6 | 3570–3859 | 290 | formation of DMV, and restricting autophagosome expansion | ( |
| NSP7 | 3860–3942 | 83 | cofactor with NSP8 and NSP12 | ( |
| NSP8 | 3943–4140 | 198 | primase, cofactor with NSP7 and NSP12 | ( |
| NSP9 | 4141–4253 | 113 | RNA binding, dimerization | ( |
| NSP10 (growth-factor-like protein) | 4254–4392 | 139 | Scaffold protein for NSP14 and NSP16 | ( |
| NSP11 check if this available in SARS-CoV-2 | 4393–4405 | 13 | still unknown | ( |
| NSP12 (RNA-dependent RNA polymerase) | 4393–5324 | 932 | Primer dependent RdRp | ( |
| NSP13 (RNA 5′-triphosphatase) | 5325–5925 | 601 | RNA helicase, and 5′ triphosphatase | ( |
| NSP14 (3′-to-5′ exonuclease) | 5926–6452 | 527 | 3′-to-5′ exonuclease | ( |
| NSP15 (endoRNAse) | 6453–6798 | 346 | dsRNA sensors evasion, and NendoU | ( |
| NSP16 (2′-O-ribose methyltransferase) | 6799–7096 | 298 | OMT, negatively regulates the innate immunity | ( |
ADRP: adenosine diphosphate-ribose 1″-phosphatase; 3CLpro: 3C-like cysteine proteinase; RdRp: RNA-dependent RNA polymerase; NendoU: nidoviral endoribonuclease specific for U; OMT: S-adenosylmethionine-dependent ribose 2′-O-methyltransferase.
Fig. 2The phylogenetic analysis of different Coronaviruses.
Genome sequence similarity among beta-coronaviruses.
| Beta-Coronavirus | Percent of genome sequence Identity |
|---|---|
| SARS-CoV-2 vs. Bat-SL-CoV-ZC45 | 88% |
| SARS-CoV-2 vs. Bat-SL-CoV-ZXC21 | 88% |
| SARS-CoV-2 vs. SARS-CoV | 79.6% |
| SARS-CoV-2 vs. MERS-CoV | 50% |
Probes and Primers for SARS-CoV-2 Polymerase Chain Reaction (PCR) Tests.
| Name of the gene | Institution | Forward (5′-3′) / Reverse (5′-3′) | Probe (5′-3′) |
|---|---|---|---|
| Nuncleoacpsid fragment-1 (N1) | US CDC | GACCCCAAAATCAGCGAAAT / TCTGGTTACTGCCAGTTGAATCTG | FAM-ACCCCGCATTACGTTTGGTGGACC-BHQ1 |
| Nuncleoacpsid fragment-2 (N2) | US CDC | TTACAAACATTGGCCGCAAA / GCGCGACATTCCGAAGAA | FAM-ACAATTTGCCCCCAGCGCTTCAG-BHQ1 |
| Nucleocapsid fragment-3 (N3) | US CDC | GGGAGCCTTGAATACACCAAAA / TGTAGCACGATTGCAGCATTG | FAM-AYCACATTGGCACCCGCAATCCTG-BHQ1 |
| Nucleocapsid (N) | China, CDC | GGGGAACTTCTCCTGCTAGAAT / CAGATGTTAAASACACTATTAGCATA | FAM-TTGCTGCTGCTTGACAGATT-TAMRA |
| Hong Kong University | TAATCAGACAAGGAACTGATTA / CGAAGGTGTGACTTCCATG | FAM-GCAAATTGTGCAATTTGCGG-TAMRA | |
| National Institute of Infectious Diseases, Japan | AAATTTTGGGGACCAGGAAC / TGGCAGCTGTGTAGGTCAAC | FAM- ATGTCGCGCATTGGCATGGA-BHQ | |
| National Institute of Health, Thailand | CGTTTGGTGGACCCTCAGAT / CCCCACTGCGTTCTCCATT | FAM-CAACTGGCAGTAACCABQH1 | |
| Envelope (E) | Charité, Germany | ACAGGTACGTTAATAGTTAATAGCGT / ATATTGCAGCAGTACGCACACA | FAM-ACACTAGCCATCCTTACTGCGCTTCG-BBQ |
| RNAse P Forward Primer RNase (RP-F Rnase) | US CDC | AGATTTGGACCTGCGAGCG / GAGCGGCTGTCTCCACAAGT | FAM-TTCTGACCTGAAGGCTCTGCGCG-BHQ-1 |
| Open Reading Frame-1ab (ORF1ab) | China CDC | CCCTGTGGGTTTTACACTTAA / ACGATTGTGCATCAGCTGA | FAM-CCGTCTGCGGTATGTGGAAAGGTTATGG-BHQ1 |
| Open Reading Frame-1b (ORF1b) | Hong Kong University | TGGGGYTTTACRGGTAACCT / AACRCGCTTAACAAAGCACTC | FAM-TAGTTGTGATGCWATCATGACTAG-TAMRA |
| RNA-dependent RNA polymerase (RdRP) | Charité, Germany | GTGARATGGTCATGTGTGGCGG / CARATGTTAAASACACTATTAGCATA | RdRp 1: FAM-CAGGTGGAACCTCATCAGGAGATGC-BBQ |