| Literature DB >> 33647446 |
Masarra M Sakr1, Noha S Elsayed2, Ghadir S El-Housseiny1.
Abstract
Amid the COVID-19 outbreak, several bioinformatic analyses have been conducted on SARS-CoV-2 virus genome. Numerous studies rushed to fill the gap about this novel virus. Comparison with other related sequences, structural predictions of the produced proteins, determination of variations in amino acid residues and depiction of possible drug and vaccine targets have been the focus of scientific research from the beginning of this year. In addition to discussing the viral taxonomy, clinical features, life cycle, and genome organization, this review will focus on the recent updates in genome and viral proteins characterization and potential therapeutic and vaccine candidates developed so far. Comparative studies with related genomes and proteins provide understanding for the viral molecular mechanisms and suggest targets for therapeutics and vaccinology trials to stop the escalation of this new virus. This pandemic, with its resulting social and economic afflictions, will definitely have significant marks on our lives in the following years.Entities:
Keywords: Bioinformatics; COVID-19; Genome; SARS-CoV-2; Vaccine
Mesh:
Substances:
Year: 2021 PMID: 33647446 PMCID: PMC7910145 DOI: 10.1016/j.micpath.2021.104809
Source DB: PubMed Journal: Microb Pathog ISSN: 0882-4010 Impact factor: 3.848
Fig. 1Severe Acute Respiratory Syndrome Coronavirus 2 structure [12].
Fig. 2Schematic representation of SARS-CoV-2 genome organization. The genome consists of a single long open reading frame with two flanking untranslated regions (UTRs). The genome is arranged in the order of 5′ UTR (265 nucleotides)-orf1ab (the replicase gene)-orfS- orf3a-orfE-orfM- orf 6- orf 7a-orf 7b- orf8-orf N-orf 10–3′UTR (358 nucleotides). ORF1ab encodes for two large polyproteins, pp1a and pp1ab. Expression of these two polyproteins is achieved by ribosomal frame shifting. These two polyproteins are cleaved into 16 non-structural proteins by the action of two proteases, PLpro encoded by nsp3 and Mpro encoded by nsp5. The rest of the genome encodes for 4 structural proteins; Spike protein (S), Envelope protein (E), Membrane protein (M), nucleocapsid protein (N) in addition to six accessory proteins (coloured white); 3a, 6, 7a, 7b, 8 and 10. The main domains of the four structural proteins are shown. The prefusion structure of S protein is displayed showing the RBD highlighted in green, adopted from Wrapp et al. [40], and the Homopentamer of E pentameric protein which modulates ion channel activity of the E protein is also illustrated, adopted from Gupta et al. [49].
List of non-structural proteins (nsps) of SARS-CoV-2, their amino acid lengths, and functions.
| Non- structural protein nsp | Amino acid length and position | Function | References |
|---|---|---|---|
| nsp1 | 180 aa (residues 1–180) | Suppresses host innate immune functions. It supresses type I IFN expression in infected cells. This antagonism takes place by degradation of host mRNA and inactivation of host translational machinery. | [ |
| nsp2 | 638 aa (residues 181–818) | Interacts with host factors prohibitin 1 and 2 which are involved in many cellular processes. | [ |
| nsp3 | 1945 aa (residues A819-G2763) | It is the largest protein in the coronavirus genome. It is a large transmembrane protein, a papain-like protease (PLpro) responsible for the processing of viral polyproteins encoded from the genomic RNA to individual protein components. It was also shown to antagonize host innate immunity. | [ |
| nsp4 | 500 aa (residues K2764 – Q3263) | A multipass membrane protein responsible for membrane rearrangement. It interacts with nsp3 and 6 to form double membrane vesicles (DMV). This interaction is crucial for viral replication complex formation. | [ |
| nsp5 | 306 aa (residues S3264 – Q3569) | Mpro, chymotrypsin-like protease (3CLpro), the viral main protease considered the key enzyme which digests the viral polyprotein at no less than 11 conserved sites including its autolytic cleavage from polyproteins pp1a and pp1ab. | [ |
| nsp6 | 290 aa (residues S3570-Q3859 | A transmembrane scaffold protein that forms a complex with nsp3 and nsp4 (DMV formation) | [ |
| nsp7 | 83 aa (residues S3860 – Q3942) | A cofactor that binds to nsp8 forming hexadecameric complex and acts as processivity clamp for RNA polymerase and primase | [ |
| nsp8 | 198 aa (residues A3943 – Q4140) | A cofactor that binds to nsp7 forming hexadecameric complex and acts as processivity clamp for RNA polymerase and primase | [ |
| nsp9 | 113 aa (residues N4141 – Q4253) | A ss-RNA binding protein. Putatively stabilizes nascent nucleic acid during replication or transcription protecting it from nucleases. | [ |
| nsp 10 | 139 aa (residues A4254 – Q4392) | A small protein believed to act as a multifunctional cofactor. In replication it interacts with nsp14 involved in replication fidelity and also interacts with nsp16. | [ |
| nsp 11 | 11-23 aa (residues S4393 – V4405) | It is a short peptide at the end of ORF1a and represents the frameshift boundary. It is a pp1a cleavage product of an unknown function. | [ |
| nsp12 | 932 aa (residues S4393 – Q5324) | It is the RNA-dependant RNA polymerase (RdRp) responsible for both replication and transcription of the viral genome. Formation of new viral RNA mediated by RdRp is believed to be the key step in viral life cycle. | [ |
| nsp 13 | 601 aa (residues A5325 – Q5925) | A helicase composed of an N-terminal metal binding domain and a helicase conserved domain. It plays an important role in viral replication by having the ability to unwind duplex RNA and DNA. | [ |
| nsp14 | 527 aa (residues A5926 – Q6452 | It has 3′-5′ exoribonuclease activity which is important in proof reading during viral RNA replication and has N-7-Guanine methyl transferase activity which is involved in mRNA capping. | [ |
| nsp15 | 346 aa (residues S6453 – Q6798) | It is a poly(U)-specific endoribonuclease enzyme which cleaves RNA at the 3′ end of uridylates. Loss of nsp15 was found to affect viral replication and virulence. | [ |
| nsp16 | 298 aa (residues S6799 – N7096 | It has 2-O methyl transferase activity which is essential for capping of viral mRNA to escape host detection. | [ |
Candidate COVID-19 Vaccines in Phase 3 clinical evaluation according to the WHO as of January 3, 2021 [159].
| ID | Vaccine platform acronymn | Vaccine platform description | Type of candidate vaccine | Route of administration | Developers | Phase |
|---|---|---|---|---|---|---|
| 1 | IV | Inactivated virus | SARS-CoV-2 vaccine (inactivated) | IM | Sinovac Research and Development Co., Ltd | Phase 3 |
| 2 | IV | Inactivated virus | Inactivated SARS-CoV-2 vaccine (Vero cell) | IM | Sinopharm + Wuhan Institute of Biological Products | Phase 3 |
| 3 | IV | Inactivated virus | Inactivated SARS-CoV-2 vaccine (Vero cell) | IM | Sinopharm + Beijing Institute of Biological Products | Phase 3 |
| 4 | VVnr | Viral vector (Non-replicating) | ChAdOx1-S - (AZD1222) (Covishield) | IM | AstraZeneca + University of Oxford | Phase 3 |
| 5 | VVnr | Viral vector (Non-replicating) | Recombinant novel coronavirus vaccine (Adenovirus type 5 vector) | IM | CanSino Biological Inc./Beijing Institute of Biotechnology | Phase 3 |
| 6 | VVnr | Viral vector (Non-replicating) | Gam-COVID-Vac Adeno-based (rAd26-S + rAd5-S) | IM | Gamaleya Research Institute; Health Ministry of the Russian Federation | Phase 3 |
| 7 | VVnr | Viral vector (Non-replicating) | Ad26.COV2·S | IM | Janssen Pharmaceutical | Phase 3 |
| 8 | PS | Protein subunit | SARS-CoV-2 rS/Matrix M1-Adjuvant (Full length recombinant SARS CoV-2 glycoprotein nanoparticle vaccine adjuvanted with Matrix M) | IM | Novavax | Phase 3 |
| 9 | RNA | RNA based vaccine | mRNA -1273 | IM | Moderna + National Institute of Allergy and Infectious Diseases (NIAID) | Phase 3 |
| 10 | RNA | RNA based vaccine | BNT162 (3 LNP-mRNAs) | IM | BioNTech + Fosun Pharma; Jiangsu Provincial Center for Disease Prevention and Control + Pfizer | Phase 2/3 |
| 11 | PS | Protein subunit | Recombinant SARS-CoV-2 vaccine (CHO Cell) | IM | Anhui Zhifei Longcom Biopharmaceutical + Institute of Microbiology, Chinese Academy of Sciences | Phase 3 |
| 12 | RNA | RNA based vaccine | CVnCoV Vaccine | IM | CureVac AG | Phase 2/3 |
| 13 | DNA | DNA based vaccine | INO-4800+electroporation | ID | Inovio Pharmaceuticals + International Vaccine Institute | Phase 2/3 |
| 14 | IV | Inactivated virus | Whole-Virion Inactivated SARS-CoV-2 Vaccine (BBV152) | IM | Bharat Biotech International Limited | Phase 3 |
| 15 | VLP | Virus like particle | Coronavirus-Like Particle COVID-19 (CoVLP) | IM | Medicago Inc. | Phase 2/3 |