| Literature DB >> 34220199 |
Ritesh Gorkhali1, Prashanna Koirala1, Sadikshya Rijal1, Ashmita Mainali1, Adesh Baral1, Hitesh Kumar Bhattarai1.
Abstract
SARS-CoV-2 virus, the causative agent of COVID-19 pandemic, has a genomic organization consisting of 16 nonstructural proteins (nsps), 4 structural proteins, and 9 accessory proteins. Relative of SARS-CoV-2, SARS-CoV, has genomic organization, which is very similar. In this article, the function and structure of the proteins of SARS-CoV-2 and SARS-CoV are described in great detail. The nsps are expressed as a single or two polyproteins, which are then cleaved into individual proteins using two proteases of the virus, a chymotrypsin-like protease and a papain-like protease. The released proteins serve as centers of virus replication and transcription. Some of these nsps modulate the host's translation and immune systems, while others help the virus evade the host immune system. Some of the nsps help form replication-transcription complex at double-membrane vesicles. Others, including one RNA-dependent RNA polymerase and one exonuclease, help in the polymerization of newly synthesized RNA of the virus and help minimize the mutation rate by proofreading. After synthesis of the viral RNA, it gets capped. The capping consists of adding GMP and a methylation mark, called cap 0 and additionally adding a methyl group to the terminal ribose called cap1. Capping is accomplished with the help of a helicase, which also helps remove a phosphate, two methyltransferases, and a scaffolding factor. Among the structural proteins, S protein forms the receptor of the virus, which latches on the angiotensin-converting enzyme 2 receptor of the host and N protein binds and protects the genomic RNA of the virus. The accessory proteins found in these viruses are small proteins with immune modulatory roles. Besides functions of these proteins, solved X-ray and cryogenic electron microscopy structures related to the function of the proteins along with comparisons to other coronavirus homologs have been described in the article. Finally, the rate of mutation of SARS-CoV-2 residues of the proteome during the 2020 pandemic has been described. Some proteins are mutated more often than other proteins, but the significance of these mutation rates is not fully understood.Entities:
Keywords: SARS-CoV; SARS-CoV-2; function; proteins; structure
Year: 2021 PMID: 34220199 PMCID: PMC8221690 DOI: 10.1177/11779322211025876
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Figure 1.Genome organization of SARS-CoV and SARS-CoV-2. The upper panel shows the genomic organization of SARS-CoV. It contains two polyproteins pp1a and pp1b synthesized from ORF1a and 1b. These polypeptides undergo a series of proteolytic cleavages to form 16 nonstructural proteins. These 16 nonstructural proteins are encoded by the first two thirds of the genome (figure not drawn to scale). The second one third of the genome encodes for four structural proteins S, E, M, and N. Interspersed within these genes are genes for the expression accessory proteins. SARS-CoV-2, a more recently discovered virus, has genomic organization almost identical to SARS-CoV. Accessory protein 8 for SARS-CoV-2 is not divided into 8a and 8b as in SARS-CoV and two ORF for spike proteins are not present in this virus. In addition, SARS-CoV-2 protein contains accessory protein 10 potentially not present in SARS-CoV.
ORF indicates Open Reading Frames.
Tabulation of amino acid lengths of SARS-CoV and SARS-CoV-2 proteins and percentage identity for common proteins of the two species.
| Gene | Length of amino acids in SARS-CoV-2 | Length of amino acids in SARS-CoV | Percentage identity between homologous proteins | Number of amino acid residues with mutation rate greater than 0.01 during the pandemic (N) | N/amino acid length |
|---|---|---|---|---|---|
| Nsp1 | 180 | 180 | 84.44 | 0 | 0.000 |
| Nsp2 | 628 | 628 | 68.34 | 5 | 0.008 |
| Nsp3 | 1922 | 1922 | 75.82 | 10 | 0.005 |
| Nsp4 | 500 | 500 | 80.00 | 1 | 0.002 |
| Nsp5 | 306 | 306 | 96.08 | 5 | 0.016 |
| Nsp6 | 390 | 390 | 88.15 | 6 | 0.015 |
| Nsp7 | 83 | 83 | 98.80 | 1 | 0.012 |
| Nsp8 | 198 | 198 | 97.47 | 0 | 0.000 |
| Nsp9 | 113 | 113 | 97.35 | 1 | 0.009 |
| Nsp10 | 139 | 139 | 97.12 | 0 | 0.000 |
| Nsp11 | 13 | 13 | 84.60 | 0 | 0.000 |
| Nsp12 | 932 | 932 | 96.14 | 7 | 0.008 |
| Nsp13 | 601 | 601 | 99.83 | 5 | 0.008 |
| Nsp14 | 527 | 527 | 95.07 | 2 | 0.004 |
| Nsp15 | 346 | 346 | 88.73 | 4 | 0.012 |
| Nsp16 | 298 | 298 | 93.29 | 1 | 0.003 |
| S | 1273 | 1255 | 75.96 | 21 | 0.016 |
| ORF3a | 275 | 274 | 72.36 | 9 | 0.033 |
| ORF3b | 151 | 154 | No significant similarity | – | – |
| E | 75 | 76 | 94.74 | 0 | 0.000 |
| M | 222 | 221 | 90.54 | 0 | 0.000 |
| ORF6 | 61 | 63 | 68.85 | 0 | 0.000 |
| ORF7a | 121 | 122 | 85.35 | 1 | 0.008 |
| ORF7b | 43 | 44 | 81.40 | 1 | 0.023 |
| ORF8a | 121 (only ORF8) | 39 | 31.71 | 2 | 0.017 |
| ORF8b | 121 (only ORF8) | 84 | 40.48 | ||
| N | 419 | 422 | 90.52 | 16 | 0.038 |
| ORF9a | 97 | 98 | 72.45 | – | – |
| ORF9b | 73 | 70 | 77.14 | 4 | 0.055 |
| ORF10 | 38 | – | – |
Abbreviation: ORF, open reading frames.
Open reading frames of SARS-CoV-2 proteins were detected from GenBank accession number NC_045512.2. Similarly, open reading frames of SARS-CoV Tor2 were detected from GenBank accession number AY274119. Pairwise alignment of the proteins was conducted using NCBI blast and percentage identity was tabulated. In the pblast algorithm, max target sequence of 100, short queries automatically adjusted to parameters for short input sequences, expect threshold of 0.05 and world size of 6 was set. BLOSUM62 matrix with gap costs of 11 for existence and 1 for extension was set. Number of amino acids with mutation rate higher than 0.01 and mutational frequency rate was tabulated in the last two columns.
Figure 13.Cartoon representation of crystal structure of SARS-CoV-2 nsp9 dimer structure showing loops (green), β-sheets (yellow), and helices (red) (PDB 6WXD). The inter-subunit interactions to form a dimer are due to van der Waals interactions between the interfacing copies of α1 helix C-terminal as a result of self-association of GxxxG protein-protein binding motif.
Figure 2.Cap0 and Cap1 activity. Methylation is carried out at the 5ʹ end of newly synthesized RNA using nsp14 (cap0). Methylation is then again carried out at the 2ʹ oxygen of terminal ribose using nsp10 and 14.
Structure and function of nonstructural proteins of SARS-CoV and SARS-CoV-2.
| Name | Functional name | Structure solved (SARS-CoV-2) | Structure solved (SARS-CoV) | Structure description | Function |
|---|---|---|---|---|---|
| Nsp1 | Virulent factor | Cryo-EM Structure and X-Ray Crystallography structure PDB:7K5I, 7K3 N, 7K7 P | NMR Structure | The SARS-CoV-2 nsp113-127 like that of SARS-CoV hosts a unique topological arrangement, which gives to the formation of a six-stranded (n = 6) beta-barrel. In addition, there is an alpha1 helix which is positioned as a cap along one opening of the beta-barrel, two 310 helices that run parallel to each other and the beta5 strand which is though not a part of the beta-barrel but forms a beta-sheet interaction with the beta4 strand. As evident in the crystal structure of nsp113 127, nsp1 of SARS-CoV-2 has large number of flexible loops. | It inhibits host translation, causes invasion from host immune response and leads to efficient viral gene expression in infected cells.[ |
| Nsp2 | Endosome-associated protein | – | – | N/A | It is entirely unknown. |
| Nsp3 | Cutting and untagging protein | X-Ray Crystallography PDB: 6YWL, 6WEY, 6WOJ, 7CZ4, 7CJD, 7C33, 7LLZ, 7LOS, 7CMD, 7JIW,7LLZ | X-Ray Crystallography PDB: 4MM3, | It contains two transmembrane domains, which is released from pp1a/1ab by the papain-like protease domain, which is a part of nsp3 itself. | It releases nsp1 and nsp2 from polyprotein, interacts with other viral nsps as well as RNA to form replication/transcription complex
|
| Nsp4 | Double-membrane vesicle maker | – | – | It is predicted to contain four transmembrane domains, both termini projecting at the cytoplasmic side of the membrane, and three loop regions. | Nsp3, 4, and 6 are predicted to function to nucleate and anchor viral replication complexes on double-membrane vesicles in the cytoplasm.[ |
| Nsp5 | Protease (3CLpro) | X-Ray Crystallography PDB: 6M2N, 2M2N, 7L0D, 6M2Q, 7JKV, 7JQ3, 7JPY, 7JPZ, 7JQ0, 7JQ1, 7JQ4, 7JQ5, 7JQ2 | X-Ray Crystallography PDB: 2HOB, 3SN8 | 3CLpro monomer has 3 domains, domain I, domain II, domain III and a long loop. The active site of 3CLpro is located in the gap between domains I and II, and has a CysHis catalytic dyad. | 3CLpro is first automatically cleaved from polyproteins to produce mature enzyme, which then cleaves downstream nsps at 11 sites to release nsp4-nsp16.
|
| Nsp6 | Double-membrane vesicle factory | – | – | Nsp6 protein possesses 7 putative transmembrane helices located in endoplasmic reticulum (ER). | Nsp3, 4 and 6 are predicted to function to nucleate and anchor viral replication complexes on double-membrane vesicles in the cytoplasm.[ |
| Nsp7 | Copy assistant | Nsp7-nsp8-nsp12 structure solved (X-Ray Crystallography) PDB: 7JLT, 6YHU, 7DCD, 7BW4, 6M71 | Nsp7-nsp8 structure solved (X-Ray Crystallography) PDB: 2AHM | It has a hexadecameric structure with 8 nsp7 and nsp8s that encircles double-stranded RNA. | SARS-CoV nsp7 dimerizes and interacts with other proteins such as nsp5, nsp8, nsp9, and nsp13.
|
| Nsp8 | Primase | Nsp7-nsp8-nsp12 structure solved (X-Ray Crystallography) PDB: 7JLT, 6YHU, 7DCD, 7BW4, 6M71 | Nsp7-nsp8 structure solved (X-Ray Crystallography) PDB: 2AHM | It has a hexadecameric structure with 8 nsp7 and nsp8s that encircles double-stranded RNA. | Nsp8 enzyme is able of de novo initiate replication and has been proposed to operate as primase
|
| Nsp9 | RNA-binding protein | X-Ray Crystallography PDB: 6WXD | X-Ray Crystallography PDB: 3EE7 | It consists of an unusual fold and its core is made up of 6-stranded enclosed β-barrel and a series of extended loops projects outward from it. | It is a single-stranded RNA-binding protein, which displays an oligosaccharide/oligonucleotide binding fold.
|
| Nsp10 | Methyltransferase stimulator | Solved as nsp10-nsp16-SAM complex (X-Ray Crystallography) PDB: 7BQ7, 7JYY | Solved as nsp10-nsp16-SAM complex (X-Ray Crystallography) PDB:3R24 | It comprises a central anti-parallel pair of β-strands, surrounded by a broad crossover loop on one side. On the other side, a helical domain with loops is present, which generates 2 zinc fingers. | It stimulates nsp16 to execute S-adenosyl-L-methionine (SAM)-dependent methyltransferase (MTase) activity
|
| Nsp12 | RNA-dependent RNA polymerase | Solved as nsp7-nsp8-nsp12 (Electron Microscopy) PDB: 6M71, 7JLT, 6YHU, 7DCD, 7BW4, 7AAP | Solved as nsp7-nsp8-nsp12 (Electron Microscopy) PDB: 6NUR | It consists of N-terminal and polymerase domain which resembles a cupped “right hand” consisting a finger, a palm, and a thumb subdomain | Nsp12, in association with nsp7, nsp8, and other essential components of the RNA synthesis machinery, forms a viral replication complex.
|
| Nsp13 | Helicase | X-Ray Crystallography PDB: 6ZSL, 7NI0, 7NN0, 7NNG | – | Nsp13 adopts a triangular pyramid shape comprising five domains: two “RecA-like” domains (1A and 2A), and 1B domain, N-terminal zinc-binding domain (ZBD) and stalk domain, which connects ZBD and 1B domain. | It unwinds dsRNA or DNA with a 5′→3′ polarity, using energy from nucleotide hydrolysis.
|
| Nsp14 | Proofreading exonuclease | – | Nsp14-nsp10 complex solved (X-Ray Crystallography) PDB: 5C8U | The ExoN domain features a core, twisted β-sheet consisting of five β-strands with one Mg2+ ion at its active site. The N7-MTase domain features a MTase fold with central β-sheet consisting of five β-strands. β1 and β2 sheets have a ligand-binding cavity in-between. | Its N-terminal exoribonuclease domain has a proofreading role, which prevents lethal mutagenesis, whereas the C-terminal domain functions as a (guanine-N7) methyltransferase (N7-MTase) for mRNA capping.
|
| Nsp15 | Endonuclease | X-Ray Crystallography PDB: 7KEG, 7KEH, 7KF4 | Catalytically inactive mutant version of Nsp15 solved (X-Ray Crystallography) PDB: 2RHB | Nsp15 forms dimers of trimers, which finally assembles into a hexamer. Each subunit consists of N-terminal domain, a middle domain and C-terminal catalytic endonuclease domain. | Nsp15 preferentially cleaves 3′ of uridines in a manganese dependent manner. This is thought to be an important way for the virus to hide from antiviral defense.
|
| Nsp16 | Methyltransferase | Solved as nsp10-nsp16-SAM complex (X-Ray Crystallography) PDB: 7BQ7, 7JYY | Solved as nsp10-nsp16-SAM complex (X-Ray Crystallography) PDB:3R24 | It consists of Rossmann-like β-sheet fold surrounded by 11 α-helices, 7 β-strands, and loops in the 2′-O-MTase catalytic core. | Nsp16 recruits N7-methylated capped RNA and SAM which promotes the assembly of the enzymatically active nsp10/nsp16 complex. This complex converts 7mGpppG (cap-0) into 7mGpppG2′Om (cap-1) RNA by 2′-OH methylation of N1.
|
Abbreviations: Cryo-EM, cryogenic electron microscopy; DNA, deoxyribonucleic acid; NMR = nuclear magnetic resonance; RdRp, RNA-dependent RNA polymerase; RNA, ribonucleic acid.
Figure 3.Hypothetical SARS-CoV-2 entry and replication inside human host cell. (A) Human host cell with angiotensin-converting enzyme 2 (ACE 2) receptors, which attaches the virus and aids its entry into the cell. (B) Positive-sense single-stranded viral RNA within the cell. (C) Translation of the RNA into 1ab proteins (pp1a and pp1b). (D) Cleavage of pp1a and pp1b into 16 nonstructural proteins by virally encoded chymotrypsin-like protease and two papain-like proteases. (E) Induction of double-membraned vesicles (DMV) and localization of cleaved nsps with the help of nsp3, nsp4 and nsp6. (F) nsp8 acts as a primase for RNA replication. (G) nsp7, nsp8, nsp12, and nsp14 assist polymerase and exonuclease activities. (H) mRNA capping is assisted by nsp10, nsp13, nsp14 and nsp16. (I) Finally, replicated RNA and other translated viral proteins assembled into a new virus.
Figure 4.Crystal structure of SARS-CoV-2 nsp1 globular domain (Cartoon representation) comprising of residues 13 to 127 showing red helix, yellow sheet and green loop (PDB 7K3N). The structure of SARS-CoV-2 nsp1, like that of SARS-CoV, has six-stranded beta-barrel (yellow) and has additionally an alpha1 helix (red) and large number of flexible loops (green).
Figure 5.SARS-CoV-2 3CL protease (3CL pro) in complex with a novel inhibitor in cartoon representation showing red helix, yellow sheet, and green loop (PDB 2M2 N). 3CL pro has two chains (A and B) with three domains (I, II, III). A long loop connects domains II and III. The B barrels of each domain I and II are composed of six-stranded B-sheets (yellow) and domain III is composed of mainly alpha helices (red).
Figure 6.Crystal structure of SARS-CoV-2 nsp3 macrodomain (Cartoon representation) in complex with ADP ribose and showing five (A, B, C, D, E) different chains in different colors where ligands and water are shown in ball and stick representation (PDB 6YWL). SARS-CoV-2 encodes a large, multidomain nsp3 with an ADP ribose phosphate (ADRP) domain (also known as macrodomain), which is thought to interfere with the host immune response.
Figure 7.Crystal structure of RNA-dependent RNA polymerase of SARS-CoV-2 consisting of four chains represented by nsp12 (chain A), nsp8 (chain B, D), and nsp7 (chain C) in four different colors green, sky-blue, yellow, and purple, respectively (PDB 7W4Y).
Figure 8.RNA-dependent RNA polymerase activity and exonuclease proofreading activity. The figure shows, at a biochemical level, what reactions take place due to RNA-dependent RNA polymerase and 3ʹ to 5ʹ exonuclease.
Figure 9.Cartoon representation of SARS-CoV nsp10-nsp14 complex structure showing loops (green), β-sheets (yellow), and helices (red) (PDB 5C8U). The nsp14 ExoN domain is stabilized by nsp10. The ExoN domain features a core, twisted β-sheet consisting of five β-strands and the N7-MTase domain consists of five β-strands.
Figure 10.Cartoon representation of SARS-CoV-2 nsp13 structure showing loops (green), β-sheets (yellow), and helices (red) (PDB 6ZSL).
Figure 11.Cartoon representation of SARS-CoV-2 nsp10-nsp16 complex structure showing loops (green), β-sheets (yellow), and helices (red) (PDB 6W4 H). Nsp10’s positively charged and hydrophobic surface interacts with a hydrophobic pocket and a negatively charged nsp16 surface, which helps to stabilize the SAM binding site.
Figure 12.Cartoon representation of SARS-CoV-2 nsp15 structure showing loops (green), β-sheets (yellow), and helices (red)(PDB 6VWW). SARS-CoV-2 nsp15 generates dimers of trimmers, which finally assembles into a hexamer where each subunit of nsp15 contains 10 α-helices and 21 β-strands.
Structure and function of structural proteins.
| Name of the protein | Structure solved (SARS-CoV-2) | Structure solved (SARS-CoV) | Description of the structure | Function |
|---|---|---|---|---|
| Spike (S) protein | SOLVED: | Solved for: SARS-CoV (Cryo-EM) | It forms homotrimer protruding from the viral surface. There are two subunits: S1 and S2. The S1 subunit houses the receptor-binding domains at the distal end. | It causes anchorage with a host cell receptor before merging. It is also involved in viral entry into the host cell.
|
| Nucleocapsid (N) protein | Solved for SARS-CoV-2-RNA binding domain | Solved for SARS-CoV | SARS-CoV-2-N-NTD crystal shows orthorhombic crystal packaging mode in which there are four monomers in one asymmetry unit. Each monomer in one asymmetric unit has the same type of right handed (loop)-(β-sheet core)-(loop) forming a sandwich structure. It consists of five antiparallel β-strands with a single short 310 helix and a sticking out β-hairpin between β2 and β5 strands, this structure as a whole contributes to form a β-sheet core. The structure of SARS-CoV-2-N-NTD looks similar to the hand shape (fingers, palm and wrist). | Nucleocapsid Protein (N) is believed to have multifunctional activities like forming helical ribonucleoprotein (RNP) complex during packaging the RNA genome, during replication and regulating viral RNA synthesis, transcription and regulating infected cell metabolism.[ |
| Membrane(M) protein | Not solved | Not solved | The M protein consists of three transmembrane domain which are flanked by a short glycosylated amino-terminal domain and a long carboxy-terminal tail (Cytoplasmic domain) outward and inward within the viral envelope, respectively | M protein plays a vital role in the assembly of viruses through protein-protein interactions: M-nucleocapsid (N), M-M and M-spike(S) interaction.
|
| Envelope (E) protein | NOT SOLVED: For SARS-CoV-2 | SOLVED: For another SARS-CoV via NMR | For SARS-CoV, two distinct subunits have been identified: hydrophobic domain with a transmembrane domain (TMD) and a charged cytoplasmic tail. | It reaches heavy localization at sites of intracellular transport, Viral assembly and budding at ER, Golgi complex ERGIC.[ |
Abbreviations: ER, endoplasmic reticulum; ERGIC, endoplasmic reticulum-Golgi body intermediate compartment; NMR = nuclear magnetic resonance; RNA, ribonucleic acid; RNP, ribonucleoprotein; TMD, transmembrane domain.
Figure 14.Surface representation of a closed trimer of SARS-CoV-2 S protein (A) composed of three chains shown in purple, green, and cyan. A cartoon representation of the side view of SARS-CoV-2 S trimer, (B) cartoon representation of top view of the SARS-CoV-2 trimer (C) showing the (closed) hACE2-binding S-B domain (pdb 6VXX)
Figure 16.A cartoon representation showing a trimer of the S2 subunit of SARS-CoV-2 S protein (PDB 6LXT).
Figure 15.A cartoon representation of hACE2-binding S-B domain (A) of SARS-CoV-2 S protein. A cartoon representation of SARS-CoV-2 S-B domain (purple) bound with an hACE2 (red) (B). A SARS-CoV-2 S-B domain (C) showing receptor-binding motif (RBM) comprising amino acids 438 to 506 in red and the core in cyan (pdb 6M0J).
Figure 17.Structure of SARS-CoV-2 nucleocapsid protein: (A) Structure of SARS-CoV-2-N-NTD (Cartoon representation), monomers in one asymmetric unit. There are a total of four monomers, which are represented by different colors (PDB 6M3M). (B) structure of SARS-CoV-2-N-NTD (Cartoon representation) showing green loops, yellow β sheets, and red 310 helices (η) (PDB 6M3M). (C) Structure of SARS-CoV-2-N-NTD showing (loop)-(β-sheet core)-(loop) with β-hairpin sticking out from β2 and β5 regions. Here η1 represent the 310 helix.
Figure 18.Top and side view, repectively, of a SARS-CoV E protein pentameric ion channel (A and B) in ribbon diagram represntation. Ribbon diagram representation of a single monomer of E protein (C) that forms the ion channel pentamer (pdb 5X29).
Figure 19.Transmembrane domains predicted by TMHMM web server in which there are three domains connected by two linker peptides.
Structure and function of accessory proteins.
| Structure solved (SARS-CoV-2) | Structure solved (SARS-CoV) | Description of the structure | Function | Functional name | |
|---|---|---|---|---|---|
| 3a | Solved for SARS-CoV-2157
| Not solved | It is an ion channel with 275 amino acid residues (274 amino acid residues in SARS-CoV). It is an O-glycosylated protein with three transmembrane helices followed by cytosolic domains with multiple β-strands per protomer chain and an N-terminal ectodomain and a C-terminal endodomain in both intracellular and plasma membranes.[ | Interactions with some structural proteins (S, M and E) and observed inducing apoptosis in vitro
| Hole borer |
| 3b | Not solved | Not solved | It is 154 amino acids long for SARS-CoV (151 for SARS-CoV-2).
| It induces apoptosis and necrosis, and hinders antiviral innate immune response.
| Immune modulator |
| 6 | Not solved | Not solved | It is 63 amino acids long in SARS-CoV (61 for SARS-CoV-2).
| It suppress IFN induction and IFN signaling pathway.[ | Type 1 interferon antagonist |
| 7a | Solved for SARS-CoV-2 | Solved for SARS-CoV
| It is a type I transmembrane protein with 121 amino acids (122 for SARS-CoV that contain 15-amino-acid N-terminal signal peptide sequence, an 81-amino-acid luminal domain, a 21-amino-acid transmembrane domain and a short C-terminal tail.
| It induces apoptosis, inhibits cellular protein synthesis and arrests cell cycle at the GO/G1 phase.
| Immune evador |
| 7b | Not solved | Not solved | It is an integral membrane protein with 44 amino acids (43 for SARS-CoV-2).
| Immune evador | |
| 8 | Solved for SARS-CoV-2 (X-Ray Diffraction) PDB: 7JTL | 8a—Not solved | It is a 39-amino-acid-long polypeptide in which residues 1-35 are identical to the N-terminal of 8ab. | It is found to enhance replication in some studies and shows interaction with other structural proteins.[ | Immune modulator |
| 8b—Not solved | It is a 84-amino-acid-long polypeptide and residues 9-84 are identical to the C terminal of 8ab. | ||||
| 9b | Solved for SARS-CoV-2 (X-Ray Crystallography) PDB: 6Z4U. | Solved for SARS-CoV
| It is 70 amino acids long protein (73 in case of SARS-CoV-2). It contains a novel dimeric fold having β strands like structure with an amphipathic surface and a central hydrophobic cavity.
| It shows interaction with some nonstructural proteins, and incorporates into mature virions.
| Internal N protein |
| 10 | Not solved | Not found | It is a 38-amino-acid-long protein. It has one potential TM domain. Presence of one α-helix and two β-strands and β-α-β motif with a β-molecular recognition feature occurring in the first β-strand is revealed through the prediction of secondary structural elements of this protein | It has potential immune modulatory role. | Mystery protein |
Abbreviations: IFN, interferon; ORF, Open Reading Frames; TM, transmembrane.
Figure 20.Structure of SARS-CoV-2 ORF3a protein (Cartoon representation). (A) Showing green chain A, blue chain B. (B) Showing green loops, yellow β sheets, and red alpha helices (PDB 6XDC).
ORF indicates Open Reading Frames.
Figure 21.Structure of SARS-CoV-2 ORF7a protein (cartoon representation) showing green loops, yellow β sheets (PDB 7Ci3).
ORF indicates Open Reading Frames.
Figure 22.Structure of SARS-CoV-2 ORF8 protein (cartoon representation) showing green loops, yellow β sheets (PDB 7JTL).
ORF indicates Open Reading Frames.
Figure 23.Structure of SARS-CoV-2 ORF9b protein (Cartoon representation). (A) Showing green chain A, blue chain B. (B) Showing green loops, yellow β sheets, and red alpha helices (PDB 6Z4U).
ORF indicates Open Reading Frames.