| Literature DB >> 34337272 |
Choon Kim1, Kiran V Mahasenan1, Atul Bhardwaj1, Olaf Wiest1, Mayland Chang1, Shahriar Mobashery1.
Abstract
The severe acute respiratory syndrome coronavirus-2 (Entities:
Year: 2021 PMID: 34337272 PMCID: PMC8315141 DOI: 10.1021/acsomega.1c02984
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Figure 1Genomic structure of SARS-CoV-2.
Available Structures (X-ray, NMR, or EM) are from 17 Proteins Encoded in the SARS-CoV-2 Genome and Three Proteins Encoded in the SARS-CoV Genomea
| protein | description | experimental structures at PDB |
|---|---|---|
| NSP1 | inhibits host translation by interacting with the 40S ribosomal
subunit[ | SARS-CoV-2; 7JQB (cryoEM), 7K7P and 7K3N |
| NSP2 | involved in the disruption of intracellular
host signaling
during SARS-CoV infections[ | |
| NSP3 | multidomain protein including
the papain-like protease domain; involved
in cutting the N-terminal part of PP1a
and PP1ab to mature NSP1, NSP2, and NSP3 itself and altering many
of the infected host proteins; interacts with NSP4[ | SARS-CoV-2; PLpro domain (6WUU, 6WX4, 6W9C, 6WZU, 6YVA, 6WRH, 7KRX, 7KOK, and 7D6H), ARH domain (6WOJ, 6WEY, 6YWK, 6YWL, 6YWM, 6WEN, 6VXS, 6W02, 6WCF, 6W6Y, 7KXB, 7LG7, 6Z5T, 7BF5, 7C33, 7CZ4, and 7KG3) |
| NSP4 | forms virally induced cytoplasmic double-membrane vesicles necessary for viral replication[ | |
| NSP5 | 3C-like protease maturing most
of the NSPs (NSP4–NSP16) from
PP1ab[ | SARS-CoV-2; 137 structures (6LU7, 6M2Q, and 135 more) |
| NSP6 | forms virally induced cytoplasmic double-membrane vesicles necessary for viral replication[ | |
| NSP7 | cofactor of RNA-dependent RNA
polymerase (NSP12)[ | SARS-CoV-2; 7BV1, 7BW4, 7BV2, 6WTC, 6WIQ, 6WQD, 6M71, 7BTF, 6YYT, 6YHU, and 7DCD |
| NSP8 | cofactor of RNA-dependent RNA
polymerase (NSP12)[ | SARS-CoV-2; 7BV1, 7BW4, 7BV2, 6WTC, 6WIQ, 6WQD, 6M71, 7BTF, 6YYT, 6YHU, and 7DCD |
| NSP9 | single-stranded RNA-binding protein; interacts with
NSP8[ | SARS-CoV-2; 6W4B, 6WXD, and 6W9Q |
| NSP10 | critical co-factor for the activity
of 3′ → 5′ exonuclease
(NSP14) and 2′- | SARS-CoV-2; 6WKQ, 7C2J, 7C2I, 6WKS, 6W61, 6WVN, 6WQ3, 6WJT, 6WRZ, 6W4H, 6W75, 6YZ1, 7KOA, 7L6T, 6ZPE, and 6ZCT |
| NSP11 | the leftover of PP1a fragmentation[ | |
| NSP12 | RNA-dependent RNA polymerase,
responsible for replication and transcription of the viral RNA genome[ | SARS-CoV-2; 7BTF, 7BV1, 7BV2, 6YYT, 7BW4, 7L1F, 7B3B, and 7D4F |
| NSP13 | helicase; interacts with NSP12[ | SARS-CoV-2; 7NN0 and 7NIO |
| NSP14 | N-terminal 3′ → 5′ exonuclease domain and C-terminal N7-guanine-methyltransferase domain[ | SARS-CoV; 5C8U, 5C8S, 5C8T, and 5NFY |
| NSP15 | uridylate-specific endoribonuclease[ | SARS-CoV-2; 6W01, 6WXC, 6WLC, 6X1B, 6VWW, 7K0R, 7KEG, 5S71, 7K1O, and 7K9P |
| NSP16 | 2′- | SARS-CoV-2; 6WKQ, 7C2J, 7C2I, 6WKS, 6W61, 6WVN, 6WQ3, 6WJT, 6WRZ, 6W4H, 6W75, 6YZ1, 7KOA, and 7L6T |
| S | spike glycoprotein interacting with the host
receptor protein
ACE2[ | SARS-CoV-2; receptor-binding domain (6VSB, 7BZ5, 6M0J, 6M17, 6W41, 7C01, 6LZG, 6YM0, 6YLA, 6YOR, and 6VW1), S2 domain (6LXT and 6LVN), cryoEM of the full-length (6VXX, 6WPT, 6WPS, 6VYB, 6X29, 6X2C, 6X2A, and 6X2B) |
| ORF3a | accessory protein; activates the NLRP3
inflammasome[ | SARS-CoV-2; 7KJR (cryoEM) |
| ORF3b | accessory protein; IFN antagonist[ | |
| E | envelope protein; mediates virus morphogenesis and assembly[ | SARS-CoV; 5X29 and 2MM4 |
| M | membrane glycoprotein;
involved in virus morphogenesis and
assembly[ | |
| ORF6 | accessory protein; antagonizes type I interferon[ | |
| ORF7a | accessory protein; inhibits the antiviral function of bone
marrow stromal antigen 2 (BST-2)[ | SARS-CoV-2; 6W37 and 7CI3 |
| ORF7b | accessory protein and a structural
component of the SARS-CoV virion[ | SARS-CoV; 6W37 |
| ORF8 | accessory protein; important for interspecies
transmission
of the virus; activates the NLRP3 inflammasome[ | SARS-CoV-2; 7JX6 |
| ORF9b | accessory
protein; suppresses innate immunity by targeting
mitochondria and the MAVS/TRAF3/TRAF6 signalosome[ | SARS-CoV-2; 7KDT (cryoEM) |
| ORF9c | unknown[ | |
| N | nucleocapsid
phosphoprotein; packages the viral RNA genome[ | SARS-CoV-2; N-terminal RNA-binding domain (6YI3, 6M3M, 6WKP, 6VYO, and 7CR5), C-terminal dimerization domain (6WZQ, 6WZO, 6WJI, 6YUN, and 7C22) |
| ORF10 | unknown; may not be produced[ |
PDB code(s) of the structures are listed for each protein, when available, together with the attributed functions for the viral proteins.
Figure 2A SARS-CoV-2 gene is preceded by a 6xHis tag, a Twin-Strep tag, and a TEV cleavage site in pETHST. All synthetic SARS-CoV-2 genes except for the nsp1 were inserted between NdeI and XhoI.
Expression Levels of 6xHis-Twin-Strep-Tagged SARS-CoV-2 Proteins in E. coli LOBSTR-RIL
| protein | expression level |
|---|---|
| NSP1 | soluble: L, 16; |
| insoluble: N, 16; M, 37 | |
| NSP2 | soluble: |
| insoluble: N, 16; H, 37 | |
| NSP3 | soluble: |
| insoluble: N, 16; N, 37 | |
| NSP3 ARH | soluble: |
| insoluble: N, 16; H, 37 | |
| NSP3 PLpro | soluble: |
| insoluble: L, 16; H, 37 | |
| NSP3C | soluble: |
| insoluble: L, 16; H, 37 | |
| NSP4N | soluble: |
| insoluble: M, 16; H, 37 | |
| NSP4C | soluble: |
| insoluble: N, 16; H, 37 | |
| NSP5 | soluble: N, 16; |
| insoluble: N, 16; L, 37 | |
| NSP7 | soluble: L, 16; |
| insoluble: N, 16; L, 37 | |
| NSP8 | soluble:
M, 16; |
| insoluble: L, 16; L, 37 | |
| NSP9 | soluble: L, 16; |
| insoluble: N, 16; L, 37 | |
| NSP10 | soluble: |
| insoluble: N, 16; H, 37 | |
| NSP12 | soluble: |
| insoluble: H, 16; H, 37 | |
| NSP12P | soluble: |
| insoluble: H, 16; H, 37 | |
| NSP13 | soluble: |
| insoluble: H, 16; H, 37 | |
| NSP14 | soluble: |
| insoluble: L, 16; H, 37 | |
| NSP15 | soluble: |
| insoluble: N, 16; L, 37 | |
| NSP16 | soluble: |
| insoluble: L, 16; H, 37 | |
| S | soluble: |
| insoluble: H, 16; H, 37 | |
| S1 | soluble: |
| insoluble: H, 16; H, 37 | |
| S2 | soluble: |
| insoluble: M, 16; H, 37 | |
| ORF3a | soluble: |
| insoluble: L, 16; H, 37 | |
| M | soluble: |
| insoluble: L, 16; H, 37 | |
| ORF7a | soluble: |
| insoluble: L, 16; H, 37 | |
| ORF8 | soluble: |
| insoluble: M, 16; H, 37 | |
| ORF9b | soluble: L, 16; |
| insoluble: N, 16; M, 37 | |
| N | soluble: |
| insoluble: M, 16; H, 37 | |
| NS | soluble: M, 16; |
| insoluble: N, 16; M, 37 |
L, low; M, moderate; H, high; N, none; 16 for 16 °C; and 37 for 37 °C. For example, L, 16 indicates low expression at 16 °C.
Figure 3Purification of the N-terminal ARH and PLpro domains of NSP3, NSP15, and NSP16 as representatives. M, molecular weight marker; S, the soluble fraction of the cell extract; I, the insoluble fraction of the cell extract; ST, Strep-Tactin affinity chromatography; Ni, Ni-NTA affinity chromatography; and TEV, cleavage with TEV protease. The protein sizes before and after TEV-protease treatment are indicated at the bottom.
Figure 4ARH domain of the NSP3 protein of SARS-CoV-2. (A) Stereoview of the ribbon representation of the ARH domain, with the active site in the center. (B) Stereo view of the X-ray structure of the ARH domain depicted as a solvent-accessible surface in complex with ADP-ribose (PDB code 6W02). The adenosine, pyrophosphate, and ribose (from left to right) binding subpockets are shown by yellow broken highlights. (C) Stereoview of the superimposition of 21 top-ranking compounds docked to the NSP3 ARH domain active site (PDB code 6WCF; resolution of 1.06 Å), demonstrating sampling of all three subpockets for binding.
Figure 5Structures of compounds and the dose–response plots of compounds for the ARH domain of NSP3. (A) Chemical structures of three compounds in the first set; (B) structures of three derivatives in the second set; and (C) dose–response plots of MST for compounds 200, 147, and 130 from left to right.