| Literature DB >> 32941612 |
Erica T Prates1,2, Michael R Garvin1,2, Mirko Pavicic1,2, Piet Jones2,3, Manesh Shah4, Omar Demerdash1, B Kirtley Amos5, Armin Geiger2,3, Daniel Jacobson1,2,3,4,6.
Abstract
Despite SARS-CoV and SARS-CoV-2 being equipped with highly similar protein arsenals, the corresponding zoonoses have spread among humans at extremely different rates. The specific characteristics of these viruses that led to such distinct outcomes remain unclear. Here, we apply proteome-wide comparative structural analysis aiming to identify the unique molecular elements in the SARS-CoV-2 proteome that may explain the differing consequences. By combining protein modeling and molecular dynamics simulations, we suggest nonconservative substitutions in functional regions of the spike glycoprotein (S), nsp1, and nsp3 that are contributing to differences in virulence. Particularly, we explain why the substitutions at the receptor-binding domain of S affect the structure-dynamics behavior in complexes with putative host receptors. Conservation of functional protein regions within the two taxa is also noteworthy. We suggest that the highly conserved main protease, nsp5, of SARS-CoV and SARS-CoV-2 is part of their mechanism of circumventing the host interferon antiviral response. Overall, most substitutions occur on the protein surfaces and may be modulating their antigenic properties and interactions with other macromolecules. Our results imply that the striking difference in the pervasiveness of SARS-CoV-2 and SARS-CoV among humans seems to significantly derive from molecular features that modulate the efficiency of viral particles in entering the host cells and blocking the host immune response.Entities:
Keywords: COVID-19; SARS-CoV; SARS-CoV-2; pathogenesis; proteome-wide analysis; proteome-wide modeling
Mesh:
Substances:
Year: 2021 PMID: 32941612 PMCID: PMC7543629 DOI: 10.1093/molbev/msaa231
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Summary of the SARS-CoV-2 Proteome (reference genome NC_045512.2).
| Name | Accession | Length | Function |
|---|---|---|---|
| nsp1 | YP_009725297.1 | 180 | Inhibits host gene expression and interferon signaling ( |
| nsp2 | YP_009725298.1 | 638 | May assist other viral proteins in their function, interacting with several of them, but its specific function is not known yet ( |
| nsp3 | YP_009725299.1 | 1,945 | Papain-like protease with phosphatase activity. Performs proteolytic cleavage of the polyproteins ( |
| nsp4 | YP_009725300.1 | 500 | Essential to membrane rearrangements during viral replication ( |
| nsp5 | YP_009725301.1 | 306 | Also known as 3C-like proteinase, its main role is to cleave the viral polyprotein to generate the active forms of the nonstructural proteins ( |
| nsp6 | YP_009742613.1 | 290 | Participates in membrane rearrangements and autophagy ( |
| nsp7 | YP_009725303.1 | 83 | Part of the replication complex (nsp7–nsp8–nsp12). It forms an hexadecameric complex with nsp8 that may act as a processivity clamp for the RNA-dependent RNA polymerase ( |
| nsp8 | YP_009725304.1 | 198 | Part of the replication complex (nsp7–nsp8–nsp12). It forms an hexadecameric complex with nsp7 that may act as a processivity clamp for the RNA-dependent RNA polymerase ( |
| nsp9 | YP_009725305.1 | 113 | Forms homodimers that bind and protect the viral genome from degradation during replication ( |
| nsp10 | YP_009725306.1 | 139 | Forms complexes with nsp14 and nsp16, which perform 3′–5′ exoribonuclease and 2′- |
| nsp11 | YP_009725312.1 | 13 | Short peptide that may be involved in RNA synthesis ( |
| nsp12 | YP_009725307.1 | 932 | RNA-dependent RNA polymerase, the core of the replication complex (nsp7–nsp8–nsp12) ( |
| nsp13 | YP_009725308.1 | 601 | RNA helicase with NTPase, dNTPase, and RTpase activities ( |
| nsp14 | YP_009725309.1 | 527 | 3′–5′ exonuclease with proofreading activity ( |
| nsp15 | YP_009725310.1 | 346 | Nidoviral RNA uridylate-specific endoribonuclease (NendoU) ( |
| nsp16 | YP_009725311.1 | 298 | 2′- |
| S | YP_009724390.1 | 1,273 | Spike glycoprotein. Main means of virus entry into host cells. These highly glycosylated proteins protrude from the viral surface to interact with the host cell receptor(s) ( |
| M | YP_009724393.1 | 222 | Membrane glycoprotein. Required for membrane curvature initiation, RNA packing, and viral particle budding ( |
| N | YP_009724397.2 | 419 | Nucleocapsid. Packages the viral RNA to form a ribonucleocapsid, playing a key role in viral assembly ( |
| E | YP_009724392.1 | 75 | Envelope protein. Minor structural protein that forms pentameric ion channels in host ER membranes ( |
| ORF3a | YP_009724391.1 | 275 | Forms homotetramers with ion channel properties ( |
| ORF6 | YP_009724394.1 | 61 | Enhances viral replication ( |
| ORF7a | YP_009724395.1 | 121 | Prevents virus tethering at the plasma membrane by binding to BTS-2 ( |
| ORF7b | YP_009725318.1 | 43 | Integral transmembrane protein. Its function is unclear ( |
| ORF8 | YP_009724396.1 | 121 | Accessory protein involved in enhanced virus replication ( |
| ORF9b | PODTD2 | Alternative reading frame in the N gene. Suppresses host antiviral response by promoting MAVS degradation ( | |
| ORF10 | YP_009725255.1 | 38 | Accessory protein with potential role in inhibiting the ubiquitin-proteasome system (UPS) ( |
Annotated by Gordon et al. (2020).
Fig. 1.Distribution of sequence variation in fully or partially solved structures of SARS-CoV-2 proteins relative to SARS-CoV. Variations considered nonconservative, represented in magenta, are defined in supplementary table S1, Supplementary Material online. Variations occurring within protein cores (low solvent accessible surface area, SASA) are represented in orange. ORF10 is not included as it is not found in the SARS-CoV proteome. The percentages were computed relative to the total number of amino acids of each protein.
Fig. 2.Nonconservative substitutions in the spike glycoprotein. (A) Local modeling-derived SARS-CoV-2 spike glycoprotein (based on PDB ids 6acc and 6ack). (B) Conformational transition of the receptor-binding domain of the S1 subunit of the spike glycoprotein and association with ACE2 receptor. Nonconservative substitutions relative to SARS-CoV S are depicted in orange. N-terminal (NTD) and C-terminal domains (CTD) are identified.
Fig. 3.Analysis of simulations of SARS-CoV and SARS-CoV-2 RBDs in complex with ACE2. (A) Probability density of residues from ACE2 forming contacts with the RBDs. A maximum distance of 4 Å between any atom in a pair of residues was established. Bars with a standard deviation >50% of the probability density are considered transient contacts in the simulations and are not included in these plots. The colors of the bars correspond to zone 1 and zone 2 of ACE2, defined in (B), which shows the RBD residues involved in contacts formed during more than 70% of the simulation time (labeled in green). RBD and ACE2 residues are represented as licorices, in green and pink, respectively. (C) Superimposition of frames in a representative simulation of RBD1-ACE2 (left) and RBD2-ACE2 (right), using the initial position of ACE2 as reference for alignment. RBDs and ACE2 are represented in green and pink, respectively.
Fig. 4.Analysis of simulations of SARS-CoV and SARS-CoV-2 RBDs in the built complex with ACE. (A) Superimposition of the last frames of the simulations of RBD1-ACE (left). For visual clarity, because the relative orientation of the proteins in RBD2-ACE is very flexible due to the small surface of contact, we only show the last frame of a representative simulation of RBD-ACE (right). RBDs and ACE are represented in green and pink, respectively. (B) Residues involved in contacts formed during more than 70% of the simulation time. RBD and ACE residues are represented as licorices, in green and pink, respectively. (C) Probability density of residues in ACE forming contacts with the RBDs. A maximum distance of 4 Å between any atom in a pair of residues was established. Bars with a standard deviation higher than 50% of the probability density are considered transient contacts in the simulations and not included in these plots. The colors of the bars correspond to zone 1 and zone 2 of ACE, shown in (B).
Fig 5.Analysis of the structural variation in SARS-CoV-2 nsp1 protein relative to SARS-CoV nsp1. Fragment-based predicted structure of nsp1. Nonconservative substitutions relative to SARS-CoV nsp1 are depicted in yellow. Substitutions discussed in the text are labeled, including the corresponding residue of the homolog (SARS-CoV) in parentheses.
Fig. 6.Papain-like protease domain of SARS-CoV-2 nsp3 bound to human ISG15. (A) Nonconservative substitutions relative to SARS-CoV are depicted in orange in PL2Pro (structure in pink). ISG15 bound to SARS-CoV-2 PL2pro (PDB id 6xa9) is represented in blue, and ISG15 bound to SARS-CoV PL2Pro (PDB id 5tl6) is represented in green. (B) The substitution of Lys917 (in green) in SARS-CoV PL2Pro by Gln940 in SARS-CoV-2 may result in the loss of a salt bridge with Glu127 in ISG15. The distance between side chain atoms, N_Lys917-O_Glu127 and N_Gln940-O_Glu127, is 7 Å and 13 Å, respectively.
Fig. 7.Analysis of the structural variation in SARS-CoV-2 nsp5 protein relative to SARS-CoV nsp5. (A) Nsp5 dimer (PDB id: 6lu7). (B) Close view of the catalytic site of nsp5. In yellow, NEMO is shown in the conformation predicted with docking. The conformation of NEMO transferred from PEDV 3CLpro is also depicted, in black (PDB id: 5zqg). Substitutions discussed in the text are labeled, including the corresponding residue of the homolog (PEDV) in parentheses.