| Literature DB >> 32785274 |
Manish Sarkar1, Soham Saha2,3.
Abstract
The outbreak of COVID-19 across the world has posed unprecedeene">nted and global challeene">nges oene">n multiple froene">nts. Most of the vaccine and drug developmeene">nt has focused oene">n the spike proteins and viral RNA-polymerases and main protease for viral replication. Using the bioinformatics and structural modelling approach, we modelled the structure of the envelope (E)-protein of novel SARS-CoV-2. The E-protein of this virus shares sequence similarity with that of SARS- CoV-1, and is highly conserved in the N-terminus regions. Incidentally, compared to spike proteins, E proteins demonstrate lower disparity and mutability among the isolated sequences. Using homology modelling, we found that the most favorable structure could function as a gated ion channel conducting H+ ions. Combining pocket estimation and docking with water, we determined that GLU 8 and ASN 15 in the N-terminal region were in close proximity to form H-bonds which was further validated by insertion of the E protein in an ERGIC-mimic membrane. Additionally, two distinct "core" structures were visible, the hydrophobic core and the central core, which may regulate the opening/closing of the channel. We propose this as a mechanism of viral ion channeling activity which plays a critical role in viral infection and pathogenesis. In addition, it provides a structural basis and additional avenues for vaccine development and generating therapeutic interventions against the virus.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32785274 PMCID: PMC7423102 DOI: 10.1371/journal.pone.0237300
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Table showing composition and optimization of the generated membrane.
| ER% | Golgi % | (ER+Golgi) | % in ERGIC complex | N = 500 | N = 450 | N = 400 | N = 350 | N = 300 | N = 250 | N = 200 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 54 | 36 | 90 | 47 | 235 | 212 | 188 | 164 | 141 | 118 | 94 | |
| 20 | 21 | 41 | 21 | 105 | 118 | 84 | 74 | 63 | 52 | 42 | |
| 11 | 12 | 23 | 12 | 60 | 54 | 48 | 42 | 36 | 30 | 24 | |
| 6 | Not present | 6 | 3 | 15 | 14 | 12 | 11 | 9 | 7 | 6 | |
| 8 | 18 | 26 | 14 | 70 | 63 | 54 | 49 | 42 | 35 | 28 | |
| 6 | Not present | 6 | 3 | 15 | 13 | 12 | 11 | 9 | 7 | 6 |
Single site-directed mutations on the E-put protein.
| Protein/Mutant | Theoretical pI (Expasy) | ddG value (STRUM) | Aliphatic index (Expasy) | Average hydrophobic score (Expasy) |
|---|---|---|---|---|
| Wild Type | 8.57 | N/A | 144.00 | 1.128 |
| F4A | 8.57 | -1.43 | 145.33 | 1.115 |
| F4L | 8.57 | -0.74 | 149.20 | 1.141 |
| F4R | 8.99 | -1.09 | 144.00 | 1.031 |
| F4Y | 8.55 | -0.6 | 144.00 | 1.073 |
| F4W | 8.57 | -0.53 | 144.00 | 1.079 |
| E8A | 8.99 | -0.53 | 145.33 | 1.199 |
| E8V | 8.99 | -0.43 | 147.87 | 1.231 |
| E8I | 8.99 | -0.66 | 149.20 | 1.235 |
| E8K | 9.26 | -0.7 | 144.00 | 1.123 |
| E8M | 8.99 | -0.55 | 144.00 | 1.200 |
| N15A | 8.57 | -0.9 | 145.33 | 1.199 |
| N15V | 8.57 | -0.67 | 147.87 | 1.231 |
| N15I | 8.57 | -0.95 | 149.20 | 1.235 |
| N15K | 8.96 | -0.68 | 144.00 | 1.123 |
| N15F | 8.57 | -0.89 | 144.00 | 1.212 |
| F26A | 8.57 | -2.14 | 145.33 | 1.115 |
| F26L | 8.57 | -0.9 | 149.20 | 1.141 |
| F26R | 8.99 | -1.18 | 144.00 | 1.031 |
| F26Y | 8.55 | -0.62 | 144.00 | 1.073 |
| F26W | 8.57 | -0.49 | 144.00 | 1.079 |
Fig 1Sequence alignment of SARS CoV E proteins, disparity index and mutability.
(A) Sequence alignment of CoV E-proteins performed by Clustal Omega after a BLASTp search against human SARS-CoV-2 E protein (Accession number: QII57162.1). The selected aligned sequences were based on the criteria of 90–100% similarity. The conserved sequences, quality of sequence alignment, and consensus sequence are shown below. (B) Sequence alignment of CoV E-proteins from the bat (HKU3-7), human SARS 2018, bat SARS_RsSHC014, BtRI-BetaCoV, SARS-CoV-1 and SARS-CoV-2 showing highly conserved regions in the N-terminal region. The consensus sequence is shown below. (C) Percentage composition of each amino acid computed from the aligned selected sequences for (i) Spike protein, (ii) E-protein, and (iii) M-protein. (D) Lower triangular heatmap representation of the disparity index computed from the sequence alignment for (i) spike protein, and (ii) E-protein. Scale: 0–1 (for i), and 0–0.1 (for ii). (E) Mutability (probability of amino acid change) calculated for spike protein and E-protein. The dotted line indicates a 50% probability chance, while red bars indicate amino acids absent/single-site presence on the sequences analyzed. $$ indicates residues with lower mutability in E-protein compared to spike protein.
Fig 2Pentameric homology model of the E protein of SARS-CoV-2.
(A) NMR structure of pentameric E protein (PDB id: 5X29) of SARS-CoV-1. (B) Pentameric model of E protein of SARS-CoV-2 (COVID-19) (i) generated in GALAXY (E-put), and (ii) generated in SWISS-MODEL. A magnified figure for each is provided in (red box). (C) Top view of pentameric E protein SARS-CoV-1 (5X29). (D) Top view of pentameric E protein SARS-CoV-2 (i) generated in GALAXY (E-put), and (ii) generated in SWISS-MODEL. (E) Bottleneck radius measurement of C (5X29). (F) Bottleneck radii measurement of D (i) GALAXY (E-put) and (ii) SWISS-MODEL. (G) N-terminal residues of (i) GALAXY model (E-put) (as in B-i) and (ii) SWISS-MODEL model (as in B-ii). (H) Structure validation parameters and pore radii of 5X29, GALAXY model (E-put), and SWISS-MODEL model.
Table showing MolProbity score, clash score, and pore radius for three possible conformations of the E-put protein.
| Models | MolProbity Score | Clash Score | Pore radius (angstrom) |
|---|---|---|---|
| E-protein SARS-Cov 2 (Galaxy model) | 0.81 | 0 | 3.4 |
| E-protein SARS-Cov 2 (Galaxy model: open; | 2.79 | 15.66 | 5.6 |
| E-protein SARS-Cov 2 (Galaxy model: closed bottleneck; | 2.02 | 12.86 | 2.3 |
Fig 3Pore volume estimation of E put protein by GALAXY-WEB and docking with water molecules.
(A) Estimation of the pore volume of the GALAXY-WEB E-put protein using the CASTp 3.0 server. (B) (i) Residues lining the luminal side of the E-put protein core and (ii) the luminal surface of the pore determined by the CASTp 3.0 server. (C) Docking of water to the E-put protein by SWISS-DOCK showing the chains B and E of the pentamer and magnification of the region of interaction. (D) Distances of water molecules from different lining residues and the limit of H-bond. (E) Residues’ orientation of the E-put protein generated from PYMOL and interaction of water with residues showing the distance between water and (i) ASN 15 and (ii) GLU 8 by CHIMERA. (F) Residues’ orientation generated from PYMOL (side view) and CHIMERA (top view) lining the Central core of the E-put protein. (G) Residues’ orientation generated from PYMOL and interaction of water with the lining residues of the Hydrophobic funnel of the E-put protein by CHIMERA. (H) Residues’ orientation generated from PYMOL (side view) and CHIMERA (top view) showing the Gate of the E-put protein in the closed conformation. (I) Residues’ orientation generated from PYMOL (side view) and CHIMERA (top view) forming the bottleneck of the E-put protein.
Fig 4Pore volume estimation of E put protein by SWISS MODEL and docking with water molecules.
(A) Estimation of the pore volume of the SWISS-MODEL E-put protein using the CASTp 3.0. (B) (i) Residues lining the luminal side of the SWISS-MODEL protein core and (ii) the luminal surface of the pore determined by the CASTp 3.0 server. (C) Docking of water to the SWISS-MODEL protein by SWISS-DOCK showing the chains C and E of the pentamer. (D) Residues’ interaction of water with residues showing the distance between water and (i) ASN 15 and (ii) GLU 8 by CHIMERA. (E) Residues’ interaction of water with the lining residues of the hydrophobic funnel of the SWISS-MODEL protein by CHIMERA. (F) Distances of water molecules from different lining residues and the limit of H-bond. (G) Residues’ orientation in CHIMERA forming the bottleneck of the SWISS-MODEL protein. (H) Superposition of the E-put and the SWISS-MODEL protein (purple: E-put; cyan: SWISS-MODEL) as (i) side view and (ii) top view. (I) Ramachandran plot of the E-put protein (Galaxy model). (J) Ramachandran plot of the SWISS-MODEL protein.
Fig 5E-put protein interacts with the lipid molecule components of ERGIC membrane.
(A) Ceramide: Sphingolipid generated in OpenBabel and CHIMERA. (B) Docking of Ceramide to the N-terminus of the E-put protein in CHIMERA-VINA and magnification of the region of interaction. (C) Interaction of ARG 38 in the C terminal with the docked ligand showing the distances between the charged groups of ARG 38 and the ligand. (D) Phosphatidylcholine: Glycerophospholipid generated in OpenBabel and CHIMERA. (E) Docking of Phosphatidylcholine to the N-terminus of the E-put protein in CHIMERA and Autodock Vina and magnification of the region of interaction. (F) Interaction of ARG 38 in the C terminal of E-put with the docked ligand showing the distances between the charged groups of ARG 38 and the ligand in CHIMERA. (G) Distances of the ligands from ARG 38 showing the bond limits of different interactions.
Fig 6Proposed mechanism of proton chaneling activity in E-put protein.
(A) Proton channeling by water hopping mechanism in M2 viroporin of influenza A virus in a pH-dependent fashion generated with CHIMERA. (B) Different conformational states of E-put protein in SARS-CoV-2 by CHIMERA showing its (i) Closed Discontinuous state (ii) Continuous Channel state 1 (iii) Continuous Channel state 2 (iv) Closed Discontinuous state generated with CHIMERA. (C) Pore volumes of the E-put protein (i) Closed Discontinuous state (ii) Continuous Channel state 1 (iii) Continuous Channel state 2 (iv) Closed Discontinuous state generated with CHIMERA and CASTp. (D) Representation of the change of pore volume as in C- i, ii, iii, iv. (E) The orientation of the bottleneck PHE 26 in different conformational states as in C- i, ii, iii, iv.
Fig 7Membrane insertion of E-put protein and structural morphing from open to closed state.
(A) Membrane insertion outline of the closed discontinuous state of E put TM with pore water (in red and white spheres) inside the pore structure and packed by lipid-like pseudo atoms (green spheres). Only pore water molecules remain after pore water generation through high-temperature dynamics. (B) The closed discontinuous state of E put TM inserted in a 1.5 lipid bilayer (POPC, POPE, POPI, POPS, Cholesterol, and SM) having 194 lipid components (94 in the upper leaflet and 100 in the lower leaflet) and modified with a TIP3P water model. The water molecules are seen in the cytoplasmic and the ERGIC luminal side. (C) A magnified side view of the model water molecules in the pore structure of the closed discontinuous state of the E put TM inserted in the ERGIC membrane mimic. (D) Top view of the model water molecules in the pore structure of the E put TM inserted in the closed discontinuous state within the ERGIC membrane mimic. (E) Membrane insertion outline of the continuous channel state 1 of E put TM with pore water (in red and white spheres) inside the pore structure and packed by lipid-like pseudo atoms (green spheres). Only pore water molecules remain after pore water generation through high-temperature dynamics. (F) The continuous channel state 1 of E put TM inserted in a 1.5 lipid bilayer (POPC, POPE, POPI, POPS, Cholesterol, and SM) having 194 lipid components (94 in the upper leaflet and 100 in the lower leaflet) and modified with a TIP3P water model. The water molecules are seen in the cytoplasmic and the ERGIC luminal side. (G) A magnified side view of the model water molecules in the pore structure of the continuous channel state 1 of the E put TM inserted in the ERGIC membrane mimic. (H) Top view of the model water molecules in the pore structure of the E put TM inserted in the continuous channel state 1 within the ERGIC membrane mimic. (I) Snapshots of the morphed protein models between open and closed state intermediate of the E put at a 10-frame interval (from frame 1–61). (J) Cross-sectional pore size profile of the membrane-inserted E put TM along the Z-axis in the closed discontinuous state (black) and continuous channel state 1 (red). (K) Cross-sectional pore area profile of the membrane-inserted E put TM along the Z-axis in the closed discontinuous state (close in black) and continuous channel state 1 (open in red).