| Literature DB >> 34928574 |
Nisha Grandhi Jayaprakash1, Avadhesha Surolia1.
Abstract
The SARS-CoV-2 virus has been known to gain entry into the host cell through the spike protein that binds to the host ACE2 cell surface protein. However, the role of the putative sugar-binding sites in the spike protein has remained unclear. We provide a comprehensive in silico outlook into the infection initiation wherein the virus first recognizes the sialosides on the cell via its S1A domain of the spike protein as it surfs over the cell surface. This facilitates the subsequent interaction with the cellular glycosaminoglycans through the S1B domain of the spike protein as it binds to the ACE2 receptor. The unique coadaptation to recognize both the host protein and the cell-surface carbohydrate receptors provides an additional coupling mechanism for efficient viral attachment and infection.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34928574 PMCID: PMC8713392 DOI: 10.1021/acschembio.1c00691
Source DB: PubMed Journal: ACS Chem Biol ISSN: 1554-8929 Impact factor: 5.100
Figure 1Sequence and structural characterization of the two domains of the S1 subunit of the spike protein. (A) Phylogenetic analysis of the five beta coronaviruses indicating their domain architecture and ligand preferences to infer the closest homologues of the SARS-CoV-2 spike protein. The neighbor-joining method was used to construct the tree using the MEGA software.[24] (B) Electrostatic surface rendering of the S1A and S1B domains, S1 protomer, and S1 trimer with the pockets lined with positively charged residues highlighted in the dotted box. (C) Results from the MetaPocket[22] pocket prediction for the S1A domain and CASTp[23] pocket prediction for the S1B domain. Protein domains are shown in cartoon representation, with the identified pockets shown as spheres. (D) Modeling of the experimentally observed glycosylation (shown in stick representation) on the surface representation of the S1 subunit, SARS-CoV-2 spike protein (PDB ID: 6VSB), and the pockets, free from glycosylation highlighted in the dotted box; S1A binding site in pink and S1B binding site in blue.
Ligands Used in the Study
| sialosides | neu | Neu |
| aSia | ||
| bSia | ||
| NEU5GC | ||
| ACE9NEU | ||
| ACE7NEU | ||
| ACE4NEU | ||
| KDN | deaminated neuraminic acid | |
| 23SIA | ||
| 26SIA | ||
| 26SIL | ||
| 23SIL | ||
| S26GalNAc | ||
| diNANA | ||
| sLeX | aDNeu5Ac(2-3)bDGal(1-4)[aFuc(1-3)]GlcNAc | |
| gangliosides | GM1 | bDGalp(1–3)bDGalNAc[aNeu5Ac(2–3)]bDGalp(1–4)bDGlcp |
| GM1B | aNeu5Ac(2–3)bDGalp(1–3)bDGalNAc(1–4)bDGalp(1–4)bDGlcp | |
| GD1A | aNeu5Ac(2–3)bDGalp(1–3)bDGalNAc(1–4)[aNeu5Ac(2–3)]bDGalp(1–4)bDGlcp | |
| GD1B | bDGalp(1–3)bDGalNAc(1–4)[aNeu5Ac(2–8)aNeu5Ac(2–3)]bDGalp(1–4)bDGlcp | |
| GT1B | aNeu5Ac(2–3)bDGalp(1–3)bDGalNAc(1–4)[aNeu5Ac(2–8)aNeu5Ac(2–3)]bDGalp(1–4)bDGlcp | |
| GT1C | bDGalp(1–3)bDGalNAc(1–4)[aNeu5Ac(2–8)aNeu5Ac(2–8)aNeu5Ac(2–3)]bDGalp(1–4)bDGlcp | |
| GQ1C | aNeu5Ac(2–8)aNeu5Ac(2–3)bDGalp(1–3)bDGalNAc(1–4)[aNeu5Ac(2–8)aNeu5Ac(2–3)]bDGalp(1–4)bDGlcp | |
| glycosaminoglycans | Hept | aLIdoA(1→4)aDGlc(1→4)aLIdoA(1→4)aDGlc |
| Hepsul | aDGlcNAc(1→4)bDGlcA | |
| Hep | aLIdoA(1→4)aDGlc | |
| blood group antigens | Aant-I | aLFuc(1→2)[aDGalNAc(1→3)]bDGal(1→3)bDGlcNAc(1→3)aDGal |
| Aant-II | aLFuc(1→2)[aDGalNAc(1→3)]bDGal(1→4)bDGlcNAc(1→3)aDGal | |
| Aant-III | aLFuc(1→2)[aDGalNAc(1→3)]bDGal(1→3)aDGlcNAc(1→3)aDGal | |
| Aant-IV | aLFuc(1→2)[aDGalNAc(1→3)]bDGal(1→3)bDGalNAc(1→3)aDGal | |
| Bant-I | aLFuc(1→2)[aDGal(1→3)]bDGal(1→3)bDGlcNAc(1→3)aDGal | |
| Bant-II | aLFuc(1→2)[aDGal(1→3)]bDGal(1→4)bDGlcNAc(1→3)aDGal | |
| Hant-I | aLFuc(1→2)bDGal(1→3)bDGlcNAc(1→3)aDGal | |
| Hant-II | aLFuc(1→2)aDGal(1→4)bDGlcNAc(1→3)aDGal |
Figure 2Docking and simulation studies of the S1A domain with the various saccharides. (A) Intermolecular docking energy from the molecular docking studies for all the ligands of the various complexes with the S1A domain. The set is subdivided into various ligand subtypes—sialosides (mono and oligo), gangliosides, and blood group antigens. (B) Cartoon representation of the four S1A domain complexes highlighting the receptor residues involved in hydrogen bonds and hydrophobic interactions (shown as sticks) with ligands in orange stick representation and hydrogen bonds shown in cyan (Figure S3 includes detailed Ligplot representations for all S1A complexes). (C) Average number of intermolecular hydrogen bonds as indicated by box plots for the S1A-ligand complexes subdivided according to the ligand subtypes. The best binders were seen to have a higher number of hydrogen bonds. (D) Binding energy calculation using MMPBSA analysis of the various complexes with the S1A domain. The energy values are in kJ/mol. Inset: Surface representation of the protein with the best binding ligand in stick representation.
MMPBSA Analysis for the S1A Domain Complexes
| (A)
monosialosides | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| kJ/mol | Neu | aSia | bSia | Ace4Neu | Ace7Neu | Ace9Neu | Neu5Gc | Galnac | KDN |
| Van der Waals energy | –49.695 | –56.582 | –39.306 | –74.145 | –29.078 | –63.582 | –42.115 | –18.43 | –61.191 |
| electrostatic energy | –580.69 | –527.16 | –453.632 | –635.628 | –352.292 | –339.75 | –467.63 | –26.81 | –475.81 |
| polar solvation energy | 224.756 | 192.861 | 151.388 | 264.236 | 89.182 | 80.422 | 130.168 | 5.32 | 144.579 |
| SASA energy | –9.165 | –9.045 | –6.289 | –11.094 | –5.39 | –8.743 | –7.194 | –2.83 | –8.82 |
| binding energy | –414.97 | –399.45 | –345.544 | –455.009 | –297.307 | –331.771 | –388.33 | –42.73 | –402.69 |
Figure 3Docking and simulation studies of the S1B domain with the various saccharides. (A) Intermolecular docking energy from the molecular docking studies for all the ligands of the their complexes with the S1B domain. The set is subdivided into various ligand subtypes—glycosaminoglycans, gangliosides, and blood group antigens. (B) Cartoon representation of the two S1B domain complexes highlighting the receptor residues involved in hydrogen bonds and hydrophobic interactions (shown as sticks) with ligands in orange stick representation and hydrogen bonds shown in cyan (Figure S5 includes detailed Ligplot representations for all S1B complexes). (C) Average number of intermolecular hydrogen bonds as indicated by box plots for the S1B-ligand complexes subdivided according to the ligand subtypes. The best binders were seen to have a higher number of hydrogen bonds. (D) Binding energy calculation using MMPBSA analysis of the various complexes with the S1B domain. The energy values are in kJ/mol. Inset: Surface representation of the protein with the best binding ligand in stick representation.
MMPBSA Analysis for the S1B Domain Complexes
| (A)
glycosaminoglycans | |||||
|---|---|---|---|---|---|
| kJ/mol | Hept | Hepsul | Hep | diNANA | aSia |
| Van der Waals energy | –69.207 | –58.186 | –55.397 | –51.85 | –36.537 |
| electrostatic energy | –5097.65 | –953.06 | –2501.3 | –1184.2 | –840.17 |
| polar solvation energy | 1164.89 | 362.28 | 533.443 | 340.528 | 307.95 |
| SASA energy | –14.162 | –9.636 | –9.413 | –8.219 | –8.007 |
| binding energy | –4018.12 | –657.31 | –2027.1 | –905.68 | –573.05 |
Figure 4Simulation studies of the S1 subunit with the various saccharides. (A) Average RMSD analysis of the various complexes of the S1 subunit and the protomer with the distribution plots indicating the range sampled. The five complexes are color-coded. (B) The radius of gyration plot for the five complexes of the S1 subunit and protomers for 50 ns of time with the distribution plots indicating the range sampled. (C) RMSF analysis for the Cα atoms of the S1 subunit for the five complexes in comparison to the free protein highlighting the reduction in fluctuation offered by the ligand interaction. The binding site residues are indicated in the boxes. (D) The total energy of the five complexes of the S1 protomer color-coded across the 50 ns simulation. (E) Average number of intermolecular hydrogen bonds as indicated by box plots for the S1 protomer for the five complexes subdivided according to the ligand subtypes. The best binders were seen to have a higher number of hydrogen bonds. (F) Decomposition of the energetic contribution of the binding site residues to ligand binding. The complexes are color-coded. The energy values are given in kJ/mol. The residue-wise energy is calculated using a free energy decomposition scheme using g_mmpbsa tool.
MMPBSA Analysis for the S1 Subunit Complexes
| binding energy | S1A domain | S1B domain |
|---|---|---|
| diNANA-Hept | –996.391 ± 43.584 kJ/mol | –1246.155 ± 52.947 kJ/mol |
| 23SIA-Hept | –522.267 ± 18.801 kJ/mol | –1258.782 ± 24.144 kJ/mol |
| 23SIA-Hep | –285.431 ± 35.673 kJ/mol | –1913.800 ± 27.995 kJ/mol |
| 23SIA-GT1C | –563.118 ± 5.963 kJ/mol | –570.308 ± 64.290 kJ/mol |
| 23SIA-GM1B | –377.654 ± 24.122 kJ/mol | –529.669 ± 17.235 kJ/mol |
Figure 5Simulation studies of the S1 trimer with sialosides in the S1A domain and a glycosaminoglycan in the S1B domain. (A) Average RMSD analysis of chain A of the protomer and holo and apo trimers in the 1-up and 2-up conformations are color-coded. The presence of the carbohydrate and the trimer presentation is seen to stabilize the systems. (B) RMSF analysis for the Cα atoms of the S1 subunit trimer for the apo and holo forms are color-coded. The values are represented as bfactor plots and mapped onto the cartoon representation. (C) The radius of gyration plot for the full apo and holo trimers of the S1 subunit in the 1-up and 2-up conformations color-coded accordingly. (D) Average number of intermolecular hydrogen bonds as indicated by box plots for the S1 trimer with its respective carbohydrate ligands (color-coded). (E) Average number of interchain hydrogen bonds seen as a function of time for the 1-up and 2-up trimer conformations subdivided chain-wise. The chains A with BC and C with AB are seen to have similar interaction patterns.
List of the SARS-CoV-2 Variants of Concern Used in the Study
| mutant name | country | mutations |
|---|---|---|
| wild type | China | 0 |
| Variants of Concern | ||
| B.1.1.7 (ALPHA) | UK | Del 69–70, Del 144, N501Y, A570D, D614G, P681H, T716I, S982A, D1118H |
| B.1.351 (BETA) | South Africa | (L18F), D80A, D215G, (Del 242–244), K417N, E484K, N501Y, D614G, A701V |
| P.1(GAMMA) | Brazil | L18F, T20N, P26S, D138Y, R190S, K417N/T, E484K, N501Y, D614G, H655Y, T1027I, V1176F |
| B.1.617.2 (DELTA) | India | L452R, T478K, D614G, P681R |
| B.1.617.2 (DELTA plus AY.1) | L452R, K417N, T478K, D614G, P681R | |
| B.1.617.2 (DELTA plus AY.2) | A222V, L452R, K417N, T478K, D614G, P681R | |
Figure 6Sequence analysis of the spike protein variants of SARS-CoV-2. (A) Modeling of the experimentally observed glycosylation (shown in stick representation) in a cartoon representation of the S1 subunit trimer; SARS-CoV-2 spike protein (PDB ID: 6VSB) trimer with chains color-coded as chain A in green, chain B in pink, and chain C in cyan. The mutations across the chains are indicated as spheres. (B) Shannon entropy plot obtained using the protein variability server for the aligned variants of SARS-CoV-2. The bars indicate the frequency of variation at different residue positions. Lower entropic frequency implies higher residue conservation. (C) Residue-wise scatter plot representing the frequency of mutations observed at various residues of the variants of the spike protein obtained using the web application CoV-GLUE. The variations are indicated as replacements, deletions, or insertions. The binding site residues are indicated in the green dotted box (http://cov-glue.cvr.gla.ac.uk). (D) Intermolecular docking scores for the variants of concern with the respective carbohydrate ligands docked against the S1A and S1B domains of the S1 spike protein subunit clustered according to the variants.