| Literature DB >> 34063530 |
Giulia Paiardi1,2, Maria Milanesi1, Rebecca C Wade2,3,4, Pasqualina D'Ursi5, Marco Rusnati1.
Abstract
Glycosaminoglycans (GAGs) are linear polysaccharides. In proteoglycans (PGs), they are attached to a core protein. GAGs and PGs can be found as free molecules, associated with the extracellular matrix or expressed on the cell membrane. They play a role in the regulation of a wide array of physiological and pathological processes by binding to different proteins, thus modulating their structure and function, and their concentration and availability in the microenvironment. Unfortunately, the enormous structural diversity of GAGs/PGs has hampered the development of dedicated analytical technologies and experimental models. Similarly, computational approaches (in particular, molecular modeling, docking and dynamics simulations) have not been fully exploited in glycobiology, despite their potential to demystify the complexity of GAGs/PGs at a structural and functional level. Here, we review the state-of-the art of computational approaches to studying GAGs/PGs with the aim of pointing out the "bitter" and "sweet" aspects of this field of research. Furthermore, we attempt to bridge the gap between bioinformatics and glycobiology, which have so far been kept apart by conceptual and technical differences. For this purpose, we provide computational scientists and glycobiologists with the fundamentals of these two fields of research, with the aim of creating opportunities for their combined exploitation, and thereby contributing to a substantial improvement in scientific knowledge.Entities:
Keywords: glycosaminoglycans; heparan sulfate; heparin; molecular docking; molecular dynamic simulations; molecular modeling
Year: 2021 PMID: 34063530 PMCID: PMC8156566 DOI: 10.3390/biom11050739
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1(A) Chemical structures of the disaccharide units composing the five main classes of GAGs. In red, “R” indicates potential points of sulfation. (B) Schematic representation of the distribution of GAGs/PGs inside the cell, on its surface, in the ECM or in body fluids.
List of the features that contribute to the high structural heterogeneity of GAGs/PGs.
| GAGs/PGs | Combinations of hexuronic acids and amino sugars |
| Length of the saccharide chain | |
| Positions of sulfated groups (sulfatase activity) | |
| Degree of sulfation (sulfatase activity) | |
| Distinctive expression profiles in different cell types | |
| Distinctive expression profiles in different tissues | |
| Changes of expression profile during cell differentiation | |
| Changes of expression profile from physiology to pathology | |
| Localization in intra- or extracellular compartments | |
| Action of different glycosidases on the GAG chain | |
| PGs | Different core proteins |
| Variable number of GAG chains attached to the core protein | |
| Type of association of the core protein to the cell membrane | |
| Action of different proteases on the core protein |
Figure 2Consequences of GAG/PG interactions with proteins. Upon their binding, GAGs/PGs exert different effects on proteins that impact various cellular functions.
Figure 3Heparin (PDBid 1HPN) [49] and HA (PDBid 1HYA) [50] crystal structures depicted in sphere and stick representations and colored by elements (carbon, oxygen, nitrogen and sulphur atoms in grey, red, blue and yellow, respectively).
Web-based tools for computational studies of GAGs. Accession date for all the links reported in the table: 12 April 2021. FF: force field.
| Name | Description (Website) | Ref. |
|---|---|---|
| Databases | ||
| PDB | Bio-macromolecular structures. ( | [ |
| PubChem | Open chemical database containing the structures of small and large molecules including GAGs with their respective annotations (chemical structures, identifiers, physical properties, biological activities, patents, safety and toxicity data). ( | [ |
| KEGG | Collection of experimental GAG structures taken from CarbBank or from recent publications and present in KEGG pathways. ( | [ |
| Zinc | Curated collection of commercially available chemical compounds in ready-to-dock, 3D formats. ( | [ |
| DrugBank | Detailed drug properties (chemical, pharmacological and pharmaceutical features) and target information (sequences, structures and pathway). ( | [ |
| EMBL-EBI | Collection of various tools and data from different sources (including those listed in this table) ( | [ |
| GAG-database | Comprehensive resource for 3D-structures of GAGs, oligosaccharides and their complexes with proteins (140 curated entries). ( | [ |
| monosaccharides database | Comprehensive resource for monosaccharides. (776 entries). ( | [ |
| Tools to Build a GAG | ||
| CarbBuilder | Builds GAG 3D-structures with CHARMM FF from pre-calculated glycosidic linkage torsions. ( | [ |
| Chemsketch | Converts 2D drawings into 3D structures using a modified molecular mechanics approach. ( | [ |
| GLYCAM-Web GAG Builder | Models GAG 3D-structures with GLYCAM06 FF using the AMBER MD package in an automated system. ( | [ |
| CHARM-GUI Glycan Modeller | In silico N-/O-glycosylation of proteins; modeling of GAG-only systems. ( | [ |
| Amber-tleap | Models GAG 3D-structures with the GLYCAM06 FF using the AMBER MD package. ( | [ |
| MOE | Models GAG 3D-structures with MMFF94, AMBER, CHARMM FF and semi-empirical energy functions (PM3, AM1, MNDO). Conformational analysis using either a systematic or a stochastic search using random rotation of bonds. ( | [ |
| PRODRG | Models GAG 3D-structures with the ffgmx GROMACS FF. ( | [ |
| Macromodel | Models GAG 3D-structures with MM2, MM3, AMBER, AMBER94, MMFF, MMFFs, OPLS, OPLS_2005 and OPLS3 FF. ( | [ |
| Software for Molecular Docking | ||
| Autodock | Stochastic local search and Lamarck genetic algorithm and empirical scoring function. ( | [ |
| Autodock-Vina | Gradient-based local search, iterated local search algorithm and empirical scoring function. ( | [ |
| Glide | Search algorithms include the modes of extra precision, standard precision and a high-throughput virtual filter. ( | [ |
| Dock | Step-by-step geometric matching strategy; AMBER FF, empirical scoring function. ( | [ |
| Gold | Genetic algorithm. ( | [ |
| HADDOCK | Encodes information from identified or predicted interfaces in ambiguous interaction restraints. ( | [ |
| ClusPro | Fast Fourier Transform-based algorithm and molecular mechanics energy function for scoring. ( | [ |
| VinaCarb | Carbohydrate intrinsic-energy functions implemented in AutoDock Vina software. ( | [ |
| GlycoTorc-Vina | Based on the VinaCarb program; uses QM-derived scoring functions to improve GAGs docking. ( | [ |
| GAG-dock | Modification of DarwinDock method for sulfated GAGs. | [ |
| FFs for GAGs | ||
| GLYCAM_06 | Set of parameters and quantum mechanical data for a collection of minimal molecular fragments and related small molecules for GAGs simulation. ( | [ |
| CHARMM FF for carbohydrates | Hierarchical parametrization of model compounds containing the key atoms in GAGs. ( | [ |
| GROMOS 53A6glyc | Refined potential parameters for the determination of hexopyranose ring conformations by fitting to the corresponding quantum-mechanical profiles. ( | [ |
Figure 4Experimental and computational methods used to generate models of GAGs alone or in complex with their binders. Each bar reports the percentage of papers in which the indicated experimental (black bars) or computational (grey bars) methods were employed. For the software grouped under “others”, see Table 2. db: database. For further details on the bibliographic research strategy, see Appendix A.
Features that makes computational docking of a GAG to a protein a challenging task.
| GAGs | Long length |
| Structural and chemical heterogeneity | |
| High flexibility | |
| High charge density | |
| Large number of torsional angles between glycosidic bonds | |
| Difficulty to define the impact of solvation/desolvation on GAG structure | |
| Proteins | High charge density of GAG-binding sites |
| GAG/Protein Complexes | Absence of well-defined GAG-binding pockets on bound proteins |
| Electrostatic nature of GAG/protein interactions | |
| Weak surface complementarity of GAG/protein interactions | |
| Indispensability of solvent for their interactions | |
| Impact of solvation/desolvation on GAG/protein complexes |
Figure 5Docking software programs used to predict models of GAG complexes with their targets. Each bar reports the percentage of the published papers in which the indicated software programs were used. For the software programs grouped under “others”, see Table 2. For further details on the bibliographic research strategy, see Appendix A.
Figure 6FFs used for MD simulations of GAGs alone or in complexes with targets. Each bar reports the percentage of papers in which the indicated FFs have been employed. For the FFs grouped under “others”, see Table 2. For further details on the bibliographic research strategy, see Appendix A.
Figure 7(A) Crystal structure of a 12-mers heparin (PDBid 1HPN) shown in stick representation colored by elements with green carbons. (B) Structure of a 31-mers heparin obtained with the incremental docking method [44] and docked to the spike protein of SARS-CoV2 virus shown as electrostatic potential surface to highlight the basic path to which heparin binds. (C) Superimposition of 20 snapshots from 1 µs of MD simulation of the 31-mers heparin/spike complex showing the cloud of conformations adopted by heparin on the protein surface (adapted from Paiardi et al. https://arxiv.org/abs/2103.07722, accessed on 12 April 2021).
Figure 8Flowchart schematizing the series of queries in an application of computational approaches aimed at a comprehensive characterization of a GAG or a GAG/target complex.
Figure 9Number of papers containing computational studies of GAGs/PGs published since 1985. For further details on the bibliographic research strategy, see Appendix A.
Figure 10Distribution of computational studies with respect to GAG length. For further details on the bibliographic research strategy, see Appendix A.
Figure 11Distribution of computational structural studies of GAG alone and of GAGs complexed with the indicated ligand. For further details on the bibliographic research strategy, see Appendix A.
Figure 12Distribution of computational studies among the different GAGs. The bar “others” includes other natural GAGs and synthetic GAG-mimicking compounds. For further details on the bibliographic research strategy, see Appendix A.
Figure 13Virtuous circle between computational and experimental studies. As in the classical Yin and Yang principle, the two fields of research complement each other with the results from one pole helping the interpretation of the other. A correct balance between the two poles is needed in order to comprehend GAG/protein interactions and their biological consequences.