| Literature DB >> 21597992 |
Sergey A Samsonov1, Joan Teyra, M Teresa Pisabarro.
Abstract
Glycosaminoglycans (GAGs) are anionic polysaccharides, which participate in key processes in the extracellular matrix by interactions with protein targets. Due to their charged nature, accurate consideration of electrostatic and water-mediated interactions is indispensable for understanding GAGs binding properties. However, solvent is often overlooked in molecular recognition studies. Here we analyze the abundance of solvent in GAG-protein interfaces and investigate the challenges of adding explicit solvent in GAG-protein docking experiments. We observe PDB GAG-protein interfaces being significantly more hydrated than protein-protein interfaces. Furthermore, by applying molecular dynamics approaches we estimate that about half of GAG-protein interactions are water-mediated. With a dataset of eleven GAG-protein complexes we analyze how solvent inclusion affects Autodock 3, eHiTs, MOE and FlexX docking. We develop an approach to de novo place explicit solvent into the binding site prior to docking, which uses the GRID program to predict positions of waters and to locate possible areas of solvent displacement upon ligand binding. To investigate how solvent placement affects docking performance, we compare these results with those obtained by taking into account information about the solvent position in the crystal structure. In general, we observe that inclusion of solvent improves the results obtained with these methods. Our data show that Autodock 3 performs best, though it experiences difficulties to quantitatively reproduce experimental data on specificity of heparin/heparan sulfate disaccharides binding to IL-8. Our work highlights the current challenges of introducing solvent in protein-GAGs recognition studies, which is crucial for exploiting the full potential of these molecules for rational engineering.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21597992 PMCID: PMC3107433 DOI: 10.1007/s10822-011-9433-1
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 3.686
GAG-protein complex structures used for the reference docking runs
| PDB ID | Res. (Å) | Description | GAG length | Crystal waters | GRID waters | Overlapc |
|---|---|---|---|---|---|---|
| 1DBO | 1.70 | Chondroitinase B + CSa | Dimer | 8 | 2 | 1 |
| 1OJN | 1.60 | Hyaluronate lyase + CSa | Dimer | 24 | 7 | 8 |
| 1RWH | 1.25 | Chondroitin lyase AC + CSa | Tetramer | 30 | 4 | 16 |
| 1G5N | 1.90 | Annexin V + HEb | Tetramer | 15 | 1 | 6 |
| 1T8U | 1.95 | 3-O-Sulfotransferase3 + HEb | Tetramer | 15 | 2 | 6 |
| 3E7J | 2.10 | Heparinase II + HEb | Tetramer | 17 | 2 | 5 |
| 2HYU | 1.42 | Annexin 2A + HEb | Tetramer | 35 | 1 | 9 |
| 2BRS | 2.20 | EMBP + HEb | Dimer | 3 | 2 | 0 |
| 1BFB | 1.90 | FGF2 + HEb | Tetramer | 0 | 5 | 0 |
| 3IN9_1 | 2.00 | Heparin lyase 1 + HEb | Dimer | 7 | 4 | 3 |
| 3IN9_2 | 2.00 | Heparin lyase 1 + HEb | Dimer | 7 | 5 | 3 |
aChondroitin sulfate
bHeparin/heparan sulfate
cThis number shows how many GRID-generated water molecules were discarded because of the overlap with crystal water molecules
Fig. 1Dependence of water molecules abundance in GAG-protein interfaces on crystal structure resolution
Hydration of GAG-protein, other saccharide-protein and protein–protein interfaces
| Interfaces dataset | Number of interfaces | Water molecules/interface area (1/1000 Å2) |
|---|---|---|
| GAG-protein | 57 | 10.8 |
| Saccharide–protein (not GAGs) | 1,910 | 9.5 |
| Protein–protein [ | 176 | 3 |
Fig. 2Time fractions of interactions per protein (dark grey) and per GAGs (light grey) residues for CD44—HA (a); Cathepsin K—CS (b) complexes
MM-PBSA free energy decomposition for CD44—HA and Cathepsin K—CS complexes
| Complex/component | Free energy (kcal/mol) | |||||||
|---|---|---|---|---|---|---|---|---|
| ELE | VDW | GAS | PBSUR | PB | PBSOL | PBELE | PBTOT | |
| CD44—HA | 34.2 | −35.7 | −1.5 | −3.1 | −14.1 | −17.3 | 20.0 | −18.8 |
| Cathepsin K—CS | −1399.2 | −35.0 | 0.0 | −5.1 | 1398.9 | 1393.7 | −0.4 | −40.5 |
Energy components: ELE electrostatic, VDW van der Waals, GAS full energy in gas phase (ELE + VDW), PBSUR hydrophobic contribution to solvation, PB reaction field calculated by PB, PBSOL full solvation (PBSUR + PB), PBELE sum of electrostatic energy in vacuo and reaction field energy (PB + ELE), PBTOT total energy (PBSOL + GAS)
Fig. 3Comparison of the reference docking experiments of Autodock 3, eHiTs, MOE docking and FlexX for 11 complexes: a RMSD of top scoring pose; b Lowest RMSD within 50 top poses; c Rank of the pose with the lowest RMSD in 50 top poses; d Number of correct poses in 50 top poses; e Number of correct poses in 10 top poses. ‘+Wat’ relates to the runs with explicit water molecules. ‘+ Wat3’ relates to the type 3 of FlexX water
Autodock 3 performance with de novo solvent placement
| PDB ID | W− top posea RMSD (Å) | W+ top poseb RMSD (Å) | W− best pose ranka (RMSD (Å)) | W+ best pose rankb (RMSD (Å)) | W− correct posesa in top 50 | W+ correct posesb in top 50 | W− correct posesa in top 10 | W+ correct posesb in top 10 |
|---|---|---|---|---|---|---|---|---|
| 1DBO | 1.93 | 1.42 | 42 (1.27) | 11 (1.06) | 15 | 27 | 0 | 6 |
| 1OJN | 2.90 | 1.37 | 8 (1.11) | 2 (1.13) | 6 | 9 | 4 | 6 |
| 1RWH | 1.99 | 1.13 | 7 (0.97) | 2 (0.87) | 8 | 13 | 7 | 9 |
| 1G5N | 4.41 | 4.28 | 29 (1.48) | 27 (1.82) | 5 | 5 | 1 | 1 |
| 1T8U | 2.10 | 3.57 | 2 (1.84) | 3 (2.27) | 4 | 2 | 3 | 1 |
| 3E7J | 1.79 | 1.20 | 1 (1.79) | 1 (1.20) | 5 | 3 | 2 | 3 |
| 2HYU | 2.26 | 3.60 | 16 (1.91) | 4 (2.29) | 8 | 7 | 2 | 3 |
| 2BRS | 2.77 | 3.01 | 34 (1.13) | 15 (1.45) | 8 | 16 | 0 | 8 |
| 1BFB | 3.35 | 3.58 | 14 (1.88) | 6 (2.31) | 5 | 3 | 4 | 1 |
| 3IN9_1 | 3.54 | 2.75 | 32 (0.87) | 34 (0.87) | 20 | 16 | 0 | 2 |
| 3IN9_2 | 1.18 | 1.22 | 22 (0.99) | 13 (0.85) | 32 | 43 | 9 | 10 |
| Mean | 2.57 ± 0.93 | 2.47 ± 1.21 | 19 (1.39 ± 0.41) | 11 (1.47 ± 0.60) | 10.5 | 13.1 | 2.9 | 4.5 |
aDocking runs without explicit solvent
bDocking runs with explicit solvent
eHiTs performance with de novo solvent placement
| PDB ID | W− top posea RMSD (Å) | W+ top poseb RMSD (Å) | W− best pose ranka (RMSD (Å)) | W+ best pose rankb (RMSD (Å)) | W− correct posesa in top 50 | W+ correct posesb in top 50 | W− correct posesa in top 10 | W+ correct posesb in top 10 |
|---|---|---|---|---|---|---|---|---|
| 1DBO | 3.14 | 4.58 | 13 (1.67) | 4 (1.46) | 3 | 4 | 2 | 2 |
| 1OJN | 5.79 | 6.83 | 41 (1.32) | 46 (1.51) | 4 | 8 | 3 | 5 |
| 1RWH | 9.35 | 1.99 | 19 (2.56) | 42 (1.68) | 2 | 4 | 0 | 1 |
| 1G5N | 10.97 | 5.71 | 38 (5.68) | 43 (4.53) | 0 | 0 | 0 | 0 |
| 1T8U | 11.25 | 3.27 | 50 (4.91) | 37 (2.92) | 0 | 6 | 0 | 2 |
| 3E7J | 11.47 | 1.65 | 18 (2.02) | 8 (0.89) | 1 | 9 | 0 | 8 |
| 2HYU | 11.00 | 4.99 | 41 (6.62) | 49 (3.31) | 0 | 3 | 0 | 1 |
| 2BRS | 7.74 | 5.42 | 4 (2.07) | 28 (2.05) | 5 | 4 | 3 | 1 |
| 1BFB | 11.53 | 7.15 | 50 (5.57) | 30 (5.55) | 0 | 0 | 0 | 0 |
| 3IN9_1 | 6.43 | 5.57 | 44 (4.37) | 48 (4.22) | 0 | 0 | 0 | 0 |
| 3IN9_2 | 5.47 | 7.59 | 32 (2.98) | 27 (4.15) | 1 | 0 | 0 | 0 |
| Mean | 8.55 ± 2.98 | 4.97 ± 1.98 | 32 (3.61 ± 1.87) | 33 (2.93 ± 1.53) | 1.5 | 3.5 | 0.7 | 1.8 |
aDocking runs without explicit solvent
bDocking runs with explicit solvent
De novo water molecules placement
| PDB ID |
| Excluded watersb | Overlap with ligandc | RMSD bck/all (Å)d |
|---|---|---|---|---|
| 1DBO | 2 | 0 | 0 | 0.027/0.224 |
| 1OJN | 11 | 0 | 0 | 0.026/0.238 |
| 1RWH | 15 | 2 | 1 (1.5 Å) | 0.024/0.129 |
| 1G5N | 8 | 3 | 1 (1.6 Å) | 0.187/0.261 |
| 1T8U | 8 | 2 | 0 | 0.034/0.192 |
| 3E7J | 8 | 2 | 0 | 0.049/0.169 |
| 2HYU | 9 | 3 | 1 (1.1 Å) | 0.166/0.306 |
| 2BRS | 7 | 1 | 1 (0.9 Å) | 0.093/0.337 |
| 1BFB | 13 | 4 | 2 (0.7, 1.5 Å) | 0.113/0.317 |
| 3IN9_1 | 7 | 2 | 0 | 0.041/0.252 |
| 3IN9_2 | 9 | 1 | 0 | 0.071/0.237 |
aTotal number of GRID-generated water molecules used in de novo docking experiments
bNumber of water molecules excluded by use of Csp3 atomic probe
cNumber of water molecules overlapping with ligand in crystal structure in the docking experiments (distance to ligand)
dRMSD is calculated for backbone and all atoms for receptor binding site residues after minimization in comparison to the initial crystal structure of each complex
Fig. 4Results for the docking of Idu(2S)-GlcNAc(6S) to IL-8 with Autodock 3: 50 top docking solutions. The residues of heparin binding site are labeled and shown in licorice, the pyranose rings of disaccharides are in lines: red—Ido(2S) and green—GlcNAc(6S)
Clustering 150 docking solutions for disaccharides in IL-8 heparin binding site (RMSD = 2 Å)
| Disaccharides | CLUSTERS_WATa (−) | CLUSTERS_WATb (+) |
|---|---|---|
| Idu(2S)-GlcNAc(6S) | 13 (7), 27 (6), 43 (5) | 28 (10), 18 (8), 11 (5) |
| Idu-GlcNS(6S) | 10 (14), 12 (10), 37 (10) | 13 (30), 10 (17), 8 (12) |
| Idu-GlcNS | 64 (9), 42 (7), 1 (6) | 38 (10), 9 (8), 27 (7) |
| Idu-GlcNAc | 1 (8), 3 (7), 71 (6) | 2 (7), 19 (7), 27 (7) |
| Idu(2S)-GlcNS | 2 (7), 16 (5), 51 (5) | 14 (8), 38 (6), 2 (5) |
| Idu-GlcN(6S) | 10 (22), 17 (9), 34 (6) | 10 (24), 32 (13), 7 (9) |
aRanks of three biggest clusters (number of solutions in cluster) when docking with no explicit water molecules
bRanks of three biggest clusters (number of solutions in cluster) when docking with explicit water molecules