| Literature DB >> 25996209 |
Laurent Hoffer1,2, Camelia Chira3, Gilles Marcou4, Alexandre Varnek5, Dragos Horvath6.
Abstract
This paper describes the development of the unified conformational sampling and docking tool called Sampler for Multiple Protein-Ligand Entities (S4MPLE). The main novelty in S4MPLE is the unified dealing with intra- and intermolecular degrees of freedom (DoF). While classically programs are either designed for folding or docking, S4MPLE transcends this artificial specialization. It supports folding, docking of a flexible ligand into a flexible site and simultaneous docking of several ligands. The trick behind it is the formal assimilation of inter-molecular to intra-molecular DoF associated to putative inter-molecular contact axes. This is implemented within the genetic operators powering a Lamarckian Genetic Algorithm (GA). Further novelty includes differentiable interaction fingerprints to control population diversity, and fitting a simple continuum solvent model and favorable contact bonus terms to the AMBER/GAFF force field. Novel applications-docking of fragment-like compounds, simultaneous docking of multiple ligands, including free crystallographic waters-were published elsewhere. This paper discusses: (a) methodology, (b) set-up of the force field energy functions and (c) their validation in classical redocking tests. More than 80% success in redocking was achieved (RMSD of top-ranked pose < 2.0 Å).Entities:
Keywords: conformational sampling; docking; force field fitting; genetic algorithms; interaction fingerprints
Mesh:
Substances:
Year: 2015 PMID: 25996209 PMCID: PMC6272476 DOI: 10.3390/molecules20058997
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Principle of fragment recombination.
Description of the AMBER/GAFF carbon types and their classes.
| Category | Force Field | Atomic Type | Description (from AMBER/GAFF Parameters File) |
|---|---|---|---|
| Polarized | AMBER | C | sp2 C carbonyl group |
| GAFF | c | sp2 C carbonyl group | |
| Aromatic | AMBER | C* | sp2 arom. 5 memb.ring w/1 subst. (TRP) |
| AMBER | CA | sp2 C pure aromatic (benzene) | |
| AMBER | CB | sp2 aromatic C, 5&6 membered ring junction | |
| AMBER | CC | sp2 aromatic C, 5 memb. ring HIS | |
| AMBER | CN | sp2 C aromatic 5&6 memb.ring junct.(TRP) | |
| AMBER | CR | sp2 arom as CQ but in HIS | |
| AMBER | CV | sp2 arom. 5 memb.ring w/1 N and 1 H (HIS) | |
| AMBER | CW | sp2 arom. 5 memb.ring w/1 N-H and 1 H (HIS) | |
| GAFF | ca | sp2 C in pure aromatic systems | |
| GAFF | cc | sp2 carbons in non-pure aromatic systems | |
| GAFF | cd | sp2 carbons in non-pure aromatic systems | |
| GAFF | cp | Head sp2 carbons connecting rings in bi-phenyls | |
| GAFF | cq | Head sp2 carbons connecting rings in bi-phenyls | |
| Aliphatic | AMBER | CT | sp3 aliphatic C |
| GAFF | c1 | sp C | |
| GAFF | c2 | sp2 C | |
| GAFF | c3 | sp3 C | |
| GAFF | ce | Inner sp2 carbons in conjugated systems | |
| GAFF | cf | Inner sp2 carbons in conjugated systems | |
| GAFF | cg | Inner sp carbons in conjugated systems | |
| GAFF | ch | Inner sp carbons in conjugated systems | |
| GAFF | cu | sp2 carbons in triangle systems | |
| GAFF | cv | sp2 carbons in square systems | |
| GAFF | cx | sp3 carbons in triangle systems | |
| GAFF | cy | sp3 carbons in square systems |
Figure 2Principle of Interaction Fingerprints: each element represents a unique putatively favorable interaction, which may be embodied by different topologically equivalent atoms. For example, the E26-R54 ionic contact above is ‘on’ (associated PIF element set to 1.0) as soon as either of equivalent HZ/R54 are within contact distance of either of OE/E26 (* meaning ‘either atom’). For all distances between equivalent atoms (dotted lines of a same color) the corresponding contact strength are calculated and eventually averaged according to Equation (7). This biased average (favoring strong contact contributions) is reported in the associated PIF cell (color matching distance lines).
Figure 3Calculation of the chirality index used to preserve the configuration of asymmetric carbons.
Considered FF parameter schemes.
| Parameter | Core FF | Preliminary FF | Fit FF |
|---|---|---|---|
| epsilon | 2 | 4 | 4 |
| desolv_factor | 0.0 | 0.1 | 0.1 |
| minq_to_desolv | 0 | 0.125 | 0.125 |
| hbond_bonus | 0 | 2 | 2 |
| repulsive_factor | 1.00 | 0.75 | 0.75 |
| vicinal_weight | 0.5 | 0.033 | 0.033 |
| desolv_scale_ion | – | 1.0 | 0.1 |
| desolv_scale_hb | – | 1.0 | 0.1 |
| Kpolarized | 0.0 | 0.1 | 0.01 |
| Karom | 0.0 | 0.1 | 0.15 |
| Kaliph | 0.0 | 0.1 | 0.15 |
Figure 4Strategy used in the force field parameter tuning protocol. ‘Decoy poses’ are non-native-like, to be distinguished from the native-like by a proper choice of FF parameters. These three parameters were subjected to a systematic scan. Within the scanned parameter space volume, the representative subset of possible FF configurations includes: Core FF, Preliminary FF setups, setups at the extremes (corners) and at the center of the scanned parameter ‘cube’. This representative subset is used to generate docking poses (one 400-generation run per FF configuration, per complex).
Figure 5Ability to sample X-ray binding modes (irrespective of their ranking in terms of energy), in function of the number of generations at different RMSD thresholds (Astex Diverse Set).
Triplets of weights which lead to the best rescoring results (the selected triplet is shown in bold).
| Number of Successfully Predicted Complexes | Weights (K) | ||
|---|---|---|---|
| Kpolarized | Karom | Kaliph | |
| 154/191 | 0.000 | 0.075 | 0.150 |
| 0.000 | 0.100 | 0.150 | |
| 0.000 | 0.150 | 0.150 | |
| 0.000 | 0.150 | 0.200 | |
| 0.000 | 0.200 | 0.200 | |
| 0.010 | 0.075 | 0.150 | |
| 0.010 | 0.100 | 0.150 | |
| 0.010 | 0.200 | 0.200 | |
| 0.025 | 0.150 | 0.150 | |
| 0.200 | 0.150 | 0.150 | |
Docking performance of several tools on the Astex Diverse Set (* Result from herein described benchmarking calculations, ** Statistics from closest protocols with respect to those presented here). With S4MPLE, Saved poses include the top 30 non-redundant (at minfpdiff = 0.01) most stable geometries.
| Docking Tools (Scoring) | Success Rate (%) | Success Rate (%) |
|---|---|---|
| S4MPLE (Core FF) | 76 | 93 |
| S4MPLE (Fit FF) | 85 | 96 |
| FlexX * | 71 | 91 |
| GOLD (Goldscore) ** [ | 75–81 | Unavailable |
| Plants (ChemPLP) [ | 87 | 97 |
| Plants (PLP) [ | 84 | |
| LGA (LargeAll) [ | 63 | Unavailable |
| RosettaLig [ | 58 | 92 |
| SKATE [ | 87 | 98 |
Figure 6Docking performances on the Astex Diverse Set for both Core FF and Fit FF.
Figure 7X-ray and top-ranked poses obtained using Fit FF and Core FF from the 9 PDB complexes discussed in the chapter §0 (Core vs. Fit). Native poses are shown in green, whereas Fit FF poses and Core FF are displayed in blue and purple respectively. Hydrogen-bonds are shown as black dotted line and ions are represented as spheres.
RMSD of top-ranked poses for Core FF and Fit FF from complexes discussed in the dedicated chapter (Core vs. Fit).
| PDB | Core FF | Fit FF |
|---|---|---|
| 1N2J | 3.85 | 1.42 |
| 1NAV | 6.26 | 0.35 |
| 1OQ5 | 0.92 | 2.83 |
| 1R58 | 3.03 | 0.74 |
| 1UML | 7.59 | 0.66 |
| 1V0P | 7.60 | 0.41 |
| 1W1P | 2.94 | 0.39 |
| 1XOQ | 4.04 | 0.30 |
| 1YVF | 6.11 | 0.89 |