| Literature DB >> 27677751 |
Stefano Bosisio1, Antonia S J S Mey1, Julien Michel2.
Abstract
In the context of the SAMPL5 challenge water-cyclohexane distribution coefficients for 53 drug-like molecules were predicted. Four different models based on molecular dynamics free energy calculations were tested. All models initially assumed only one chemical state present in aqueous or organic phases. Model A is based on results from an alchemical annihilation scheme; model B adds a long range correction for the Lennard Jones potentials to model A; model C adds charging free energy corrections; model D applies the charging correction from model C to ionizable species only. Model A and B perform better in terms of mean-unsigned error ([Formula: see text] D units - 95 % confidence interval) and determination coefficient [Formula: see text], while charging corrections lead to poorer results with model D ([Formula: see text] and [Formula: see text]). Because overall errors were large, a retrospective analysis that allowed co-existence of ionisable and neutral species of a molecule in aqueous phase was investigated. This considerably reduced systematic errors ([Formula: see text] and [Formula: see text]). Overall accurate [Formula: see text] predictions for drug-like molecules that may adopt multiple tautomers and charge states proved difficult, indicating a need for methodological advances to enable satisfactory treatment by explicit-solvent molecular simulations.Entities:
Keywords: Distribution coefficient; SAMPL5
Mesh:
Substances:
Year: 2016 PMID: 27677751 PMCID: PMC5206288 DOI: 10.1007/s10822-016-9969-1
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 3.686
Fig. 1A Diagram of the dominant species approximation, B Diagram of the two-species approximation. Symbols are defined in the main text
Fig. 2Thermodynamic cycle for calculation. First the atoms’ partial charges are turned off retrieving , and in water, vacuum and cyclohexane phase respectively. This step is referred to as the ’discharging step’ in the main text. Then, van der Waals terms are switched off and , and are calculated in each phase. This step is referred to as the ’vanishing step’ in the main text. The diagram assumes no change in protonation state between solvated and vacuum phases
Fig. 3Computed solvation free energy in water (blue circles) and in cyclohexane (red triangles) for neutral compounds selected from the Minnesota Solvation Database [32] according to models A (A), B (B), and C (C). Model D is not shown since only neutral species are plotted, meaning that model D is equivalent to model B. The grey dashed line assumes a perfect correlation and the yellow shaded interval corresponds to an error of 1
Computed solvation free energy for Minnesota dataset [32]. is the absolute free energy of hydration and the absolute free energy of solvation in cyclohexane, both expressed in . A, B, C and D refer to the model described in section 2. MUE and denotes the mean unsigned error and the determination coefficient for the whole dataset. MUE neutral and shows the mean unsigned error and determination coefficient for the neutral species only. Model D for solvation free energies in cyclohexane is the same as model B. The notation z- z+ signifies 95 % confidence intervals computed from the bootstrapping of the data
| Molecule |
| A | B | C | D |
|---|---|---|---|---|---|
| Cyclohexane | 1.2 | 1.4 ± 0.1 | 1.2 ± 0.1 | 1.5 ± 0.1 | 1.2 ± 0.1 |
| Benzene | −0.9 | −0.5 ± 0.1 | −0.6 ± 0.1 | −0.2 ± 0.1 | −0.6 ± 0.1 |
| Acetic acid | −6.7 | −6.2 ± 0.1 | −6.3 ± 0.1 | −6.7 ± 0.2 | −6.3 ± 0.1 |
| Trimethylamine | −3.2 | −2.6 ± 0.1 | −2.8 ± 0.1 | −2.1 ± 0.1 | −2.8 ± 0.1 |
| Chlorobenzene | −1.1 | −0.2 ± 0.1 | −0.5 ± 0.1 | −0.1 ± 0.1 | −0.5 ± 0.1 |
| Methanol | −5.1 | −3.5 ± 0.1 | −3.5 ± 0.1 | −3.2 ± 0.3 | −3.5 ± 0.1 |
| n-Propane | 2.0 | 2.6 ± 0.1 | 2.5 ± 0.1 | 2.6 ± 0.1 | 2.5 ± 0.1 |
| Pyridine | −4.7 | −3.3 ± 0.1 | −3.4 ± 0.1 | −3.3 ± 0.1 | −3.4 ± 0.1 |
| Phenol | −6.6 | −5.7 ± 0.1 | −5.9 ± 0.1 | −4.5 ± 0.6 | −5.9 ± 0.1 |
| Acetone | −3.9 | −3.6 ± 0.1 | −3.6 ± 0.1 | −3.6 ± 0.1 | −3.6 ± 0.1 |
| Aniline | −5.5 | −5.2 ± 0.1 | −5.4 ± 0.1 | −4.8 ± 0.4 | −5.4 ± 0.1 |
| Trimethylammonium | −61.4 | −24.7 ± 0.1 | −24.8 ± 0.1 | −61.4 ± 0.3 | −61.4 ± 0.3 |
| Acetate | −77.6 | −74.8 ± 0.1 | −74.9 ± 0.2 | −81.1 ± 0.3 | −81.1 ± 0.3 |
| MUE | 3.58 < 3.63 < 3.69 |
|
|
| |
|
|
|
|
|
| |
| MUE neutral |
|
|
|
| |
|
|
|
|
|
|
Fig. 4Scatter plots of computed for molecules modelled as neutral in water and in cyclohexane (A–C) and molecules modelled as charged in water and neutral in cyclohexane (D–F) molecules according to model A (top, blue circles), model B (middle, green triangles) and model C (bottom, red squares); MUE and values are given with 95 % confidence intervals and MUE in units. The grey dashed line assumes a perfect correlation and the yellow shaded interval corresponds to an error of 1
Comparison between and MUE for model A, B, C and D considering the whole dataset ( and MUE) or neutral molecules ( neutral and MUE neutral) or protonated species only ( charged and MUE charged) for the dominant species approximation. All MUE are given in units. The notation z− z+ signifies 95 % confidence intervals taken from the bootstrapping of the data
|
|
| |
|---|---|---|
|
|
|
|
| MUE |
|
|
|
|
|
|
| MUE neutral |
|
|
|
|
|
|
| MUE charged |
|
|
Fig. 5Scatter plot of estimation with the two-species model, for the subset of molecules predicted to co-exist in charged and neutral forms in aqueous phase, according to model A (blue circles), model B (green circles), model C (red circles), model D (purple circles)
Comparison between and MUE for model A, B, C and D considering the whole dataset ( and MUE) or protonated species only ( charged and MUE charged) for the two-species approximation. All MUE give in units. The notation z − z+ signifies 95 % confidence intervals taken from the bootstrapping of the data
|
|
| |
|---|---|---|
|
|
|
|
| MUE |
|
|
|
|
|
|
| MUE charged |
|
|
Fig. 6Comparison between SOMD and Gromacs A red circles, hydration free energy B blue circles and solvation free energy in cyclohexane C green triangles. All MUD values of solvation free energies are given in . Dashed red line shows perfect correlation between datasets and shaded yellow area a 1 (A) and 1 (B and C) deviation bound