| Literature DB >> 34244905 |
Antonio Viayna1, Silvana Pinheiro2, Carles Curutchet3, F Javier Luque4, William J Zamora5,6.
Abstract
Within the scope of SAMPL7 challenge for predicting physical properties, the Integral Equation Formalism of the Miertus-Scrocco-Tomasi (IEFPCM/MST) continuum solvation model has been used for the blind prediction of n-octanol/water partition coefficients and acidity constants of a set of 22 and 20 sulfonamide-containing compounds, respectively. The log P and pKa were computed using the B3LPYP/6-31G(d) parametrized version of the IEFPCM/MST model. The performance of our method for partition coefficients yielded a root-mean square error of 1.03 (log P units), placing this method among the most accurate theoretical approaches in the comparison with both globally (rank 8th) and physical (rank 2nd) methods. On the other hand, the deviation between predicted and experimental pKa values was 1.32 log units, obtaining the second best-ranked submission. Though this highlights the reliability of the IEFPCM/MST model for predicting the partitioning and the acid dissociation constant of drug-like compounds compound, the results are discussed to identify potential weaknesses and improve the performance of the method.Entities:
Keywords: Conformational study; Continuum solvation models; MST model; Physical properties; SAMPL7; Solvation free energy; Water-octanol log P; pK a
Mesh:
Substances:
Year: 2021 PMID: 34244905 PMCID: PMC8295120 DOI: 10.1007/s10822-021-00394-6
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 3.686
Scheme 1Thermodynamic cycles used to determine (left) the transfer free energy of a neutral (HX) compound between n-octanol and water, and (right) the pKa estimation of a titratable compound, where HX and X− stand for the acidic and basic species, respectively
Fig. 1Dataset of 22 small molecules proposed in the SAMPL7 log P challenge
Calculated (ID TFE IEFPCM MST) and experimental n-octanol/water partition coefficient (log P) determined for the set of compounds included in the SAMPL7 dataset
| Compound | Calculated | Experimentala | Δlog |
|---|---|---|---|
| SM25 | 1.89 | 2.67 | − 0.78 |
| SM26 | − 0.21 | 1.04 | − 1.25 |
| SM27 | 1.76 | 1.56 | 0.20 |
| SM28 | 0.83 | 1.18 | − 0.35 |
| SM29 | 1.24 | 1.61 | − 0.37 |
| SM30 | 3.54 | 2.76 | 0.78 |
| SM31 | 1.62 | 1.96 | − 0.34 |
| SM32 | 1.64 | 2.44 | − 0.80 |
| SM33 | 4.29 | 2.96 | 1.33 |
| SM34 | 2.40 | 2.83 | − 0.43 |
| SM35 | 0.77 | 0.88 | − 0.11 |
| SM36 | 3.75 | 0.76 | |
| SM37 | 1.88 | 1.45 | 0.43 |
| SM38 | 0.48 | 1.03 | − 0.55 |
| SM39 | 2.48 | 1.89 | 0.59 |
| SM40 | 1.43 | 1.83 | − 0.40 |
| SM41 | 0.88 | 0.58 | 0.30 |
| SM42 | 3.75 | 1.76 | |
| SM43 | 1.85 | 0.85 | 1.00 |
| SM44 | − 0.16 | 1.16 | − 1.32 |
| SM45 | 2.04 | 2.55 | − 0.51 |
| SM46 | 0.95 | 1.72 | − 0.77 |
| mseb | − 0.07 | ||
| mueb | 0.80 | ||
| rmsdb | 1.03 |
Bold values indicate compounds with the largest deviation (> 1.50 log P units) between predicted and experimental values
aSee [39]
bMean signed error (mse), mean unsigned error (mue), and root-mean square deviation (rmsd) calculated relative to the experimental values (log P units)
Fig. 2Comparison between experimental and IEFPCM/MST n-octanol/water log P for the SAMPL7 dataset. Red points represent the compounds with the largest errors in the original submission. Statistical analyses are shown for (top left) all compounds and (bottom right) after exclusion of SM36 and SM42
Fig. 3Comparison between experimental and IEFPCM/MST n-octanol/water log P for the combined dataset including the 11 fragment-like small molecules in the SAMPL6 log P challenge (blue) and 22N-acylsulfonamides in the SAMPL7 log P challenge (lightblue). The red point represents the compound with the largest error in the final dataset. Statistical analyses are shown for (top left) all compounds and (bottom right) after exclusion of SM36
Calculated (ID IEFPCM MST) and experimental pKa determined for the set of compounds included in the SAMPL7 dataset
| Compound | Calculated | Experimentala | Δp |
|---|---|---|---|
| SM25 | 7.24/3.30 | 4.49 | |
| SM26 | 4.52 | 4.91 | − 0.39 |
| SM27 | 12.34 | 10.45 | |
| SM28 | 16.12 | > 12.00 | – |
| SM29 | 11.51 | 10.05 | 1.46 |
| SM30 | 11.00 | 10.29 | 0.71 |
| SM31 | 10.84 | 11.02 | − 0.18 |
| SM32 | 11.95 | 10.45 | 1.50 |
| SM33 | 10.69 | > 12.00 | – |
| SM34 | 10.64 | 11.93 | − 1.24 |
| SM35 | 10.28 | 9.87 | 0.41 |
| SM36 | 9.20 | 9.8 | − 0.6 |
| SM37 | 8.11 | 10.33 | − |
| SM38 | 9.82 | 9.44 | 0.38 |
| SM39 | 8.85 | 10.22 | − 1.37 |
| SM40 | 8.26 | 9.58 | − 1.32 |
| SM41 | 5.13 | 5.22 | − 0.09 |
| SM42 | 4.86 | 6.62 | − |
| SM43 | 4.43 | 5.62 | − 1.19 |
| SM44 | 7.09 | 6.34 | 0.75 |
| SM45 | 7.37 | 5.93 | 1.44 |
| SM46 | 5.56 | 6.42 | − 0.86 |
| mse | 0.00 | ||
| mue | 1.13 | ||
| rmsd | 1.32 |
Bold values indicate the compounds with the largest deviation (> 1.50 in pKa units) between theoretical and experimental values. For SM25, the value of the original submission and the corrected one during the revision of the calculated data are indicated as plain text and in italics, respectively
aRef [43]
Fig. 4Comparison between experimental and IEFPCM/MST pKa for the SAMPL7 Dataset. Red points denote compounds with the largest errors in the original submission. Statistical analyses are shown for (top left) all compounds and (bottom right) after exclusion of SM25, SM27, SM37 and SM42
Comparative results of the four highly deviated compounds with the first (ID EC_RISM) and third (ID TZVP_QM) ranked methods in the SAMPL7 pKa challenge
| Compound | Exp | Calculated IEFPCM/MST | Calculated EC_RISM | Δp | Calculated TZVP_QM | Δp |
|---|---|---|---|---|---|---|
| SM25 | 4.49 | 7.24 | 5.42 | − 0.93 | 7.34 | − 2.85 |
| SM27 | 10.45 | 12.34 | 10.17 | 0.28 | 7.65 | 2.80 |
| SM37 | 10.33 | 8.11 | 9.95 | 0.38 | 6.77 | 3.56 |
| SM42 | 6.62 | 4.86 | 5.59 | 1.03 | 7.45 | − 0.83 |
Fig. 5Microstates involved in the error of SM25 pK estimate