| Literature DB >> 30767086 |
Patrick J Ropp1, Jesse C Kaminsky1, Sara Yablonski1, Jacob D Durrant2.
Abstract
Small-molecule protonation can promote or discourage protein binding by altering hydrogen-bond, electrostatic, and van-der-Waals interactions. To improve virtual-screen pose and affinity predictions, researchers must account for all major small-molecule ionization states. But existing programs for calculating these states have notable limitations such as high cost, restrictive licenses, slow execution times, and poor modularity. Here, we present dimorphite-DL 1.0, a fast, accurate, accessible, and modular open-source program for enumerating small-molecule ionization states. Dimorphite-DL uses a straightforward empirical algorithm that leverages substructure searching and draws on a database of experimentally characterized ionizable molecules. We have tested dimorphite-DL using several versions of Python and RDKit on all major operating systems. We release it under the terms of the Apache License, Version 2.0. A copy is available free of charge from http://durrantlab.com/dimorphite-dl/ .Entities:
Keywords: Drug discovery; Ionization; Modeling; Protonation; Virtual screening; pH
Year: 2019 PMID: 30767086 PMCID: PMC6689865 DOI: 10.1186/s13321-019-0336-9
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
The 38 ionizable dimorphite-DL substructures in order of decreasing priority from left to right, with representative compounds
Exact substructure definitions are given in Additional file 1: Table S1. The pKa range is the average of all associated pKa values in the database, plus or minus the standard deviation
Fig. 1A schematic representation of the dimorphite-DL approach. Each ionizable moiety is associated with a pKa range (rangePKA) defined by three parameters: µ, σ, and n. The user specifies a pH range (rangepH) and pKa precision factor (n; default: 1.0). The mean (µ) and standard deviation (σ) associated with each moiety are derived from the database of small molecules with experimentally characterized pKa values. If rangePKA is entirely less than rangepH, dimorphite-DL outputs a deprotonated molecule. If rangePKA is entirely greater than rangepH, dimorphite-DL outputs a protonated molecule. If rangePKA and rangepH overlap, dimorphite-DL outputs both deprotonated and protonated molecules
Dimorphite-DL accuracy
| pKa precision factor, | Correct (%) | Excess (%) | Incorrect (%) |
|---|---|---|---|
| 0.0 | 70.9 | 23.9 | 5.2 |
| 0.5 | 69.1 | 26.5 | 4.4 |
| 1.0 | 58.8 | 40.2 | 0.9 |
| 1.5 | 51.2 | 48.8 | 0.0 |
| 2.0 | 50.7 | 49.3 | 0.0 |
| 2.5 | 23.9 | 76.1 | 0.0 |
| 3.0 | 22.1 | 77.9 | 0.0 |
The percentage of molecules that are correctly/excessively/incorrectly protonated at different pKa precision factors (n), at physiological pH (6.4–8.4). To generate these statistics, we considered all 1938 compounds in our primary set, as well as the 78 additional phosphate and phosphonate compounds described in the Additional file 1
Dimorphite-DL accuracy at physiological pH (6.4–8.4) for five common moieties
| Correct (%) | Excess (%) | Incorrect (%) | |
|---|---|---|---|
| Amine (1°, 2°, and 3°) | 26.9 ± 3.0 | 73.1 ± 3.0 | 0.0 ± 0.0 |
| Carboxylic acid | 100.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Phenol | 33.7 ± 3.8 | 66.3 ± 3.8 | 0.0 ± 0.0 |
| Benzoic acid | 100.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Sulfonamide | 37.1 ± 11.6 | 62.9 ± 11.6 | 0.0 ± 0.0 |
Mean ± standard-deviation percentages were calculated using three-fold cross validation. The pKa precision factor (n) is 1.0. Additional file 1: Tables S2, S3, and S4 report similar accuracy measures for additional moieties, rangepH, and n