| Literature DB >> 28630414 |
Antonio Peón1,2,3,4, Stefan Naulaerts1,2,3,4, Pedro J Ballester5,6,7,8.
Abstract
Many computational methods to predict the macromolecular targets of small organic molecules have been presented to date. Despite progress, target prediction methods still have important limitations. For example, the most accurate methods implicitly restrict their predictions to a relatively small number of targets, are not systematically validated on drugs (whose targets are harder to predict than those of non-drug molecules) and often lack a reliability score associated with each predicted target. Here we present a systematic validation of ligand-centric target prediction methods on a set of clinical drugs. These methods exploit a knowledge-base covering 887,435 known ligand-target associations between 504,755 molecules and 4,167 targets. Based on this dataset, we provide a new estimate of the polypharmacology of drugs, which on average have 11.5 targets below IC50 10 µM. The average performance achieved across clinical drugs is remarkable (0.348 precision and 0.423 recall, with large drug-dependent variability), especially given the unusually large coverage of the target space. Furthermore, we show how a sparse ligand-target bioactivity matrix to retrospectively validate target prediction methods could underestimate prospective performance. Lastly, we present and validate a first-in-kind score capable of accurately predicting the reliability of target predictions.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28630414 PMCID: PMC5476590 DOI: 10.1038/s41598-017-04264-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Ligand-centric target prediction workflow. The selected molecular similarity method is used to find the top k most similar database molecules to the query molecule (vorinostat in this illustrative example). Known targets for the query and the top k hit molecules are retrieved from the ChEMBL database. A novel method is introduced to assign a reliability score for each query-target association prediction based on the proportion of the query’s top hits binding to the predicted target. Lastly, the known targets of the query molecule permit measuring the predictive performance of the method at each reliability level in this binary classification problem.
Change in test set performance of the same ligand-centric target prediction method depending on the employed knowledge-base.
| Method | avNPT | avACC | avPR | avRC | avMCC | avTN | avFP | avFN | avTP |
|---|---|---|---|---|---|---|---|---|---|
| Peon | 7.9 | 0.996 | 0.296 | 0.403 | 0.300 | 3016.8 | 5.9 | 6.3 | 2.0 |
| This paper | 11.4 | 0.996 | 0.311 | 0.384 | 0.305 | 4186.1 | 8.4 | 8.5 | 3.0 |
The expanded knowledge-base can be found in the last row.
Performance of the tested ligand-centric methods averaged over query molecules sorted by descending avMCC value.
| Method | avNPT | avACC | avPR | avRC | avMCC |
|---|---|---|---|---|---|
| Morgan_hashed_bv_2_2048 | 11.7 | 0.996 | 0.348 | 0.423 | 0.339 |
| Morgan_hashed_bv_2_512 | 11.7 | 0.996 | 0.345 | 0.421 | 0.337 |
| Morgan_hashed_bv_2_1024 | 11.6 | 0.996 | 0.345 | 0.42 | 0.336 |
| Morgan_bv_2_2048 | 11.7 | 0.996 | 0.342 | 0.424 | 0.335 |
| Morgan_hashed_bv_3_512 | 11.5 | 0.996 | 0.346 | 0.416 | 0.334 |
| Morgan_hashed_bv_3_1024 | 11.7 | 0.996 | 0.344 | 0.417 | 0.334 |
| FeatMorgan_bv_3_512 | 11.6 | 0.996 | 0.347 | 0.416 | 0.332 |
| Morgan_bv_2_1024 | 11.6 | 0.996 | 0.341 | 0.42 | 0.332 |
| FeatMorgan_bv_3_2048 | 11.6 | 0.996 | 0.345 | 0.416 | 0.332 |
| Morgan_hashed_bv_3_2048 | 11.7 | 0.996 | 0.342 | 0.414 | 0.332 |
| FeatMorgan_bv_2_2048 | 11.7 | 0.996 | 0.345 | 0.418 | 0.331 |
| FeatMorgan_bv_3_1024 | 11.6 | 0.996 | 0.346 | 0.415 | 0.331 |
| Morgan_bv_2_512 | 11.7 | 0.996 | 0.341 | 0.415 | 0.331 |
| Morgan_bv_3_2048 | 11.5 | 0.996 | 0.343 | 0.413 | 0.331 |
| Morgan_bv_3_512 | 11.5 | 0.996 | 0.34 | 0.412 | 0.329 |
| FeatMorgan_bv_2_1024 | 11.7 | 0.996 | 0.343 | 0.414 | 0.329 |
| Morgan_bv_3_1024 | 11.5 | 0.996 | 0.34 | 0.41 | 0.328 |
| FeatMorgan_bv_2_512 | 11.8 | 0.996 | 0.342 | 0.415 | 0.328 |
| RDKit_2_7_2048_2 | 11.8 | 0.996 | 0.34 | 0.4 | 0.323 |
| RDKit_2_7_1024_1 | 11.7 | 0.996 | 0.336 | 0.396 | 0.319 |
| RDKit_2_7_2048_3 | 11.9 | 0.996 | 0.333 | 0.395 | 0.318 |
| RDKit_2_7_1024_2 | 11.5 | 0.996 | 0.331 | 0.392 | 0.316 |
| MACCS keys | 11.4 | 0.996 | 0.311 | 0.384 | 0.305 |
Each method is named after the employed fingerprint, as the remaining components are common to all methods.
Performance of the best method in Table 2 (Tanimoto score on Morgan_hashed_bv_2_2048 fingerprints) using now similarity cutoffs 90%, 80%, 70%, 60% and 50% instead of the top 10 hits.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 90 | 347 | 398 | 2.22 | 5.07 | 0.243 |
| 80 | 258 | 487 | 4.13 | 5.69 | 0.295 |
| 70 | 151 | 594 | 9.51 | 8.06 | 0.333 |
| 60 | 69 | 676 | 21.03 | 11.66 | 0.338 |
| 50 | 30 | 715 | 54.82 | 21.43 | 0.323 |
nNullQueries is the number of query molecules for which no hits are found. In contrast, nQueries is the number of query molecules for which at least a hit is found (thus performance is now averaged over nQueries). AvNHITS is the average number of database molecules with similarity scores above the cutoff. The 60% cutoff provides the best performance (avMCC = 0.338 leaving 69 drugs without predicted targets), which is slightly worse than that from using the top 10 most similar this (avMCC = 0.339 leaving no drugs without predicted targets).
True-positive and false-positive target predictions for the test set of 745 approved drugs grouped by the reliability score L.
|
|
|
|
| % | % |
|---|---|---|---|---|---|
| 0.1 | 1,080 | 4,634 | 0.2 | 19% | 81% |
| 0.2 | 399 | 1,000 | 0.4 | 29% | 71% |
| 0.3 | 267 | 378 | 0.7 | 41% | 59% |
| 0.4 | 163 | 154 | 1.1 | 51% | 49% |
| 0.5 | 123 | 65 | 1.9 | 65% | 35% |
| 0.6 | 77 | 39 | 2.0 | 66% | 34% |
| 0.7 | 58 | 25 | 2.3 | 70% | 30% |
| 0.8 | 74 | 12 | 6.2 | 86% | 14% |
| 0.9 | 53 | 5 | 10.6 | 91% | 9% |
| 1.0 | 74 | 7 | 10.6 | 91% | 9% |
From L ≥ 0.4, TP is higher than FP. Importantly, %TP is strongly correlated with the reliability score L.
Figure 2Boxplot showing how the precision of predicted targets varies depending on L (NB: l1 is l = 1 and corresponds to L = 0.1). The mean precision for a given L is marked with a blue line, whereas the median is given by a red line.
Performance results for the best method (quantification of results from Fig. 2).
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | 0.1 | 662 | 5,714 | 0.191 | 0.1 |
| 2 | 0.2 | 514 | 1,399 | 0.309 | 0.0 |
| 3 | 0.3 | 362 | 645 | 0.417 | 0.0 |
| 4 | 0.4 | 228 | 317 | 0.505 | 0.5 |
| 5 | 0.5 | 142 | 188 | 0.641 | 1 |
| 6 | 0.6 | 95 | 116 | 0.626 | 1 |
| 7 | 0.7 | 65 | 83 | 0.714 | 1 |
| 8 | 0.8 | 77 | 86 | 0.857 | 1 |
| 9 | 0.9 | 49 | 58 | 0.898 | 1 |
| 10 | 1.0 | 70 | 81 | 0.929 | 1 |
The mean and median values for precision (PR) are shown, as well as the number of query molecules with a given l value.
Figure 3The top 10 hits for the Busulfan query are shown ranked by similarity. The approved drug Busulfan (ChEMBL820) is an alkylating agent. All these hits bind to the predicted target, Carbonic anhydrase 2 (target CHEMBL205), and hence this is a L = 1 target prediction. Since Carbonic anhydrase 2 is not a known target of Busulfan, this is one of the seven L = 1 false positives. This seems to be a genuine false positive due to the relatively low similarity of the hits to the query molecule (56.5–31.5%).
Figure 4Chemical structures of the 10 most similar database molecules to Bexarotene. This is a L = 1 true positive prediction as these molecules bind the predicted target (Retinoid X receptor alpha).