| Literature DB >> 20098496 |
Jacob D Durrant1, Rommie E Amaro, Lei Xie, Michael D Urbaniak, Michael A J Ferguson, Antti Haapalainen, Zhijun Chen, Anne Marie Di Guilmi, Frank Wunder, Philip E Bourne, J Andrew McCammon.
Abstract
Conventional drug design embraces the "one gene, one drug, one disease" philosophy. Polypharmacology, which focuses on multi-target drugs, has emerged as a new paradigm in drug discovery. The rational design of drugs that act via polypharmacological mechanisms can produce compounds that exhibit increased therapeutic potency and against which resistance is less likely to develop. Additionally, identifying multiple protein targets is also critical for side-effect prediction. One third of potential therapeutic compounds fail in clinical trials or are later removed from the market due to unacceptable side effects often caused by off-target binding. In the current work, we introduce a multidimensional strategy for the identification of secondary targets of known small-molecule inhibitors in the absence of global structural and sequence homology with the primary target protein. To demonstrate the utility of the strategy, we identify several targets of 4,5-dihydroxy-3-(1-naphthyldiazenyl)-2,7-naphthalenedisulfonic acid, a known micromolar inhibitor of Trypanosoma brucei RNA editing ligase 1. As it is capable of identifying potential secondary targets, the strategy described here may play a useful role in future efforts to reduce drug side effects and/or to increase polypharmacology.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20098496 PMCID: PMC2799658 DOI: 10.1371/journal.pcbi.1000648
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1The strategy workflow.
Circles and squares represent protein chains. Homologous chains share the same color. From each group of homologous chains, one is selected as representative and is shown as a square. (A) As the PDB has approximately 110,000 protein chains, identifying secondary targets directly is computationally intractable. (B) To reduce the number of chains, all chains are grouped by sequence homology into 12,646 clusters, and (C) a single representative chain is selected from each cluster. The set of all representative chains is called the PDB30. (D) SOIPPA is used to eliminate all protein chains in the PDB30 with active sites that are dissimilar to that of the primary target, TbREL1. Only 218 chains remain. (E) A new set of 645 protein-chain structures called the PDBr is created by taking the union of all those clusters whose representative PDB30 protein chains have active sites that are not dissimilar to that of TbREL1. (F) Redundant chains are ignored; compound 1 is docked into the remaining 87 protein chains. Chains are ranked by their corresponding docking scores.
Selected predicted secondary targets of compound 1 in humans.
| Protein | AD Score | SOIPPA | Sequence Identity | FATCAT | |
| Metabolism | UDP-glucose 4-epimerase (1I3L:B) | −11.22 | 3% | 0.613 | |
| NAGK (2CH5:C) | −10.75 | 3.45×10−2 | 9% | 0.351 | |
| acetyl-CoA carboxylase 2 (2HJW:A) | −10.28 | 7.82×10−3 | 5% | 0.423 | |
| mitochondrial 2-enoyl thioester reductase (1ZSY:A) | −10.04 | <1×10−5 | 4% | 0.754 | |
| tubby isoform A (1S31:A) | −9.17 | 6% | 0.615 | ||
| DNA synthesis, repair, replication | DNA ligase I (1X9N:A, residues 535–751) | −9.70 | <1×10−5 | 5% | 3.82×10−3 |
| 3-methyl-adenine DNA glycosylase (1EWN:A) | −9.09 | 1.18×10−2 | 10% | 0.800 | |
| thymidylate synthase (1I00:A) | −8.50 | 3% | 0.237 | ||
| Amino acid synthesis | pyrroline-5-carboxylate reductase 1 (2GR9:B) | −10.49 | 4.28×10−2 | 1% | 0.702 |
| Blood clotting | fibrinogen (1FZE:B) | −9.53 | 4% | 0.626 | |
| Vision | tubby related 1 (2FIM:B) | −10.21 | 2.60×10−2 | 3% | 0.637 |
| Nuclear transport | snurportin-1 (1XK5:A) | −10.10 | <1×10−5 | 4% | 6.07×10−4 |
Human secondary targets are involved in metabolism; polynucleotide synthesis, repair, and replication; amino acid synthesis; blood clotting; vision; and nuclear transport. “AD score” refers to the AutoDock-predicted energy of binding to 1; “SOIPPA p-value” refers to the similarity between the secondary-target and TbREL1 active sites; “sequence identity” refers to the percent amino-acid identity with TbREL1 as computed by the CLUSTALW2 algorithm; and “FATCAT p-value” refers to the structural similarity between the secondary target and TbREL1. Protein sequences were extracted from PDB seqres headers.
Selected predicted secondary targets of compound 1 in pathogens.
| Protein | Species | AD Score | SOIPPA | Sequence Identity | FATCAT |
| probable ATP-dependent DNA ligase (2FAO:A) |
| −10.57 | 1.34×10−2 | 5% | 0.440 |
| UDP-galactose 4-epimerase (1GY8:D) |
| −10.29 | 1% | 0.622 | |
| dTDP-D-glucose 4,6-dehydratase (1KET:B) |
| −9.55 | 5% | 0.502 | |
| dihydrofolate reductase-thymidylate (1J3I:C) |
| −9.53 | 1.34×10−2 | 5% | 0.428 |
| DNA ligase, NAD-dependent (1TAE:B) |
| −9.49 | 8% | 3.56×10−2 | |
| dTDP-D-glucose 4,6-dehydratase (1G1A:C) |
| −9.24 | 1% | 0.529 | |
| adenine phosphoribosyltransferase (1MZV:A) |
| −8.61 | 11% | 0.649 | |
| UTP-gluc-1-P uridylyltransferase 2 (2OEG:A) |
| −8.56 | 7.82×10−3 | 8% | 0.724 |
| purine nucleoside phosphorylase (2B94:A) |
| −7.61 | 2% | 0.650 | |
| DNA ligase (1ZAU:A) |
| −6.75 | 4% | 2.82×10−2 |
“AD score” refers to the AutoDock-predicted energy of binding to 1; “SOIPPA p-value” refers to the similarity between the secondary-target and TbREL1 active sites; “sequence identity” refers to the percent amino-acid identity with TbREL1 as computed by the CLUSTALW2 algorithm; and “FATCAT p-value” refers to the structural similarity between the secondary target and TbREL1. Protein sequences were extracted from PDB seqres headers.
Global sequence and structural homology between HsLigI, residues 535 to 751, and three other selected DNA ligases.
| Identity Similarity | FATCAT | |
|
| 31% | - |
|
| 4% | 2.36×10−7 |
|
| 4% | 1.07×10−6 |