| Literature DB >> 29868550 |
Hongbin Huang1,2, Guigui Zhang1,3, Yuquan Zhou1,2, Chenru Lin1,3, Suling Chen1,2, Yutong Lin1,3, Shangkang Mai1,2, Zunnan Huang1,3.
Abstract
This article is a systematic review of reverse screening methods used to search for the protein targets of chemopreventive compounds or drugs. Typical chemopreventive compounds include components of traditional Chinese medicine, natural compounds and Food and Drug Administration (FDA)-approved drugs. Such compounds are somewhat selective but are predisposed to bind multiple protein targets distributed throughout diverse signaling pathways in human cells. In contrast to conventional virtual screening, which identifies the ligands of a targeted protein from a compound database, reverse screening is used to identify the potential targets or unintended targets of a given compound from a large number of receptors by examining their known ligands or crystal structures. This method, also known as in silico or computational target fishing, is highly valuable for discovering the target receptors of query molecules from terrestrial or marine natural products, exploring the molecular mechanisms of chemopreventive compounds, finding alternative indications of existing drugs by drug repositioning, and detecting adverse drug reactions and drug toxicity. Reverse screening can be divided into three major groups: shape screening, pharmacophore screening and reverse docking. Several large software packages, such as Schrödinger and Discovery Studio; typical software/network services such as ChemMapper, PharmMapper, idTarget, and INVDOCK; and practical databases of known target ligands and receptor crystal structures, such as ChEMBL, BindingDB, and the Protein Data Bank (PDB), are available for use in these computational methods. Different programs, online services and databases have different applications and constraints. Here, we conducted a systematic analysis and multilevel classification of the computational programs, online services and compound libraries available for shape screening, pharmacophore screening and reverse docking to enable non-specialist users to quickly learn and grasp the types of calculations used in protein target fishing. In addition, we review the main features of these methods, programs and databases and provide a variety of examples illustrating the application of one or a combination of reverse screening methods for accurate target prediction.Entities:
Keywords: drug design; methodology; online service; pharmacophore modeling; reverse docking; reverse screening; screening databases; shape similarity
Year: 2018 PMID: 29868550 PMCID: PMC5954125 DOI: 10.3389/fchem.2018.00138
Source DB: PubMed Journal: Front Chem ISSN: 2296-2646 Impact factor: 5.221
Figure 1Compounds described in the manuscript.
Figure 2The principle and workflow of shape screening, pharmacophore screening, and reverse docking.
Figure 3Software and online services for shape screening, pharmacophore screening, and reverse docking.
Characteristics of reverse screening tools.
| SuperPred | SMILES, Pubchem-Name | 2D similarity | 341,000 compounds, 1800 targets, 665,000 compound-target interactions | 2014 | 2014 | Accessible | |
| HitPick | SMILES | 1NN similarity searching and Laplacian-modified naïve Bayesian target models | 145,549 chemical-protein interactions collected from STITCH 3.1 | 2013 | 2013 | Accessible | |
| ChemMapper | SMILES, MOL2, SDF, SMI | SHAFTS, USR, FP2, MACCS, and random walk algorithm | Nearly 300,000 chemical structures and >3 million compounds | 2013 | 2016 | Accessible | |
| SEA server | SMILES | FP2 and BLAST-like model | Hundreds of target-ligand sets | 2007 | 2007 | Accessible | |
| ReverseScreen3D | SMILES | Hybrid 2D&3D | Automatically updated from RCSB PDB | 2011 | 2011 | Inaccessible | |
| TarPred | SMILES | KNN-based data fusion with molecular similarity | 533 individual targets with 179,807 active ligands | 2015 | 2015 | Inaccessible | |
| SwissTargetPrediction | SMILES | Five species, FP2, 3D similarity | 280,000 compounds and >2000 targets | 2014 | 2014 | Accessible | |
| SwissSimilarity | SMILES | FP2, five 3D methods | >30 chemical databases covering drugs, bioactive compounds, etc. | 2016 | 2016 | Accessible | |
| ChemProt | SMILES, name | MACCS, FP2, daylight-like fingerprints | >1.7 million unique chemicals and >20,000 proteins | 2011 | 2016 | Accessible | |
| TargetHunter | SMILES | ECFP6, ECFP4, FP2 | CHEMBL Version 22 | 2013 | 2016 | accessible | |
| CSNAP3D | SMILES, SDF | 3D similarity, network algorithms | Based on CHEMBL database | 2016 | 2016 | accessible | |
| ROCS | SDF, MOL2, PDB | 3D similarity | User prepared | 2006 | 2017 | Accessible | |
| PharmMapper | MOL2, SDF | Pharmacophores | 23,236 proteins covering >53,000 pharmacophore models | 2010 | 2017 | Accessible | |
| TarFisDock | MOL2 | DOCK4.0 | Based on the PDTD, which contains 1207 entries covering 841 known and potential drug targets | 2006 | 2008 | Inaccessible | |
| idTarget | PDB, MOL2, pdbqt, cif | MeDock, divide-and-conquer | All protein structures in the PDB | 2012 | 2012 | Accessible | |
| INVDOCK | NA | DOCK | An in-house database (9000 protein and nucleic acid entries) | 2001 | NA | Accessible | |
| Discovery Studio | SDF, MOL2, PDB | Pharmacophores | 140,000 receptor-ligand pharmacophore models. | 2012 | 2017 | Accessible |
Figure 4The relationships among direct databases, indirect databases, and reference databases used in reverse screening.
Characteristics of indirect and reference databases.
| STITCH | A database of known and predicted interactions between chemicals and proteins | 9.6 million proteins, 0.5 million chemicals | 2017 | |
| HMDB | A comprehensive, high-quality, freely accessible online database of small-molecule metabolites found in the human body | 74,507 metabolite entries, 5,701 protein sequences linked to metabolite entries | 2017 | |
| ZINC | Database of commercially available compounds for virtual screening | >100 million purchasable compounds | 2015 | |
| ChEMBL | An open large-scale bioactivity database | 2,101,843 compound records, 1,735,442 distinct compounds | 2017 | |
| BindingDB | Database of measured binding affinities | 2037 FDA-approved small-molecule drugs, 241 FDA-approved biotech (protein/peptide) drugs, 96 nutraceuticals, >6000 experimental drugs | 2011 | |
| DrugBank | Database combining detailed drug data with comprehensive drug target information | 8,261 drug entries, 4,338 non-redundant protein sequences | 2017 | |
| PDB | Crystal structures of macromolecules and ligands | 133,093 macromolecular structures, 53,025 citations, 23,711 ligands | 2017 | |
| scPDB | Druggable binding sites from the PDB | 9283 entries, 3678 proteins, 5608 ligands | 2013 | |
| TTD | Information on therapeutic protein and nucleic acid targets, relevant diseases, pathway information and the corresponding drugs | 2,589 targets, 31,614 drugs, 20,278 small molecules, 653 antisense drugs | 2011 | |
| PubChem | Information on chemical substances and their biological activities | >235 million substance descriptions, 90 million unique chemical structures, 230 million bioactivity outcomes from over one million biological assay experiments | 2017 | |
| CHEBI | A database and ontology of molecular entities focused on “small” chemical compounds | >52,450 compounds | 2017 | |
| UniProt | Resource for protein sequences and functional information | Swiss-Prot (555,100), TrEMBL (88,032,926) | 2017 | |
| PDSP Ki database | A unique resource in the public domain that provides information on the abilities of drugs to interact with an expanding number of molecular targets | 60,231 annotated ligands | 2017 | |
Applications of shape screening in predicting protein targets of small molecules.
| Prozac1, Vadilex2, Rescriptor3 | β1 receptor1, 5-HT transporter2, HRH43 | SEA | Keiser et al., |
| Wuweizi (compound 11/12) | GBA311, 12, SHBG11, 12 | SEA | Wang et al., |
| Lignan | 5-HT1AR | SEA | Zheng et al., |
| Plumbagin | TrxR, GR' | SEA | Hwang et al., |
| Obacunone | MIF | SEA | Gao et al., |
| 5-aza-dC | HDM2 | SuperPred | Putri et al., |
| Sini decoction (aconitine1, liquiritin2, 6-gingerol3) | ADRB11, ACE2, HMGCR1, 3 | TargetHunter | Zhang H. et al., |
| Salvinorin A | OPRK, CB1, CB2, DRD2 | TargetHunter | Xu et al., |
| NBP | NQO1, IDO, NADH-ubiquinone oxidoreductase | SEA | Wang Y. et al., |
| Quinoline derivative (83b1) | PPARδ | SEA | Pun et al., |
| Tributyltin (Ch-409) | RamC | SwissTargetPrediction | Waseem et al., |
| Xeronine | Adr A3, TDP1, muscleblind-like proteins 1 | SwissTargetPrediction | Sanni et al., |
| Methadone1, emetine2, loperamide3 | Muscarinic M31, α2 adrenergic2, NK2 receptors3 | SEA | Keiser et al., |
| CID 46907796 | Nrf2 | TargetHunter | Wang L. et al., |
| Chlorotrianisene | COX-1, ESR1 | SwissTargetPrediction | Gfeller et al., |
| Entecavir | POLB | TarPred | Liu et al., |
| Taxol mimetics | Tubulin | CSNAP3D | Lo et al., |
| Caffeine | D2R | ChemProt | Kringelum et al., |
Target prediction confirmed by the literature. Superscript values denotes that the protein targets in the second column correspond to the query molecules in the first column respectively.
Applications of pharmacophore screening in predicting protein targets of small molecules.
| CT | MAP2K1 | PharmMapper | Yuan et al., |
| Arctigenin | PDK1 | PharmMapper | Fang et al., |
| HSYA | XO | PharmMapper | Xu et al., |
| ZYZ-488 | Apaf-1 | PharmMapper | Wang Y. et al., |
| NCI 748494/1 | c-Met kinase | PharmMapper | El-Wakil et al., |
| UA | CASP-3, JNK2, ERK1 | PharmMapper | Ma et al., |
| BBR | GR, p38, DHODH | PharmMapper | Liu et al., |
| 5,7-dihydroxy-4′-methoxy-8-prenylflavanone | AChE | PharmMapper | Das et al., |
| Phytoestrogens (genistein1, daidzein2, secoisolariciresinol3) | AKR1B11, H-Ras2, GSTP13 | PharmMapper | Dutta et al., |
| MCDF | GR | PharmMapper | Chitrala and Yeguvapalli, |
| Capsaicin | CA2 | PharmMapper | Ye et al., |
| SID 242078875 | DPP-IV, PTP1B, PEPCK, GSK-3B, GK | PharmMapper | Krishnasamy and Muthusamy, |
| Flavanoid analogs | CDK2 | PharmMapper | Simon et al., |
| Chalcones and chalcone-like compounds | Cysteine proteases | PharmMapper | Gomes et al., |
| 16E-arylideno-nitrogen mustard hybrids(3/4) | GRs | PharmMapper | Acharya et al., |
| Components of CO | ESR1, ESR2, HSD11B1, cortisone reductase | PharmMapper | Wang N. et al., |
| N-substituted tetrahydro-β-carboline imidazolium salt derivatives | MEK-1 | PharmMapper | Liang et al., |
| ASC | AKR1B1, ALB, AR, BACE1, CDK2, F2 | PharmMapper | Zeng et al., |
| GFW compounds | F2, MMP3, CA2, AKR1B, CDK2 | PharmMapper | Zeng et al., |
| Thiadiazole compounds | c-Met | PharmMapper | Meshram et al., |
| Components in SFJD | Multi-targets in ERK pathway | PharmMapper | Li et al., |
| Isoquinoline alkaloids | MIF, ZipA-FtsZ, GAPDH, etc. | Discovery Studio 3.5 | Lei et al., |
| Six GTs | GCN5, CDK2 | Discovery Studio 4.0 | Shao et al., |
| Pinctada fucata oligopeptide | 5HT2A, BACE-1 | Discovery Studio | Chen et al., |
| Tamoxifen | ERRγ | PharmMapper | Liu et al., |
| S-adenosyl-L-homocysteine | Modification methylase TaqI | PharmMapper | Wang X. et al., |
| Kanamycin | APH(2′)-Iva | PharmMapper | Wang X. et al., |
.
Applications of reverse docking in predicting protein targets of small molecules with experimental verification.
| Triptonide1, triptolide2, triptriolide3 | ERα1, 2, 3 | AutoDock 4.2 | Liu et al., |
| c-di-GMP | Human LCN2 protein | DOCK 6 | Li et al., |
| Derivatives of indirubin (6BIO) | PDK1 | GlamDock | Zahler et al., |
| DAPH | Hexokinase | GOLD | Da Matta et al., |
| Apple polyphenols | GMP reductase, GTPase H-ras, HGPRT | idTarget | Scafuri et al., |
| Anti-HIV drugs (Pis, NRTIs) | POLB, TOP1, etc. | INVDOCK | Ji et al., |
| Analgesics | ErbB-2, PLA2, GSH-S | INVDOCK | Pan et al., |
| GAD | EphA7, EB1, PRDX3 | INVDOCK | Yue et al., |
| BBR | p53 | INVDOCK | Lu et al., |
| SB | EGFR | INVDOCK | Feng et al., |
| BBR | HSPA8, ANXA5 | INVDOCK | Lu et al., |
| AGS-IV | CN, ACE, JNK | INVDOCK | Zhao et al., |
| SB | ACE, REN | INVDOCK | Ye et al., |
| WB | CDK2, PAK4, BRaf1 | INVDOCK | Zhang et al., |
| BBR | CaM | INVDOCK | Ma et al., |
| Ophiobolin O | GSK3β | INVDOCK | Lv et al., |
| PRIMA-1 | OSC | Mdock | Grinter et al., |
| Meranzin | COX1, COX2, PPARγ | SELNERGY | Do et al., |
| Tofisopam | PDE4 | SELNERGY | Bernard et al., |
| Anti- | HpPDF1, 2 | TarFisDock | Cai et al., |
| [6]-gingerol | LTA4H | TarFisDock | Jeong et al., |
| 5 of 19 natural products | DPP-IV | TarFisDock | Zhang S. et al., |
| Bezafibrate | CDK2 | TarFisDock | Liu et al., |
| Bicyclol | IMPDHII | TarFisDock | Zhang Y. W. et al., |
| Esculentoside A | CK2 | TarFisDock | Li Y. et al., |
Target prediction confirmed by the literature. Superscript values denotes that the protein targets in the second column correspond to the query molecules in the first column respectively.
Applications of reverse docking in predicting protein targets of small molecules without experimental verification.
| Tea polyphenols | LTA4 hydrolase | Autodock, TarFisDock | Zheng et al., |
| 4 compounds | PBP4 | Autodock Vina | Sarangi et al., |
| Lenalidomide | VEGFR-2, erbB-3, FGFR-4, ABL, p38MAPK, MMP-3 | Autodock | Hu et al., |
| Torcetrapib | PDGFR, HGFR, IL-2, ErbB1 | Discovery Studio | Fan et al., |
| Melamine and cyanuric acid | GPX1, HEXB, LDH, lys C | INVDOCK | Ma et al., |
| PAs | GSTA1, GPX1 | INVDOCK | Yan et al., |
| Icariin | PI3K, AChE | INVDOCK | Cui et al., |
| Dioscin | TOP1 | MDock | Yin et al., |
| Ginsenosides | MEK1, EGFR, thrombin, Aurora A | Schrödinger | Park and Cho, |
| TCDD | MMP8, MMP3, OSC, MPO | TarFisDock | Oliveroverbel et al., |
| Ganoderic acid | HIV-1 proteasein | TarFisDock | Akbar and Yam, |
| Fullerene derivatives | HPRT, BACE1 | TarFisDock | Gupta et al., |
| Alpha lipoic acid | LTA4 hydrolase, VGKC | TarFisDock | Maldonado-Rojas, |
| Aryl-aminopyridine derivatives | CDK2, aurora kinase, KIT receptor | TarFisDock | Erić et al., |
| 4-HT, vitamin E | ER, GST | INVDOCK | Chen and Zhi, |
| ASA1, gentamicin2, ibuprofen3, IDV4, neomycin5, penicillin G6, 4-HT7, vitamin C8 | Antithrombin1, CA12, 5, SULT1E13, IFABP4, GST6, ADH7, alphaamylase8 | INVDOCK | Chen and Ung, |
| Biotin1, 4-HT2, HDPR3, methotrexate4 | Streptavidin1, ERa2, ADA3, DHFR4 | GOLD | Paul et al., |
| ε-viniferin | PDE4 | SELNERGY | Do et al., |
| Vitamin E1, 4-HT2 | AChE1, DHFR2 | TarFisDock | Li et al., |
| DRV1, 6BIO2, N-(4-aminobiphenyl-3-yl)-benzamide3 | HDAC21, HIV-1 PR2, CDK23 | idTarget | Wang et al., |
.
Applications of hybrid screening in predicting protein targets of small molecules.
| Rosemary components (carnosol,CA,UA,RA) | CDK2, MAPK-14, AR', PPARγ | RD&PS | PharmMapper, idTarget | Deshmukh et al., |
| DIP | DPD, Bub1 | RD&PS | PharmMapper, idTarget | Ge et al., |
| CYP450, NMT, GS, CHS | RD&PS | PharmMapper, TarFisDock | Chen et al., | |
| SAA | AR | RD&PS | DRAR-CPI, PharmMapper | Chen and Cui, |
| Curcumin | CDK2 | RD&SS | Schrödinger | Lim et al., |
| Naproxen | PI 3-K | RD&SS | Schrödinger | Kim et al., |
| GV2–20 | CA2 | RD&SS | ROCS, AutoDock | Mori et al., |
| Kinetin | Chitinase | RD&SS | idTarget, ReverseScreen3D | Kumar et al., |
| Glabridin | Braf, MEK1/2 | RD&SS | Schrödinger | Wang Z. et al., |
| Macrocyclic amidinoureas | Chitinase | RD&SS | ROCS, OEDocking | Maccari et al., |
| α-FMH | GST | PS&SS | PharmMapper, ReverseScreen3D | Considine et al., |
| Baicalein | COMT, MAO-B | RD&PS&SS | Schrödinger, ReverseScreen3D | Gao et al., |
| Saffron bioactive ingredients (picrocrocin) | HSP 90-α | RD&PS | PharmMapper, idTarget | Bhattacharjee et al., |
| Danshensu | GTPase Hras | RD&PS | PharmMapper, idTarget | Chen and Ren, |
| Tanshinone IIA | RARα | RD&PS | PharmMapper, AutoDock Vina | Chen, |
| 2-thiazolylimino-5-benzylidin-thiazolidin-4-one | COX2, AChE, AR, THRα | RD&PS | PharmMapper, TarFisDock | Iyer et al., |
| Glycopentalone | CDK-2, VEGFR-2 | RD&PS | AutoDock4.2, PharmMapper | Gurung et al., |
| PGS1, PLMF12, 67DiOHC8S3 | GSTA11, PTPNT12, 3, CBS3 | RD&PS | PharmMapper, DRAR-CPI | Pereira et al., |
| Oxindole pentacyclic alkaloids | DHFR, MDM2 | RD&SS | TarFisDock, ReverseScreen3D | Kozielewicz et al., |
| Quercetin | PARP1 | RD&SS | SHAFTS, idTarget | Carvalho et al., |
| Cardamom bioactive components (eucalyptol) | CASP-3, PKA | PS&SS | PharmMapper, ReverseScreen3D | Bhattacharjee and Chatterjee, |
| Amai alkaloid and pyridine derivatives in maca | AR', CA2, ERα, MAPK14, etc. | PS&SS | Discovery Studio4.5 | Yi et al., |
Target prediction confirmed by the literature. Superscript values denotes that the protein targets in the second column correspond to the query molecules in the first column respectively.
Figure 5The number and trend of applications using the three reverse screening methods and representative software since the year 2000.
Figure 6Twenty-eight representative compounds obtained by the clustering of 57 bioactive compounds for target prediction by different reverse screening methods.