| Literature DB >> 31638744 |
Hiroaki Iwata1,2, Ryosuke Kojima1, Yasushi Okuno1,2,3.
Abstract
Phenotypic and target-based approaches are useful methods in drug discovery. The phenotypic approach is an experimental approach for evaluating the phenotypic response. The target-based approach is a rational approach for screening drug candidates targeting a biomolecule that causes diseases. These approaches are widely used for drug discovery. However, two serious problems of target deconvolution and polypharmacology are encountered in these conventional experimental approaches. To overcome these two problems, we developed a new in silico method using a probabilistic framework. This method integrates both the phenotypic and target-based approaches to estimate a relevant network from compound to phenotype. Our method can computationally execute target deconvolution considering polypharmacology and can provide keys for understanding the pathway and mechanism from compound to phenotype, thereby promoting drug discovery.Entities:
Keywords: drug discovery; machine learning; phenotypic approach; polypharmacology; target deconvolution; target-based approach
Mesh:
Year: 2019 PMID: 31638744 PMCID: PMC7050533 DOI: 10.1002/minf.201900096
Source DB: PubMed Journal: Mol Inform ISSN: 1868-1743 Impact factor: 3.353
Figure 1The phenotypic and target‐based approaches, target deconvolution, and polypharmacology in drug discovery
Dataset of compound‐protein interactions.
|
|
Number of inter‐ actions |
Number of non‐ interactions |
Number of compounds |
Number of proteins |
|---|---|---|---|---|
|
GPCR |
312,989 |
4,435 |
227,846 |
2033 |
|
kinase |
245,853 |
7,578 |
72,736 |
392 |
|
ion channel |
37,773 |
2,233 |
24,139 |
122 |
|
transporter |
18,392 |
1,268 |
11,561 |
50 |
|
nuclear receptor |
49,619 |
7,763 |
41,857 |
28 |
|
protease |
92,958 |
9,001 |
54,778 |
182 |
Dataset of compound‐phenotype associations.
|
Number of associations |
Number of non‐ associations |
Number of compounds |
Number of phenotypes |
|---|---|---|---|
|
740,147 |
34,219,825 |
900,688 |
548 |
Figure 2Bayesian network representation of the integrating phenotypic and target‐based approaches
Figure 3Workflow of the proposed method. Our proposed method comprises of two steps. In the first step, the method predicts compound‐target protein interactions. In the second step, the method selects target proteins related to a phenotype.
Figure 4ROC curves and AUC scores from the cross‐validation experiments of compound‐protein interactions. (A) The plot shows the ROC curve; the x‐axis indicates false positive rate and the y‐axis indicates true positive rate (blue: GPCR, red: ion channel, green: kinase, purple: protease, light blue: transporter). (B) The results of AUC scores by L1‐ and L2‐regularized logistic regression are shown for each protein family.
Figure 5ROC curves and AUC scores for cross‐validation experiments of compound‐phenotype associations using the proposed method and random method. (A) The histogram shows the AUC scores of 548 phenotypes. The red bars represent results obtained using the proposed method while the blue bars represent results from the random method. The red bars represent results of our method while the gray bars represent results of the random method. The x‐axis indicates AUC scores while the y‐axis indicates the number of phenotypes. (B) The ROC curves for the top 10 results of the AUC scores are shown. The x‐axis indicates false positives rate while the y‐axis indicates true positives rate.
Figure 6AUC scores and the number of features in the cross‐validation experiments using L1‐ and L2‐regularized logistic regression. (A) The histogram shows the number of extracted proteins for each phenotype. The x‐axis indicates the number of proteins while the y‐axis indicates the number of phenotypes. (B) The histogram shows AUC scores for each phenotype. The x‐axis shows the AUC score while the y‐axis indicates the number of phenotypes. The red bars show L1‐regularization while the blue bars show L2‐regularization.
List of significantly enriched GO biological process terms for the predicted protein.
|
Rank |
AUC score |
PubChem AID |
PubChem description |
GO term |
GO description |
FDR |
|---|---|---|---|---|---|---|
|
1 |
0.9250 |
AID1828 |
qHTS for Inhibitors of Plasmodium falciparum proliferation: Summary |
– |
– |
– |
|
2 |
0.8910 |
AID1883 |
qHTS for differential inhibitors of proliferation of Plasmodium falciparum line W2 |
GO:0030154 |
cell differentiation |
4.77E‐02 |
|
3 |
0.8802 |
AID1815 |
qHTS for differential inhibitors of proliferation of Plasmodium falciparum line 7G8 |
GO:0050896 |
response to stimulus |
2.77E‐07 |
|
|
|
|
|
GO:0065007 |
biological regulation |
1.16E‐06 |
|
|
|
|
|
GO:0007154 |
cell communication |
3.94E‐06 |
|
|
|
|
|
GO:0050789 |
regulation of biological process |
1.11E‐05 |
|
|
|
|
|
GO:0032501 |
multicellular organismal process |
1.16E‐05 |
|
|
|
|
|
GO:0035556 |
intracellular signal transduction |
1.18E‐05 |
|
|
|
|
|
GO:0044707 |
single‐multicellular organism process |
1.20E‐05 |
|
|
|
|
|
GO:0009987 |
cellular process |
1.25E‐05 |
|
|
|
|
|
GO:0007165 |
signal transduction |
1.90E‐05 |
|
|
|
|
|
GO:0007274 |
neuromuscular synaptic transmission |
1.71E‐04 |
|
|
|
|
|
GO:0003008 |
system process |
2.03E‐04 |
|
|
|
|
|
GO:0050790 |
regulation of catalytic activity |
2.51E‐04 |
|
|
|
|
|
GO:0007268 |
synaptic transmission |
3.46E‐04 |
|
|
|
|
|
GO:0050877 |
neurological system process |
4.29E‐04 |
|
|
|
|
|
GO:0007267 |
cell‐cell signaling |
4.76E‐04 |
|
|
|
|
|
GO:0065009 |
regulation of molecular function |
7.00E‐04 |
|
|
|
|
|
GO:0009719 |
response to endogenous stimulus |
2.71E‐03 |
|
|
|
|
|
GO:0032502 |
developmental process |
4.10E‐03 |
|
|
|
|
|
GO:0006796 |
phosphate‐containing compound metabolic process |
6.52E‐03 |
|
|
|
|
|
GO:0007270 |
neuron‐neuron synaptic transmission |
1.04E‐02 |
|
|
|
|
|
GO:0000165 |
MAPK cascade |
1.05E‐02 |
|
|
|
|
|
GO:0043066 |
negative regulation of apoptotic process |
1.52E‐02 |
|
|
|
|
|
GO:0006811 |
ion transport |
1.69E‐02 |
|
|
|
|
|
GO:0005977 |
glycogen metabolic process |
2.88E‐02 |
|
|
|
|
|
GO:0042592 |
homeostatic process |
3.94E‐02 |
|
4 |
0.8706 |
AID540 |
Cell Viability – N2a |
– |
– |
– |
|
5 |
0.8657 |
AID1816 |
qHTS for differential inhibitors of proliferation of Plasmodium falciparum line GB4 |
GO:0032501 |
multicellular organismal process |
9.95E‐04 |
|
|
|
|
|
GO:0050789 |
regulation of biological process |
1.34E‐03 |
|
|
|
|
|
GO:0065007 |
biological regulation |
1.43E‐03 |
|
|
|
|
|
GO:0030154 |
cell differentiation |
1.71E‐03 |
|
|
|
|
|
GO:0019233 |
sensory perception of pain |
1.74E‐03 |
|
|
|
|
|
GO:0043066 |
negative regulation of apoptotic process |
2.06E‐03 |
|
|
|
|
|
GO:0044707 |
single‐multicellular organism process |
2.74E‐03 |
|
|
|
|
|
GO:0035556 |
intracellular signal transduction |
2.77E‐03 |
|
|
|
|
|
GO:0050790 |
regulation of catalytic activity |
3.64E‐03 |
|
|
|
|
|
GO:0065009 |
regulation of molecular function |
7.14E‐03 |
|
|
|
|
|
GO:0050896 |
response to stimulus |
7.61E‐03 |
|
|
|
|
|
GO:0050909 |
sensory perception of taste |
1.02E‐02 |
|
|
|
|
|
GO:0007165 |
signal transduction |
1.72E‐02 |
|
|
|
|
|
GO:0007154 |
cell communication |
3.37E‐02 |
|
|
|
|
|
GO:0006812 |
cation transport |
3.39E‐02 |
|
|
|
|
|
GO:0000165 |
MAPK cascade |
3.57E‐02 |
|
6 |
0.8605 |
AID543 |
Cell Viability – H‐4‐II‐E |
– |
– |
– |
|
7 |
0.8560 |
AID988 |
Cell Viability – LYMP2‐024 |
– |
– |
– |
|
8 |
0.8460 |
AID426 |
Cell Viability – Jurkat |
– |
– |
– |
|
9 |
0.8459 |
AID1882 |
qHTS for differential inhibitors of proliferation of Plasmodium falciparum line Dd2 |
GO:0065007 |
biological regulation |
6.16E‐09 |
|
|
|
|
|
GO:0050789 |
regulation of biological process |
8.79E‐09 |
|
|
|
|
|
GO:0032501 |
multicellular organismal process |
1.09E‐05 |
|
|
|
|
|
GO:0044707 |
single‐multicellular organism process |
1.24E‐05 |
|
|
|
|
|
GO:0007154 |
cell communication |
1.37E‐05 |
|
|
|
|
|
GO:0009987 |
cellular process |
2.96E‐05 |
|
|
|
|
|
GO:0003008 |
system process |
1.16E‐04 |
|
|
|
|
|
GO:0050896 |
response to stimulus |
6.51E‐04 |
|
|
|
|
|
GO:0035556 |
intracellular signal transduction |
8.86E‐04 |
|
|
|
|
|
GO:0007165 |
signal transduction |
1.21E‐03 |
|
|
|
|
|
GO:0050877 |
neurological system process |
1.25E‐03 |
|
|
|
|
|
GO:0019229 |
regulation of vasoconstriction |
2.15E‐03 |
|
|
|
|
|
GO:0009719 |
response to endogenous stimulus |
5.18E‐03 |
|
|
|
|
|
GO:0007274 |
neuromuscular synaptic transmission |
5.45E‐03 |
|
|
|
|
|
GO:0032502 |
developmental process |
1.04E‐02 |
|
|
|
|
|
GO:0006796 |
phosphate‐containing compound metabolic process |
1.45E‐02 |
|
|
|
|
|
GO:0001525 |
angiogenesis |
1.46E‐02 |
|
|
|
|
|
GO:0008015 |
blood circulation |
1.52E‐02 |
|
|
|
|
|
GO:0000165 |
MAPK cascade |
1.61E‐02 |
|
|
|
|
|
GO:0050790 |
regulation of catalytic activity |
1.94E‐02 |
|
|
|
|
|
GO:0043066 |
negative regulation of apoptotic process |
2.02E‐02 |
|
|
|
|
|
GO:0007268 |
synaptic transmission |
2.31E‐02 |
|
|
|
|
|
GO:0007267 |
cell‐cell signaling |
2.33E‐02 |
|
|
|
|
|
GO:0006874 |
cellular calcium ion homeostasis |
2.70E‐02 |
|
|
|
|
|
GO:0005977 |
glycogen metabolic process |
3.26E‐02 |
|
|
|
|
|
GO:0065009 |
regulation of molecular function |
3.62E‐02 |
|
|
|
|
|
GO:0007169 |
transmembrane receptor protein tyrosine kinase signaling pathway |
4.90E‐02 |
|
|
|
|
|
GO:0042592 |
homeostatic process |
4.93E‐02 |
|
10 |
0.8456 |
AID1877 |
qHTS for differential inhibitors of proliferation of Plasmodium falciparum line D10 |
GO:0050789 |
regulation of biological process |
5.79E‐07 |
|
|
|
|
|
GO:0032501 |
multicellular organismal process |
7.06E‐07 |
|
|
|
|
|
GO:0044707 |
single‐multicellular organism process |
8.95E‐07 |
|
|
|
|
|
GO:0065007 |
biological regulation |
1.58E‐06 |
|
|
|
|
|
GO:0003008 |
system process |
1.87E‐04 |
|
|
|
|
|
GO:0050877 |
neurological system process |
4.55E‐04 |
|
|
|
|
|
GO:0043066 |
negative regulation of apoptotic process |
1.82E‐03 |
|
|
|
|
|
GO:0007154 |
cell communication |
3.95E‐03 |
|
|
|
|
|
GO:0035556 |
intracellular signal transduction |
6.14E‐03 |
|
|
|
|
|
GO:0019233 |
sensory perception of pain |
6.16E‐03 |
|
|
|
|
|
GO:0050896 |
response to stimulus |
1.04E‐02 |
|
|
|
|
|
GO:0030154 |
cell differentiation |
1.31E‐02 |
|
|
|
|
|
GO:0050790 |
regulation of catalytic activity |
1.41E‐02 |
|
|
|
|
|
GO:0007268 |
synaptic transmission |
1.61E‐02 |
|
|
|
|
|
GO:0007165 |
signal transduction |
2.28E‐02 |
|
|
|
|
|
GO:0065009 |
regulation of molecular function |
2.65E‐02 |
|
|
|
|
|
GO:0032502 |
developmental process |
3.38E‐02 |
|
|
|
|
|
GO:0050909 |
sensory perception of taste |
4.46E‐02 |