| Literature DB >> 35213968 |
Kai Zhao1, Yujia Shi1, Hon-Cheong So1,2,3,4,5,6,7.
Abstract
Identification of the correct targets is a key element for successful drug development. However, there are limited approaches for predicting drug targets for specific diseases using omics data, and few have leveraged expression profiles from gene perturbations. We present a novel computational approach for drug target discovery based on machine learning (ML) models. ML models are first trained on drug-induced expression profiles with outcomes defined as whether the drug treats the studied disease. The goal is to "learn" the expression patterns associated with treatment. Then, the fitted ML models were applied to expression profiles from gene perturbations (overexpression (OE)/knockdown (KD)). We prioritized targets based on predicted probabilities from the ML model, which reflects treatment potential. The methodology was applied to predict targets for hypertension, diabetes mellitus (DM), rheumatoid arthritis (RA), and schizophrenia (SCZ). We validated our approach by evaluating whether the identified targets may 're-discover' known drug targets from an external database (OpenTargets). Indeed, we found evidence of significant enrichment across all diseases under study. A further literature search revealed that many candidates were supported by previous studies. For example, we predicted PSMB8 inhibition to be associated with the treatment of RA, which was supported by a study showing that PSMB8 inhibitors (PR-957) ameliorated experimental RA in mice. In conclusion, we propose a new ML approach to integrate the expression profiles from drugs and gene perturbations and validated the framework. Our approach is flexible and may provide an independent source of information when prioritizing drug targets.Entities:
Keywords: drug repurposing; drug target; expression profiling; gene perturbation; machine learning
Year: 2022 PMID: 35213968 PMCID: PMC8878225 DOI: 10.3390/pharmaceutics14020234
Source DB: PubMed Journal: Pharmaceutics ISSN: 1999-4923 Impact factor: 6.321
Enrichment test of the predicted targets for HT (enrichment for targets listed in OpenTargets).
| Threshold | SVM | RF | GBM | EN |
|---|---|---|---|---|
| 1 |
|
| 9.81 × 10−2 |
|
| 0.8 |
|
| 8.29 × 10−2 |
|
| 0.6 |
|
| 1.76 × 10−2 |
|
| 0.4 |
|
| 1.60 × 10−2 |
|
| 0.2 |
|
| 2.91 × 10−2 |
|
| 0 | 1.56 × 10−1 | 6.84 × 10−1 | 1.96 × 10−1 | 1.12 × 10−1 |
Enrichment p-values are shown. Four machine learning methods were used to train a model on expression data to predict treatment potential, and the model was fitted to expression profiles after gene perturbation. The false discovery rate (FDR) approach was employed to correct for multiple testing. p-values with corresponding FDR < 0.05 are in bold, while p-values with corresponding FDR between 0.05 and 0.1 are in italics. The first column is the threshold of the ‘relevance’ score (available from OpenTargets) above which we defined a gene as a drug ‘target’. Enrichment was tested against the targets listed in the OpenTargets database, which is the same as below. SVM: support vector machines; EN: logistic regression with elastic net regularization; RF: random forest; GBM, gradient boosted machines. For Table 1, Table 2, Table 3, Table 4 and Table 5, the predicted targets are based on expression data from overexpression (OE) experiments. The results from KD data are listed in Table S6.
Enrichment test of the predicted targets for DM.
| Threshold | SVM | RF | GBM | EN |
|---|---|---|---|---|
| 1 | 5.78 × 10−2 | 6.53 × 10−2 |
|
|
| 0.8 |
| 2.13 × 10−2 |
|
|
| 0.6 |
| 1.37 × 10−2 |
|
|
| 0.4 |
| 4.32 × 10−2 |
|
|
| 0.2 | 6.43 × 10−1 | 3.67 × 10−1 |
|
|
| 0 | 3.00 × 10−1 | 6.24 × 10−1 | 8.10 × 10−1 | 8.21 × 10−2 |
Enrichment test of the predicted targets for RA.
| Threshold | SVM | RF | GBM | EN |
|---|---|---|---|---|
| 1 | 1.18 × 10−1 |
|
| 9.22 × 10−1 |
| 0.8 | 1.36 × 10−1 |
|
| 9.94 × 10−1 |
| 0.6 | 1.18 × 10−1 | 1.41 × 10−1 | 3.67 × 10−1 | 9.20 × 10−1 |
| 0.4 | 6.44 × 10−1 |
|
| 2.69 × 10−1 |
| 0.2 | 3.12 × 10−1 | 8.47 × 10−2 | 4.15 × 10−2 | 8.48 × 10−2 |
| 0 | 3.71 × 10−1 | 7.01 × 10−1 | 1.96 × 10−1 | 2.56 × 10−1 |
Enrichment test of the predicted targets for SCZ (for targets of SCZ listed in OpenTargets).
| Threshold | SVM | RF | GBM | EN |
|---|---|---|---|---|
| 1 | 3.32 × 10−1 | 2.84 × 10−1 | 2.57 × 10−1 | 3.56 × 10−1 |
| 0.8 | 4.66 × 10−1 | 2.47 × 10−1 | 2.64 × 10−1 | 2.80 × 10−1 |
| 0.6 | 2.18 × 10−2 | 7.78 × 10−1 | 9.94 × 10−1 | 7.47 × 10−1 |
| 0.4 | 1.91 × 10−2 | 8.62 × 10−1 | 7.97 × 10−1 | 3.84 × 10−1 |
| 0.2 | 7.00 × 10−2 | 8.97 × 10−1 | 5.42 × 10−1 | 7.18 × 10−1 |
| 0 | 7.11 × 10−1 | 1.85 × 10−1 | 9.01 × 10−1 | 3.14 × 10−1 |
Enrichment test of the predicted targets for SCZ (for targets of bipolar disorder listed in OpenTargets).
| Threshold | SVM | RF | GBM | EN |
|---|---|---|---|---|
| 1 | 4.14 × 10−1 | 7.24 × 10−1 | 5.88 × 10−1 | 5.59 × 10−1 |
| 0.8 | 4.66 × 10−1 | 7.58 × 10−1 | 9.43 × 10−1 | 9.43 × 10−1 |
| 0.6 | 8.57 × 10−1 | 6.84 × 10−1 | 2.56 × 10−1 | 2.90 × 10−2 |
| 0.4 | 8.57 × 10−1 | 6.84 × 10−1 | 2.56 × 10−1 | 2.90 × 10−2 |
| 0.2 | 4.13 × 10−1 | 3.17 × 10−1 | 6.03 × 10−2 |
|
| 0 | 1.31 × 10−2 | 7.97 × 10−1 | 1.91 × 10−1 | 4.76 × 10−1 |
Literature support of selected drug target candidates.
| Potential Target | Disease | Direction of Expression Associated with Treatment Effect (as Predicted) | Literature Support/Functional Relevance |
|---|---|---|---|
| DRD1 | SCZ | up | Insufficient D1 receptor signaling may be associated with cognitive deficits; D1 agonist has been tested in a clinical trial for cognitive symptoms in SCZ, with moderate improvement in some cognitive tasks observed |
| HIF1AN | SCZ | down | Hypoxia may play a role in SCZ by affecting neurodevelopment; genetic studies showed that HIFs may be involved in SCZ |
| ADCY9 | SCZ | up | Involved in glutamate and GABA neurotransmission; |
| NDUFS4 | SCZ | down | An SNP close to |
| SMAD7 | RA | up | |
| TGFBR2 | RA | up | Linked to resistance of methotrexate treatment and non-responsive patient had reduced expression of the gene in regulatory T cells; hypermethylation (associated with decreased expression) found in RA samples |
| FGFR10P | RA | down | An LD block containing this gene was found in the GWAS of RA and other autoimmune conditions |
| PSMB8 | RA | down | Directly supported by experimental evidence from animal studies: treatment with a PSMB8 inhibitor (PR-957) ameliorated experimental RA in mice. |
| IL-21R | RA | down | IL-21 receptor expression on B cells contributed to collagen-induced arthritis in animal studies; berberine inhibits IL-21/IL-21R-dependent autophagy and has been suggested as a treatment for RA |
| LTBR | RA | down | A phase-1 RCT showed that pateclizumab (a drug that inhibits LTα1β2–LTβR interactions) led to a reduction in RA clinical activity compared to placebo, although no statistically significant difference was shown in a phase 2 trial |
| NR0B2 | DM | up | Mutations (associated with reduced activities) in the gene associated with DM; inhibitory effect of metformin on hepatic gluconeogenesis may be mediated through expression of |
| Fos | DM | up | Insulin induced |
| QPRT | DM | up | Expression of |
| MAGED1 | DM | up | MAGED1-deficient mice showed hyperphagia and reduced motor activity, which is associated with obesity (shown by two animal studies). |
| PPP2R1A | DM | up | Encodes a regulatory subunit of PP2A; podocyte-specific loss of PP2A worsened diabetic glomerulopathy and accelerated the progression of diabetic kidney disease; interacts with IRS1 (Insulin receptor substrate 1), which is a key mediator of insulin signal transduction implicated in Type 2 DM |
| TBK1 | DM | down | TBK1 is expressed primarily in beta cells of mammalian islets; inhibition of TBK1/IKKε (IκB kinase ε) led to increased β-cell regeneration |
| TCF7L2 | HT | up | A well-established susceptibility gene for DM found in GWAS (DM and HT are highly comorbid and may share common pathways); genetic association studies showed associations of SNPs in |
| ATP5A1 | HT | up | Reduced expression in HT rats; network analysis showed that actions of a Chinese drug on HT may be mediated through this target |
| FADD | HT | down | Cohort studies reported that a high plasma level of FADD was associated with increased incidence of coronary events and ischemic stroke |
| NFE2L2 | HT | up | A selective Nrf2 activator (tBHQ) significantly reduced mean arterial pressure, plasma norepinephrine levels, and sympathetic nerve activities in hypertensive rats; tBHQ also reduced levels of reactive oxygen species and decreased inflammatory cytokine release in the periventricular nucleus (PVN) |
RCT, randomized controlled trial.; tBHQ, tert-butylhydroquinone.