| Literature DB >> 27025271 |
Yael Silberberg1, Martin Kupiec1, Roded Sharan2.
Abstract
Understanding the genetic basis underlying individual responses to drug treatment is a fundamental task with implications to drug development and administration. Pharmacogenomics is the study of the genes that affect drug response. The study of pharmacogenomic associations between a drug and a gene that influences the interindividual drug response, which is only beginning, holds much promise and potential. Although relatively few pharmacogenomic associations between drugs and specific genes were mapped in humans, large systematic screens have been carried out in the yeast Saccharomyces cerevisiae, motivating the constructing of a projection method. We devised a novel approach for the prediction of pharmacogenomic associations in humans using genome-scale chemogenomic data from yeast. We validated our method using both cross-validation and comparison to known drug-gene associations extracted from multiple data sources, attaining high AUC scores. We show that our method outperforms a previous technique, as well as a similar method based on known human associations. Last, we analyze the predictions and demonstrate their biological relevance to understanding drug response.Entities:
Mesh:
Year: 2016 PMID: 27025271 PMCID: PMC4812343 DOI: 10.1038/srep23703
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(A) Algorithmic pipeline: Step 1: using genetic datasets (e.g. PFam), drug datasets (e.g. DrugBank) and chemogenetic datasets (e.g. Lee) to construct gene similarity measurements, drug similarity measurements and HIP/HOP scores respectively. Step 2: Example of how to combine the three similarity measurements (gene similarity, drug similarity and HIP/HOP score) to construct one feature score. Step 3: Generating eight feature scores for each of the three main HIP/HOP data-sources, resulting in three three-dimentional feature matrices. Step 4: Uniting the three feature matrices into one matrix with 24 features. Using true PGx association extracted from PharmGKB as the positive training set, and applying Random Forest classifier to predict PGx associations. (B) Feature construction example, demonstrating step 2 in the algorithmic pipeline. The feature score for a given PGx association between a drug D, and human gene G, is the maximal geometric mean of three measurements, across all drugs and genes in a chemogenomic database. In this example the maximal score (marked in red) is achieved by the geometric means of the three following measurements: (i) the chemical similarity between the query drug and the drug marked in blue (ii) the domain similarity between the human query gene and the yeast gene marked in green and (iii) the HIP chemogenomic association between the drug marked in blue and the yeast knock-out gene marked in green.
A list of the eight features derived from each data source.
| Feature formulation |
|---|
Figure 2Cross validation.
(A) Precision recall graph evaluating cross validation performance, using different sizes of negative sets (B). ROC graph evaluating cross validation performance, using different sizes of negative sets.
Figure 3Performance evaluation.
(A) Areas under the curves (AUC) obtained in a 10-fold cross-validation setting comparing our method (yeast based) to a similar human-based method, omitting the HIP/HOP score from feature construction. (B) AUC scores obtained for 140 joint valid associations in both our method and Hansen’s. (C) Performance evaluation on external data sources: A comparison of our predictions to different pathway categories downloaded from The Small Molecule Pathway Database (SMPDB). (D) A comparison of our prediction to different association types extracted from DrugBank.
Figure 4Distribution of number of pharmacogenes associations per drug.
Drugs predicted to have the highest numbers of associated PGx.
| Drug name | Drug class |
|---|---|
| phenelzine | Antidepressive Agents | Monoamine Oxidase Inhibitors |
| trazodone | Antidepressive Agents, Second-Generation | Anti-Anxiety Agents | Serotonin Uptake Inhibitors |
| fluvoxamine | Antidepressive Agents, Second-Generation | Anti-Anxiety Agents | Serotonin Uptake Inhibitors |
| tacrine | Nootropic Agents | Cholinesterase Inhibitors | Parasympathomimetics |
| l-tryptophan | Antidepressive Agents, Second-Generation |
| amitriptyline | Antidepressive Agents, Tricyclic |
| amineptine | |
| paroxetine | Serotonin Uptake Inhibitors | Antidepressive Agents |
| iron | Trace Elements | Anti-anemic Agents | Supplements |
| tranylcypromine | Antidepressive Agents |
| mirtazapine | Histamine H1 Antagonists | Antidepressive Agents, Tricyclic | Adrenergic alpha-Antagonists |
| iproniazid | |
| nefazodone | Antidepressive Agents, Second-Generation |
| clomipramine | Serotonin Uptake Inhibitors | Antidepressive Agents, Tricyclic |
| duloxetine | |
| protriptyline | Adrenergic Uptake Inhibitors | Antidepressive Agents, Tricyclic |
| minaprine | Antidepressive Agents |
| vilazodone | Serotonin Uptake Inhibitors | Serotonin Receptor Agonists | Antidepressive Agents |
| citalopram | Antidepressive Agents, Second-Generation | Serotonin Uptake Inhibitors | Antidepressive Agents |
| sumatriptan | Vasoconstrictor Agents | Serotonin Antagonists |
| venlafaxine | Antidepressive Agents |
| reboxetine | |
| maprotiline | Antidepressive Agents, Second-Generation | Adrenergic Uptake Inhibitors | Antidepressive Agents |
| nortriptyline | Adrenergic Uptake Inhibitors | Antidepressive Agents, Tricyclic | Antidepressive Agents |
| amoxapine | Antidepressive Agents, Second-Generation | Serotonin Uptake Inhibitors | Adrenergic Uptake Inhibitors |
| doxepin | Adrenergic Uptake Inhibitors | Antidepressive Agents |
Categories extracted from DrugBank and are ‘|’ delimited.
Enriched categories in drugs with few PGx associations.
| Drug category | Hyper geometric p-value | # drugs |
|---|---|---|
| Histamine H1 Antagonists | 7.59E-09 | 22 |
| Contrast Media | 9.21E-09 | 11 |
| Diagnostic Agents | 4.98E-07 | 11 |
| Anti-Inflammatory Agents, Non-Steroidal | 1.26E-06 | 21 |
| Bisphosphonates | 8.06E-06 | 7 |
| Muscle relaxant, Skeletal | 1.12E-05 | 8 |
| Anti-Inflammatory Agents | 2.99E-05 | 24 |
| Anti-Allergic Agents | 5.56E-05 | 13 |
| Histamine Antagonists | 5.56E-05 | 13 |
| Cyclooxygenase Inhibitors | 2.75E-04 | 10 |
| Histamine H1 Antagonists, Non-Sedating | 1.19E-03 | 5 |
| Antiresorptives | 1.25E-03 | 4 |
| Muscle Relaxants, Genitourinary | 1.25E-03 | 4 |
| Neuromuscular Nondepolarizing Agents | 1.25E-03 | 4 |
| Bronchodilator Agents | 1.52E-03 | 10 |
| Oxytocics | 3.51E-03 | 5 |
| Antihypocalcemic Agents | 4.84E-03 | 7 |
| Nicotinic Antagonists | 5.33E-03 | 4 |
| Anti-Incontinence Agents | 6.69E-03 | 3 |
| Cyclooxygenase 2 Inhibitors | 6.69E-03 | 3 |
| Bone Density Conservation Agents | 7.35E-03 | 8 |
| Muscle Relaxants, Central | 8.51E-03 | 6 |
| Indicators and Reagents | 2.30E-02 | 3 |
| Insecticides | 2.30E-02 | 3 |
| Neuromuscular Agents | 2.71E-02 | 4 |
| Carbonic Anhydrase Inhibitors | 3.56E-02 | 2 |
| Expectorants | 3.56E-02 | 2 |
| Hormone Replacement Agents | 3.56E-02 | 2 |
| Leukotriene Antagonists | 3.56E-02 | 2 |
| Muscle Relaxants, Respiratory | 3.56E-02 | 2 |
| Phosphodiesterase 5 Inhibitors | 3.56E-02 | 2 |
| Adrenergic alpha-1 Receptor Antagonists | 4.63E-02 | 4 |
| Anti-Asthmatic Agents | 4.95E-02 | 3 |
Cross validation and external evaluation on filtered set of drugs, removing drugs with high volume of predictions.
| #Drugs removed | Maximal number of PGx predictions per drug | # positive examples in training set | Cross validation | AUC w.r.t. DrugBank associations | |||
|---|---|---|---|---|---|---|---|
| AUC | AUPR | Drug targets | Drug transporters | Drugs enzymes | |||
| 26 | 600 | 1215 | 0.95 | 0.96 | 0.69 | 0.85 | 0.94 |
| 351 | 150 | 479 | 0.93 | 0.94 | 0.64 | 0.81 | 0.92 |
The filtered sets of drugs were evaluated by cross validation, reporting the AUC and AUPR scores. Next PGx predictions were obtained and compared to drug-gene associations extracted from DrugBank. Area under ROC curve for three gold-standard association types is reported.