| Literature DB >> 30952858 |
Phuong A Nguyen1, David A Born1, Aimee M Deaton1, Paul Nioi1,2, Lucas D Ward3,4.
Abstract
Only a small fraction of early drug programs progress to the market, due to safety and efficacy failures, despite extensive efforts to predict safety. Characterizing the effect of natural variation in the genes encoding drug targets should present a powerful approach to predict side effects arising from drugging particular proteins. In this retrospective analysis, we report a correlation between the organ systems affected by genetic variation in drug targets and the organ systems in which side effects are observed. Across 1819 drugs and 21 phenotype categories analyzed, drug side effects are more likely to occur in organ systems where there is genetic evidence of a link between the drug target and a phenotype involving that organ system, compared to when there is no such genetic evidence (30.0 vs 19.2%; OR = 1.80). This result suggests that human genetic data should be used to predict safety issues associated with drug targets.Entities:
Mesh:
Year: 2019 PMID: 30952858 PMCID: PMC6450952 DOI: 10.1038/s41467-019-09407-3
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Data sources and analysis workflow
Fig. 2Rate of side effect (SE) manifestation across drug-SE combinations classified by genetic support. OR, odds ratio; P, P-value from Fisher’s exact test (two-tailed). The left panel shows analyses based on all 38,199 drug-SE combinations; the right panel shows analyses based on 33,489 “off-indication” drug-SE combinations. Error bars represent the 95% confidence interval of the reported proportions. Source data and confidence intervals for the OR values are shown in Supplementary Table 1. Similar analyses for Mendelian and GWAS genetics separately are shown in Supplementary Figure 4
Analysis of enrichment between genetic phenotypes and side effects in individual organ systems
| Phenotype category | Drugs with genetics and side effect | Drugs with genetics and no side effect | Drugs with no genetics and side effect | Drugs with no genetics and no side effect | Rate of side effect when genetic association exists | Rate of side effect when genetic association does not exist | OR |
|
|---|---|---|---|---|---|---|---|---|
| Blood | 96 | 269 | 234 | 1220 | 26.3% | 16.1% | 1.86 | 1.34E−05 |
| Heart | 145 | 176 | 451 | 1047 | 45.2% | 30.1% | 1.91 | 3.92E−07 |
| Congenital | 17 | 572 | 22 | 1208 | 2.9% | 1.8% | 1.63 | 0.165 |
| Ear | 5 | 112 | 74 | 1628 | 4.3% | 4.3% | 0.98 | 1.00 |
| Endocrine | 97 | 252 | 177 | 1293 | 27.8% | 12.0% | 2.81 | 3.71E−12 |
| Eye | 56 | 256 | 193 | 1314 | 17.9% | 12.8% | 1.49 | 0.0186 |
| Gastrointestinal | 155 | 276 | 497 | 891 | 36.0% | 35.8% | 1.01 | 0.954 |
| Hepatobiliary | 20 | 153 | 160 | 1486 | 11.6% | 9.7% | 1.21 | 0.423 |
| Immune | 113 | 335 | 251 | 1120 | 25.2% | 18.3% | 1.51 | 1.76E−03 |
| Infection | 64 | 104 | 536 | 1115 | 38.1% | 32.5% | 1.28 | 0.144 |
| Metabolism | 228 | 400 | 280 | 911 | 36.3% | 23.5% | 1.85 | 1.34E−08 |
| Musculoskeletal | 217 | 435 | 322 | 845 | 33.3% | 27.6% | 1.31 | 0.0118 |
| Neoplasm | 48 | 305 | 122 | 1344 | 13.6% | 8.3% | 1.73 | 3.10E−03 |
| Nervous | 343 | 421 | 401 | 654 | 44.9% | 38.0% | 1.33 | 3.73E−03 |
| Pregnancy | 7 | 268 | 35 | 1509 | 2.5% | 2.3% | 1.13 | 0.827 |
| Mental | 208 | 350 | 255 | 1006 | 37.3% | 20.2% | 2.34 | 5.16E−14 |
| Urologic | 75 | 245 | 220 | 1279 | 23.4% | 14.7% | 1.78 | 2.20E−04 |
| Reproductive | 75 | 199 | 186 | 1359 | 27.4% | 12.0% | 2.75 | 5.75E−10 |
| Respiratory | 140 | 211 | 464 | 1004 | 39.9% | 31.6% | 1.44 | 3.68E−03 |
| Skin | 122 | 224 | 413 | 1060 | 35.3% | 28.0% | 1.40 | 8.75E−03 |
| Vascular | 281 | 299 | 436 | 803 | 48.4% | 35.2% | 1.73 | 8.31E−08 |
|
|
|
|
|
|
|
|
|
|
P-value is from Fisher’s exact test (two-sided). Source data and confidence intervals are provided in Supplementary Table 2
OR odds ratio
Fig. 3Summary of side effect modeling results. Left: results of raw enrichment analysis (source data from Supplementary Table 2). Center: coefficients of the Mendelian genetics predictors in each of the 18 side effect models (source data from Supplementary Table 3). Right: coefficients of the GWAS genetics predictors in each of the 18 side effect models (source data from Supplementary Table 3). Coefficients from the regression models were exponentiated to obtain odds ratios. Along each row, circular points indicate odds ratios from the glm models; error bars are 95% confidence intervals from the glm models; large eight-pointed stars indicate odds ratios when the genetics predictors are selected as predictors in the glmnet models (which are built using feature selection, lasso regularization, and cross-validation). Small asterisks offset from the data indicate P-values of the the Fisher’s exact test (left) and P-values of the coefficients from the glm models (center and right) (*P < 0.05; **P < 0.01; ***P < 0.001)
Fig. 4Results of cross validation analysis of contribution of genetics to predictive modeling. Gray bars show distribution of leave-one-target-set-out cross-validation AUC values from 1000 permutations of the genetic data; dashed line shows the cross-validation AUC value for a model omitting genetic information entirely; red line shows the cross-validation AUC of the regression model with real genetic data. The P-values are derived from the frequency of permutation runs that exceed the AUC of the true model. Source data are provided in Supplementary Table 4