Literature DB >> 30346593

Rationalizing Secondary Pharmacology Screening Using Human Genetic and Pharmacological Evidence.

Aimee M Deaton1, Fan Fan1, Wei Zhang1, Phuong A Nguyen1, Lucas D Ward1, Paul Nioi1.   

Abstract

Safety-related drug failures remain a major challenge for the pharmaceutical industry. One approach to ensuring drug safety involves assessing small molecule drug specificity by examining the ability of a drug candidate to interact with a panel of "off-target" proteins, referred to as secondary pharmacology screening. Information from human genetics and pharmacology can be used to select proteins associated with adverse effects for such screening. In an analysis of marketed drugs, we found a clear relationship between the genetic and pharmacological phenotypes of a drug's off-target proteins and the observed drug side effects. In addition to using this phenotypic information for the selection of secondary pharmacology screens, we also show that it can be used to help identify drug off-target protein interactions responsible for drug-related adverse events. We anticipate that this phenotype-driven approach to secondary pharmacology screening will help to reduce safety-related drug failures due to drug off-target protein interactions.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30346593      PMCID: PMC6358245          DOI: 10.1093/toxsci/kfy265

Source DB:  PubMed          Journal:  Toxicol Sci        ISSN: 1096-0929            Impact factor:   4.849


Unintended adverse side effects remain a major reason for clinical trial failure and postmarketing drug withdrawal (Hamon ; Kola and Landis, 2004; Roberts ). One approach used to improve drug safety is to ensure specificity of the drug candidate against the intended target or, at the very least, to eliminate off-target interactions likely to cause adverse events. To help ensure small molecule drug specificity, pharmaceutical companies often perform in vitro secondary pharmacology screening whereby a compound is assessed for its ability to bind to and/or modulate a variety of off-target proteins (Bowes ; Whitebread ). Currently, regulatory authorities do not mandate any specific secondary pharmacology screening, except for testing whether drugs inhibit the Kv11.1 or “hERG” potassium channel (encoded by KCNH2) (ICH, 2005), a known mechanism through which drugs can cause cardiac arrhythmias (Hamon ). Knowing what proteins to select for a counter screen is difficult and requires knowledge of a given protein’s function, and the consequences of it being inadvertently “drugged” in humans. In many ways this challenge is similar to that faced by drug developers when trying to determine what proteins to target to treat disease; understanding the outcome of drugging a certain target is difficult and often is not determined until the completion of lengthy clinical trials. Indeed, the FDA has highlighted the fact that proteins included on counter screens often lack a clear link to human safety (Papoian ) although when four pharmaceutical companies compared their secondary pharmacology screens there was some overlap in terms of the proteins included (Bowes ). To tackle these issues, there are two sources of data that provide insight into the function and consequence of drugging specific proteins in humans; pharmacology; and human genetics. Pharmacology provides knowledge of drug indications which reveal the phenotypic effects of functionally modulating a drug target, but this is limited by the small number of human proteins that have been successfully drugged (672 out of approximately 20 000) (Ponomarenko ). Genetic evidence can also be used to predict the effects of drugging a protein and is increasingly being used in the pharmaceutical industry to select therapeutic targets (Kamb ; Plenge ). Prior work has established genetics as a way of identifying drug targets both in large systematic analyses (Nelson ) and for individual drug targets such as PCSK9 (Abifadel ; Ference ; Hess ). Genetics could be used in a similar manner to select “off-target” proteins that we do not want drugs to modulate by helping us identify proteins whose modulation would have safety concerns. For example, loss-of-function variants in KCNH2 (encoding the hERG channel) cause long QT syndrome (Curran ) consistent with the fact that pharmacological inhibition of hERG can result in cardiac arrhythmia. Similarly, loss-of-function variants in CTSD (encoding cathepsin D) cause neuronal ceroid lipofuscinosis, a retinal disease, mirroring retinal phenotypes observed in animals administered drugs that inadvertently inhibited cathepsin D (Siintola ; Steinfeld ; Zuhl ). For these examples, the relationship between the genetics of the off-target protein and drug side effects is well established. In this study, we examined whether genetics, as well as pharmacology, can be used more broadly to help predict drug side effects resulting from drug off-target protein interactions. To do this, we systematically compared the off-target drug-protein interactions and the observed side effects of a set of marketed drugs. Layering in known genetic associations and drug indications for these off-target proteins allowed us demonstrate a clear relationship between these phenotypes and drug side effects. Our findings have implications for the selection of small molecule secondary pharmacology screens as well as for the retrospective identification of off-target proteins contributing to drug side effects.

MATERIALS AND METHODS

Data Download and Processing

Drug Side Effect and Target Information

The following pharmacology databases were downloaded: DrugBank (v5.0.6, r2017-04-01; http://www.drugbank.ca; last accessed April 10, 2017) (Knox ; Law ; Wishart , 2008), SIDER (v4.1, r2015-10-21; http://sideeffects.embl.de/; last accessed January 14, 2016) (Kuhn , 2016), OFFSIDES (r2012-03-14; http://tatonettilab.org/resources/tatonetti-stm.html; last accessed June 23, 2016) (Tatonetti ), and Pharmaprojects, Pharma Intelligence 2016 (d2016-11-22; https://pharmaintelligence.informa.com/; last accessed November 22, 2016). Side effects of marketed drugs were obtained from the SIDER database that lists side effects reported on drug labels (Kuhn , 2016). Side effect phenotypes were mapped to MedDRA terminology to allow systematic comparison between phenotype terms from different sources (see below). Side effects seen in patients treated with a placebo were removed. To get high-confidence side effects we only considered side effects also present in postmarketing physicians’ reports (within the FDA Adverse Event Reporting System, or FAERS) that were statistically significant, as determined from the OFFSIDES database (Tatonetti ); p values of drug-side effect associations were used to impose a 5% false discovery rate (Benjamini and Hochberg, 1995). Side effects belonging to the “General disorders and administration site conditions” MedDRA category were removed as these were likely to be common side effects associated with drug treatment generally rather than side effects due to specific off-target interactions. The indications of these drugs were obtained from Pharmaprojects. All proteins that interact with the set of drugs extracted from SIDER (including both intended targets and “off-targets”) were identified using Prous Institute Symmetry and Chemotargets CLARITY (http://www.chemotargets.com), which integrate carefully selected data on compound-target interactions from literature, patent applications, and both publically accessible and commercial databases (Excelra GOSTAR). Bioinfogate’s safety intelligence portal, OFF-X (http://www.targetsafety.info), was also used in the process. From this set of drug-protein interaction pairs, the therapeutic drug-target pairs were identified using Drugbank (Knox ) with the remainder of interactions classified as “off-target.” Interactions with a number of cytochrome P450 enzymes were excluded to rule out interactions related to drug metabolism. Altogether, we examined 618 drugs with side effect and target information in the enrichment analysis. For the logistic regression we limited ourselves to small molecule drugs using Drugbank as the source of modality information and analyzed 587 drugs.

Drug target phenotype information

The phenotypic associations of these drugs’ targets were obtained from both human genetics and pharmacology. To get genetic information, genes involved in Mendelian traits curated in Online Mendelian Inheritance in Man (OMIM) were obtained from the Human Phenotype Ontology (HPO) (r2017-04-13; http://human-phenotype-ontology.github.io; last accessed June 7, 2017) (Kohler ). For each OMIM syndrome, we considered only phenotype terms defined as frequently associated with the disease by HPO (“frequent frequencies”) giving 3159 genes associated with 4056 OMIM syndromes, each of which was described by a list of HPO phenotype terms. If the protein was the intended target of an approved drug, pharmacological phenotypes were assigned using the indication of that drug obtained from Pharmaprojects. Where there were different variants in the same gene with varying disease associations or drugs with different indications modulating the same protein we took the union of all phenotypes to make sure we considered all potential safety risks that could be caused by modulating a particular target.

Phenotype Mapping and Comparison

To allow comparison between side effect terms and phenotypes from pharmacology and genetics we mapped all phenotypes to MedDRA terminology using the Unified Medical Language System (UMLS) Metathesaurus (2015AB release), MetaMap natural language processing (NLP) tool (MetaMap2016, r2016-01), and the UMLS-Interface software (https://www.nlm.nih.gov/research/umls/; last accessed May 9, 2016) (McInnes ). All phenotype terms were mapped to the most specific MedDRA term within the UMLS which was in turn mapped to MedDRA Higher Level Group Terms (HLGT) and System Organ Class (SOC) terms. Similarity scores for all MedDRA terms were calculated using the UMLS-Similarity modules from the UMLS-Interface software. For the enrichment analysis we defined a similarity score between two terms of greater than or equal to 0.7 as a “phenotype match.”

Enrichment Analysis

We annotated phenotype matches (as defined above) between drug side effects and off-target phenotypes obtained from genetics, pharmacology, or both. To test for which phenotypes off-target genetics/pharmacology tended to reflect drug side effects we performed an enrichment analysis using 230 MedDRA HLGT terms (ie, all HLGT terms for which there were side effects in our dataset). First, we removed all side effect terms that mapped to the same SOC term as the drug indication to avoid side effects likely to represent exaggerated pharmacology (eg, drugs with cardiac indications were removed when we considered cardiac arrhythmia as a side effect). Second, we excluded drugs where the intended target had genetic or pharmacological evidence for the phenotype reasoning that such effects were more likely to be modulated by the intended target rather than an off-target interaction. We then recorded the number of drugs that had a matching side effect and off-target phenotype (ie, the drug side effect matched the phenotype of at least one of the drug’s off-targets), absence of the side effect and absence of a matching off-target phenotype, presence of side effect but absence of a matching off-target phenotype, and absence of a side effect but presence of an off-target phenotype. Drugs with phenotype matches with the intended target were removed from the analysis. We constructed 2 × 2 contingency tables containing the number of drugs fulfilling each of these criteria and calculated an odds ratio and p value for each HLGT term using a two-sided Fisher’s exact test (Agresti, 2002; Fisher, 1935) “fisher.test” in the R “stats” package (R version 3.4.2). Fisher’s exact test was chosen to be robust to small sample sizes in certain contingency tables (Kim, 2017; Ludbrook, 2008). For instances where there were zero values in the contingency table (ie, when no drugs matched the criteria) these were assigned a pseudocount of one to avoid infinite or zero odds ratio values. We corrected our significance threshold for multiple testing using the Bonferroni method which adjusts the p value based on the number of tests performed (Bland and Altman, 1995). In this instance, we examined 618 drugs over each of 230 phenotypes giving a total of 1.4 × 105 tests performed. We considered a p value of < 3.5 × 10−7 as significant, which is equivalent to an adjusted p value < .05.

Logistic Regression

To assess the correlation between off-target phenotypes (from genetics and pharmacology) and the side effect profile of a drug, we performed a multivariate logistic regression (using the “glm” function in the R “stats” package) (R version 3.4.2). Out of the 46 MedDRA HLGT phenotype terms significant in the enrichment analysis, 44 had a sufficient number of drugs with that side effect to build a model. The logistic regression model for each of these phenotypes used disease indication (21 MedDRA SOC or organ system level terms), whether the intended targets have genetic evidence matching that phenotype, and whether the off-targets have evidence for the phenotype as predictors of drug side effect. All predictors were encoded as binary variables.

Deep Neural Network Modeling of ADRA2B Activity

The R “deepnet” package version 2.0 (Warr, 2012) was used to generate a categorical deep neural network (DNN) model to predict whether a compound can bind to ADRA2B. This DNN model was trained using compounds derived from CHEMBL database (version 23, last accessed 2017-09-22) with known activities against ADRA2B (Bento ). Compounds marked as “not active” in CHEMBL were collected as “inactives” for training set. To avoid the potential discrepancy between different assays, only compounds with confirmed IC50 values (between 0.52 nM and 31.88 μM) in radioligand ADRA2B binding assays were used for the “active” set. A total of 824 structurally unique CHEMBL compounds, including 720 “inactives” and 104 “actives,” were used to build the model. The 824 CHEMBL compounds were randomly separated into a training set (75%, 641 compounds) and a testing set (25%, 183 compounds). The model was first generated based on the training set, and then validated using the testing set. The default neural net parameters and multiple compound properties, including FCFP_4 fingerprints, AlogP, molecular weight, number of fragment, molecular polar surface area, molecular solubility, number of H donors, number of H acceptors, number of aromatic rings, etc. were used for model training. The generated DNN model reached a high predictive accuracy of 98.5% (632 of 641) for the training set, with 91.6% sensitivity of (76 of 83) and 99.6% specificity (556 of 558). Cross-validation using the testing set resulted a comparable accuracy of 95.6% (175 of 183), with 76.2% sensitivity (16 of 21) and 98.1% specificity (159 of 162). After the DNN model was validated using the test set of compounds, it was applied to predict the activities of additional compounds not known to bind ADRA2B. From SIDER, 59 marketed drugs with seizure as a side effect whose intended targets did not include ADRA2B and that had no ADRA2B interaction documented in Symmetry were collected. Drugs containing mixtures of compounds were removed leaving 55 drugs. Out of the 55 drugs, 16 already had ADRA2B binding data in CHEMBL with 5 of these drugs showing binding to the receptor. For the 39 remaining drugs, their activity against ADRA2B was predicted using the DNN model.

Selection of Targets for Secondary Pharmacology Screen

Phenotypes were assigned to all human proteins using genetic and pharmacological evidence as described above. Using MedDRA terms, proteins with phenotypes affecting the cardiovascular, respiratory, and nervous systems were selected (SOC terms “Cardiac disorders,” “Vascular disorders,” “Respiratory, thoracic, and mediastinal disorders,” “Nervous system disorders”) giving 2542 human proteins. To construct a selectivity screen with immediate application in the pharmaceutical industry we identified which of these proteins currently have in vitro assays available from major suppliers (CEREP, Panlabs, DiscoveRx). We excluded DNA methyltransferases, histone methyltransferases and transcription factors (with the exception of nuclear receptors). To reduce redundancy on the panel representative members were selected. Protein families were defined using HUGO gene nomenclature committee gene family assignation. Representative proteins from families were selected by aligning all members of a family against each other using Clustal Omega (Goujon ; Larkin ). For each family member, the percent identity against all other family members was summed, and the family member with the highest value was chosen as the most representative. When strong evidence for association with a concerning cardiovascular, respiratory, or nervous system phenotype was present, this “representative” protein was chosen for inclusion in the selectivity panel. If phenotypic evidence was less convincing the next most “representative” family member was chosen.

RESULTS

The Genetic and Pharmacological Phenotypes of a Drug’s Off-Targets Match Its Side Effects

We performed a retrospective analysis of marketed compounds to assess whether the genetics and intended pharmacology of a drug’s off-target proteins correlate with its side effects. Drug side effects were obtained from the drug label (SIDER database) (Kuhn ) and these were intersected with side effects from postmarketing reports (OFFSIDES database) (Tatonetti ) to get high-confidence side effects for each compound. To exclude the possibility that some side effects may represent exaggerated pharmacology rather than being the result of off-target interactions, we removed side effects in the same organ system as the drug indication (eg, cardiac arrhythmia side effects were excluded for any drug treating a cardiac disorder). We annotated both the drugs’ intended targets and off-targets using various databases as described in the methods. Altogether 618 individual drugs with side effect and target information were analyzed. We then examined phenotypic associations for each “off-target” protein with which these drugs interact. Phenotypes were assigned from genetics (Mendelian diseases caused by variants in the gene encoding the protein) and pharmacology (disease indications of approved drugs targeting that protein) or both and mapped to MedDRA terminology. To test for correspondence between these genetic and pharmacological phenotypes and drug side effects, we performed an enrichment test, excluding drugs where the intended target had genetic or pharmacological evidence for involvement in the side effect phenotype. For 230 MedDRA terms, we recorded the number of drugs that had a matching side effect and off-target phenotype (ie, the drug side effect matched the phenotype of at least one of the drug’s off-targets), absence of the side effect and absence of a matching off-target phenotype, presence of side effect but absence of a matching off-target phenotype, and absence of a side effect but presence of an off-target phenotype. We constructed 2 × 2 contingency tables containing the number of drugs fulfilling each of these criteria and calculated an odds ratio and p value for each phenotype term using Fisher’s exact test. When considering genetic and pharmacological phenotypes combined we found that for 46 side effect categories, drugs with off-target phenotypes predicted by these data were more likely have that particular side effect. Drugs associated with blood platelet disorders (p = 1.44 × 10−40, OR ≈ 354) and seizure (p = 5.38 × 10−39, OR ≈ 541.61) side effects showed the most significant enrichment followed by drugs with side effects affecting vision and glucose metabolism (Table 1). Other phenotypes of high safety concern where drugs with off-target evidence were overrepresented were movement disorders, heart failure, vascular hemorrhage, and cardiac arrhythmia (Table 1 and Supplementary Table 1). To further explore the biology driving these enrichment results, for each significant phenotype we identified the most frequently hit off-target protein with genetic and/or pharmacological support (Table 1).
Table 1.

Phenotypes Most Significantly Enriched for Matches Between Side Effect and Off-Target Genetics/Pharmacology

Phenotype Term (MedDRA)p ValueORMost Frequently Hit Off-Target
Blood platelet disorders1.45 × 10−40353.97TBXAS1
Seizures1.82 × 10−37541.61ADRA2B
Vision disorders1.24 × 10−3480.94KCNJ13
Glucose metabolism disorders1.81 × 10−2566.24KCNJ11
Movement disorders1.10 × 10−2358.98KCNA1, KCNC1, KCNA2, KCND3, KCNJ10, ADRA2B
Heart failures3.29 × 10−23218.55KCNJ5, KCNJ8
Nonhemolytic anemias3.05 × 10−22125.69TBXAS1
Cardiac and vascular investigations6.91 × 10−2211.80ADRA1B, ADRA1A, ADRA1D
Skin appendage conditions9.96 × 10−2251.19KCNJ8
Bone disorders9.42 × 10−21100.83TBXAS1
Disease coronary artery1.83 × 10−20323.05ADRA1A
Vascular hemorrhagic disorders2.37 × 10−2040.39TBXAS1
Acid-base disorders3.53 × 10−2082.54KCNJ1, KCNJ10
Cardiac arrhythmia NOS3.10 × 10−199.44KCNQ1, KCNJ2, KCNH2

A p value < 3.5 × 10−7 from Fisher’s exact test was considered significant based on Bonferroni correction for the 1.4 × 10−5 tests performed. This is equivalent to an adjusted p value < .05.

Phenotypes Most Significantly Enriched for Matches Between Side Effect and Off-Target Genetics/Pharmacology A p value < 3.5 × 10−7 from Fisher’s exact test was considered significant based on Bonferroni correction for the 1.4 × 10−5 tests performed. This is equivalent to an adjusted p value < .05. We then repeated our enrichment analysis using genetic and pharmacological sources of phenotypes separately. When only genetic associations were used, results were similar to those from the combined analysis, suggesting that genetic data drove most of these predictions (Supplementary Table 2). This might reflect the fact that fewer off-targets have been drugged than have genetic evidence; 315 off-target proteins in the analysis had genetic information whereas 202 had pharmacological information (of these, 94 proteins had both genetics and pharmacology). When considering only the pharmacological phenotypes of drug off-targets, 14 out of 230 phenotypes were significant and drugs associated with heart failure, coronary artery disease and seizure showed the most significant enrichment (Supplementary Table 3). These results show that the phenotypes of a drug’s off-target proteins inferred from genetics or pharmacology can be used to help predict its side effects and that combining both sources of phenotypic evidence is likely to be the most powerful approach to identify the most problematic proteins that a drug could unintentionally modulate.

Regression Modeling Shows That the Correlation Between Off-Target Phenotypes and Drug Side Effects Persists When Controlling for Confounders

Whereas a causal relationship between the phenotypes of a drug’s off-target proteins and its side effects is mechanistically plausible, there are potential confounding factors that could contribute to the enrichment we observe, including side effects that correlate with the disease being treated. In the enrichment analysis we removed side effects where there was a likely contribution from drug indication or the genetics of the drug’s intended target but may not have controlled adequately for all confounders. To address this, we tested the contribution of drug indication, on-target genetics and off-target phenotypes (from genetics and pharmacology) to the side effects of 587 small molecule drugs in a logistic regression model. We tested the 46 phenotypes that were significant in the enrichment analysis and, out of 44 with enough data to run the model, we found that off-target phenotypes were a significant predictor of drug side effects for 7 phenotypes tested (p < .05). Three of these phenotypes passed a more conservative threshold adjusted for the number of side effect classes tested (p < .001) (Figure 1 and Supplementary Table 4). The vast majority of side effect phenotypes tested showed a positive association between off-target phenotypes and drug side effects (38/44). The side effect that had the most significant contribution from off-target genetics was cardiac arrhythmia (p = 5 × 10−6, OR = 2.45) (Figure 1). Thus, the contribution of off-target phenotypes to drug side effects persists even when controlling for well-established contributors to drug side effects.
Figure 1.

Results for off-target phenotypes in logistic regression models. Showing all phenotypes where off-target phenotypes had p < .05. Circular points indicate odds ratios from the models; error bars are 95% confidence intervals. ***p < .001, **p < .01, *p < .05.

Results for off-target phenotypes in logistic regression models. Showing all phenotypes where off-target phenotypes had p < .05. Circular points indicate odds ratios from the models; error bars are 95% confidence intervals. ***p < .001, **p < .01, *p < .05.

Genetics and Pharmacology Can Help to Retrospectively Identify Off-Target Proteins Contributing to Drug Side Effects

Secondary pharmacology screening is ideally performed as part of the process of selecting which drug should continue into clinical trials. However, in reality, only a subset of human proteins can practically be screened against during development. This means that there is a risk that a drug could have off-target activity that is not detected until human exposure to the molecule. When such instances arise, it is critical to be able to determine which off-target the drug is interacting with so that a back-up molecule without these effects can be advanced into trials. We wanted to test whether our approach of applying phenotypic associations from human genetics and pharmacology could be useful for “issue resolution” in such instances. In our analysis of marketed compounds, 29% of drugs causing seizure had off-target interactions with the alpha-2B adrenergic receptor (ADRA2B) in the dataset used. Recent reports have implicated gain of function mutations in ADRA2B in familial adult myoclonic epilepsy (De Fusco ) (OMIM:607876). We asked whether 55 drugs that have seizure as a side effect but do not have ADRA2B binding data in our off-target protein dataset might have interactions with ADRA2B that could contribute seizure. To assess this, we extracted ADRA2B binding data from ChEMBL (Bento ) and found that five of the 55 drugs with seizure as a side effect had known ADRA2B interactions (Table 2). Because most of the drugs (39/55) did not have ADRA2B binding data in ChEMBL, we generated a ligand-based DNN model to predict whether any of these 39 drugs might also bind ADRA2B. This DNN model had a high prediction accuracy of 95.6% as demonstrated by 2-fold cross validation (see Materials and Methods section).
Table 2.

Drugs With Seizure As A Side Effect: Drugs With Seizure As A Side Effect That Have Known ADRA2B Binding From ChEMBL (“Known Active”) or Predicted Binding From the ADRA2B DNN Model (“Predicted Active”, Score from Model Is Shown)

DrugActivity Against ADRA2BADRA2B Activity ScoreIntended TargetDrug Indication
ChlorpromazineKnown active1 (known)DRD2, DRD1, HTR2A, HTR1A, ADRA1A, ADRA1B, HRH1Schizophrenia, nausea, vomiting, agitation
CyproheptadineKnown active1 (known)HRH1, HTR2A, HTR2CAllergic rhinitis, angioedema, urticaria, anaphylaxis
MaprotilineKnown active1 (known)SLC6A2Depression
QuetiapineKnown active1 (known)HTR2A, DRD2Bipolar disorder, schizophrenia, depression
DesloratadineKnown active1 (known)HRH1Rhinitis, urticaria, eczema, pruritus
ClozapinePredicted active0.98DRD2, HTR2ASchizophrenia
HydroxyzinePredicted active0.88HRH1Anxiety, pruritus, allergy
DuloxetinePredicted active0.71SLC6A4, SLC6A2Depression, pain, anxiety disorder
EletriptanPredicted active0.74HTR1D, HTR1B, HTR1FMigraine
AprepitantPredicted active0.96TACR1Nausea, vomiting

The intended target of these drugs (from Drugbank) along with the drug indication (from Drugbank/Pharmaprojects) is shown.

Drugs With Seizure As A Side Effect: Drugs With Seizure As A Side Effect That Have Known ADRA2B Binding From ChEMBL (“Known Active”) or Predicted Binding From the ADRA2B DNN Model (“Predicted Active”, Score from Model Is Shown) The intended target of these drugs (from Drugbank) along with the drug indication (from Drugbank/Pharmaprojects) is shown. The DNN model was then applied to predict ADRA2B binding for the 39 drugs for which seizure was documented as a side effect, but without known binding to ADRA2B. The predictions indicated that five out of these 39 drugs are likely to bind ADRA2B. Interestingly, eight of the drugs with confirmed or predicted ADRA2B binding were antipsychotics or anti-histamines and six out of these 10 drugs had structurally similar tricyclic ring scaffolds (Table 2). Notably, one of the drugs identified as an ADRA2B interactor, clozapine, is known for its risk of seizure although little was previously known about the underlying mechanism (Varma ; Williams and Park, 2015). The confirmation that 10 out of 55 drugs with seizure as a side effect have known or predicted off-target activity against ADRA2B demonstrates that human genetics can be used more broadly to prioritize proteins that may be responsible for a given side effect.

Systemic Selection of Proteins for Secondary Pharmacology Screening Using Genetics and Pharmacology

In addition to the use of in vitro assays to explain adverse events related to drug treatment in a preclinical or clinical setting, secondary pharmacology screens are used routinely during drug development to improve specificity and help select the best clinical candidates. We used phenotypes from genetics and pharmacology to systematically select proteins to include on such a screen (Figure 2A). Proteins associated with phenotypes affecting the cardiovascular, respiratory and central nervous systems were prioritized for inclusion, as disrupting the function of such vital organ systems has the potential to result in serious life-threatening conditions. This is in line with international regulatory guidance emphasizing that these organ systems are the most critical for safety pharmacology (ICH, 2001). To design a panel that could immediately be used by the pharmaceutical industry we identified which of those proteins have commercially available in vitro assays. Furthermore, to reduce redundancy on the panel, representative family members were selected in cases where more than three related proteins were under consideration (see Materials and Methods section). This resulted in a secondary pharmacology screen consisting of 70 targets from diverse protein classes (Figure 2B) associated with cardiovascular, respiratory and nervous system phenotypes (Figure 2C, Table 3 and Supplementary Table 5).
Figure 2.

Selection and characteristics of phenotype-focused secondary pharmacology screen. A, Selection of targets to include on secondary pharmacology screen, using phenotypes from human genetics and drug indications. B, Protein class distribution. C, Distribution of key safety phenotypes for the 70 proteins included on our proposed secondary pharmacology panel.

Table 3.

Secondary Pharmacology Screen for Use in Small Molecule Drug Development: Selected Proteins Out of 70 Proteins on the Screen and Their Associated Phenotypes

ProteinGenePhenotype PharmExample DrugsPhenotype GeneticsGenetics ReferenceCategory
Potassium channel subunit SUR2ABCC9AnginaNicorandilAtrial fibrillationOMIM:614050CV
Adenosine receptor A2AADORA2AAsthma and bronchodilationTheophylline, theobromineNoneNoneRespiratory
Adrenergic receptor A2BADRA2BHypertensionClonidine, prazosinEpilepsyOMIM:607876CV, nervous
Na+/K+-ATPase 1A1ATP1A1Heart failureDigoxinNoneNoneCV
Cathepsin FCTSFNoneNoneCeroid lipofuscinosisOMIM:615362Nervous
Dopamine receptor 2DRD2Parksinson's diseaseCabergolineDystonia, myoclonicOMIM:159900Nervous
Coagulation factor II receptorF2RThrombosisVorapaxarNoneNoneCV
Gamma-aminobutyric acid type A receptor alpha1 subunitGABRA1EpilepsyEstazolam, topiramateEpilepsyOMIM:615744Nervous
Glutamate metabotropic receptor 1GRM1NoneNoneSpinocerebellar ataxiaOMIM:614831Nervous
Hyperpolarization-activated cyclic nucleotide gated potassium channel 4HCN4NoneNoneSick sinus syndrome, Brugada syndromeOMIM:163800CV
OMIM:613123
Purinergic receptor P2RY12P2RY12ThrombosisClopidogrel, ticlopidinePlatelet bleeding disorderOMIM:609821CV
Nav1.5 channelSCN5ACardiac arrhythmiasDisopyramide, flecainideBrugada syndrome, ventricular fibrillation, long QT syndrome, sick sinus syndrome, atrial fibrillationOMIM:601144CV
OMIM:601154
OMIM:603829
OMIM:603830
OMIM:608567
OMIM:614022
High-affinity choline transporterSLC5A7NoneNoneNeuronopathyOMIM:158580Nervous
Thromboxane A synthase 1TBXAS1NoneNoneGhosal hematodiaphyseal syndromeOMIM:231095CV
Transient receptor potential cation channel subfamily M member 4TRPM4NoneNoneProgressive familial heart blockOMIM:604559CV
Secondary Pharmacology Screen for Use in Small Molecule Drug Development: Selected Proteins Out of 70 Proteins on the Screen and Their Associated Phenotypes Selection and characteristics of phenotype-focused secondary pharmacology screen. A, Selection of targets to include on secondary pharmacology screen, using phenotypes from human genetics and drug indications. B, Protein class distribution. C, Distribution of key safety phenotypes for the 70 proteins included on our proposed secondary pharmacology panel. A number of proteins on our panel have phenotypic associations that come solely from human genetics including the hyperpolarization-activated ion channel HCN4 and transient receptor potential cation channel TRPM4, which are included based on links between mutations in the genes encoding these proteins and cardiac conduction defects (Kruse ; Liu ; Milanesi ; Milano ; Nof ; Schulze-Bahr ; Schweizer ; Stallmeyer ; Ueda ). Some proteins on the screen have evidence coming from both pharmacology and genetics. One such example is the purinergic receptor P2RY12 which is a target of the anti-clotting drugs clopidogrel and ticlopidine (Boeynaems ; Dorsam ; Herbert and Savi, 2003). Consistent with a biological role in blood clotting, loss-of-function mutations in the P2RY12 gene cause a platelet-type bleeding disorder (Cattaneo ; Nurden ). By systematically choosing proteins with known phenotypic associations for screening, we can eliminate interactions between drug candidates and the most concerning off-target proteins. Furthermore, if certain drug off-target protein interactions cannot be avoided, knowing the potential consequences of such interactions allows us to monitor for them in preclinical and clinical studies.

DISCUSSION

We show that drug side effect can be predicted, in part, from the genetics and pharmacology of its “off-target” protein interactions. We examined the contribution made by the phenotypes of a drug’s off-target proteins to its side effects by examining a set of marketed drugs using two different methods—enrichment analysis and logistic regression. The enrichment analysis showed that for 46 side effect phenotypes, including many of high safety concern such as seizure and platelet disorders, drugs with off-target evidence for the same phenotype were more likely to have that side effect. We then attempted to more comprehensively model the effect of off-target phenotype, drug indication, and on-target genetics on drug side effects using logistic regression. For seven side effect phenotypes we found that off-target genetics and/or pharmacology were a significant predictor of drug side effects, with cardiac arrhythmia having the most significant effect. The reasons why the phenotypes significant in the enrichment analysis are not all significant in the logistic regression may be due to an inability to separate the contribution of drug off-target protein interactions from the contribution of other predictors such as drug indication or due to limited power of the regression models due to the relatively small number of drugs examined. Nonetheless these results, taken together with the enrichment analysis, suggest that using phenotypes from genetics and pharmacology can help to identify drug off-target protein interactions contributing to side effects. A challenge with this type of analysis is having a comprehensive set of “off-target” interactions available for each drug. Here we used a curated database of in vitro drug-target interactions, a limitation of which is a lack of negative experimental results (ie, a set of targets with which a drug does not interact). Furthermore, available in vitro data may be skewed towards highly studied proteins. In the future, as such data sources expand and methods of in silico target prediction improve, more comprehensive information on drug off-target interactions may strengthen such analyses. In addition, if a larger set of well-curated drug side effect information were available this would increase statistical power and perhaps allow us detect effects for additional phenotypes. For some phenotypes certain off-targets predominate, for example drugs causing platelet side effect frequently modulated thromboxane synthase (TBXAS1) and drugs with seizure side effects often interacted with ADRA2B as well as a number of potassium channels. This could be due to a number of factors. One of these may be the propensity of small molecules to interact with these proteins due to structural features or other protein properties. In the case of TBXAS1, although its structure remains undermined, a detailed fluorescence spectroscopy study demonstrated that the hydrophobic active site of the enzyme is large so that TBXAS1 could accommodate multiple ligands simultaneously. The ligand binding is further facilitated by a cluster of phenylalanine residues near the ligand binding pocket, which increases the chance of unspecific entropic binding (Chao ). In the case of ADRA2B and many other GPCR targets, their flexible and half buried binding pockets tend to attract compounds with varied chemical structures containing hydrophobic fragment(s) (Maudsley ). Another explanation is that proteins similar to a drug’s intended target are more likely to have unintended interactions with that molecule. As approximately 34% of marketed drugs modulate GPCRs to exert their therapeutic effects (Hauser ; Rask-Andersen ; Santos ) this may partially explain the propensity of ADRA2B to have off-target interactions with some of these molecules. Such frequently occurring proteins from our analysis could be prioritized for off-target screening when investigating drug-protein interactions that are responsible for particular side effects. During the drug development process secondary pharmacology screening is used for two main applications and prioritizing proteins using genetic and pharmacological evidence can be useful for both. First, assessment of drug off-target protein interactions can be used to explain which proteins may be mediating particular adverse events seen in preclinical or clinical studies. Mimicking this situation, we examined whether drugs causing seizure interacted with alpha-2B adrenergic receptor (ADRA2B), a protein with genetic evidence for involvement in epilepsy. We found that 10 out of 55 drugs examined had known or predicted binding to ADRA2B. This may indicate that, for these drugs, off-target interactions with ADRA2B could contribute to seizure. For the drugs predicted to be inactive against ADRA2B, their observed seizure side effect may result from activities against other off-target proteins. Indeed, 26% of drugs with seizure side effects analyzed interacted with the potassium channels KCNQ2, KCNT1, and/or KCNA2 all of which have genetic evidence for involvement in epilepsy and seizure (Barcia ; Singh ; Syrbe ). Our enrichment analysis does not distinguish between whether a single off-target is the likely culprit for causing a side effect or if, in certain situations, interaction between a drug and multiple off-target proteins is needed to elicit a particular side effect. This would be an interesting avenue for further study. Second, a routine part of lead optimization is screening drugs against a secondary pharmacology panel to ensure specificity against the proteins most likely to cause serious safety concerns. Achieving complete selectivity for the intended drug target is challenging for small molecule drugs so it is important to avoid unintended interactions with the proteins most likely to have serious effects in humans. Using genetic and pharmacological evidence we designed a panel consisting of proteins associated with cardiovascular, respiratory and nervous system phenotypes, consistent with regulatory guidance. A key consideration in drug safety is the dose at which adverse events occur. The potency of the drug candidate against off-target proteins can be determined by titration of the compound in the secondary pharmacology assay and this information can be incorporated into the model of exposure used preclinically or clinically. During in vivo studies, positive hits on the in vitro secondary pharmacology screen could be followed up in a number of ways: (1) Appropriate monitoring for the phenotypes expected from off-target hits can be added to preclinical or clinical studies. For example, for hits against Nav1.5/SCN5A we would monitor for cardiac arrhythmia as genetic and pharmacological evidence shows strong associations between Nav1.5 and arrhythmia (see Table 3 and [Chen ; Wang ]). (2) We can also integrate biomarkers or assays establishing off-target activity into preclinical or clinical studies. For example for inhibitors of the purinergic receptor P2RY12, ADP-induced platelet aggregation assays are commonly used to assess efficacy (Storey ), the same assays could be used to assess whether off-target activity against P2RY12 is detectable in vivo. As data are generated using this new secondary pharmacology screen of 70 proteins, the relationship between off-target in vitro activity and in vivo drug side effects will become more clear, and the screen can be further refined. Our secondary pharmacology screen is limited based on available assays. Given the changing nature of pharmaceutical companies’ portfolios, a broader range of potential off-target proteins may need to be considered for future screening. Proteins for which no assays are currently available could be prioritized for assay development based on genetic evidence linking them to adverse phenotypes in humans. In some cases, the phenotypes resulting from genetic perturbation and drug indications gleaned from pharmacology may be different. For example, ADRA2B is a target of the drug Clonidine used to treat hypertension but gain-of-function mutations in ADRA2B are associated with epilepsy (De Fusco ). This may be due to a number of factors. Notably, genetic information usually results from perturbing a single gene whereas drugs may have multiple intended targets that contribute to their therapeutic phenotypes as is the case for Clonidine. Other differences between genetic and pharmacological phenotypes could be due to differences in effects on protein function (eg, pharmacological agonism but genetic loss-of-function). There may also be differences in the phenotypes manifesting as a result of the lifelong perturbation caused by genetic variants and those caused by acute pharmacological modulation. Nonetheless, our enrichment analysis results support using both genetic and pharmacological to get a comprehensive view of the potential consequences of drugging a particular protein. An additional consideration for assessing the impact of drug off-target interactions is the effect of genetic polymorphisms in the gene encoding the off-target protein. Individuals harboring variants in the genes encoding drug off-target proteins may be more susceptible to side effects caused by such drug off-target interactions. One example of this is people with nonpathogenic variants in genes encoding ion channels such as hERG/KCNH2 and KCNE1 who are more susceptible to drug-induced arrhythmias caused by off-target interactions with these channels (Kannankeril ; Paulussen ). Beyond the applications examined here, which focus on small molecule therapeutics, there are a number of other uses for this kind of phenotype-centered approach in off-target screening. For example, when developing antibody therapeutics although specificity is not as big an issue as it is for small molecules, a counter screen against a protein related to the drug target is usually employed. Genetics could be used in this situation to select the target family member associated with the most concerning phenotype for counter screening. We anticipate that integrating phenotypic information from genetics and pharmacology into secondary pharmacology screening will be relevant to current and future drug development programs and will help to reduce safety-related failures and drug side effects.

SUPPLEMENTARY DATA

Supplementary data are available at Toxicological Sciences online. Click here for additional data file.
  64 in total

1.  Mutational spectrum in the Ca(2+)--activated cation channel gene TRPM4 in patients with cardiac conductance disturbances.

Authors:  Birgit Stallmeyer; Sven Zumhagen; Isabelle Denjoy; Guillaume Duthoit; Jean-Louis Hébert; Xavier Ferrer; Svetlana Maugenre; Wilhelm Schmitz; Uwe Kirchhefer; Ellen Schulze-Bahr; Pascale Guicheney; Eric Schulze-Bahr
Journal:  Hum Mutat       Date:  2011-10-20       Impact factor: 4.878

2.  Cathepsin D deficiency underlies congenital human neuronal ceroid-lipofuscinosis.

Authors:  Eija Siintola; Sanna Partanen; Petter Strömme; Aleksi Haapanen; Matti Haltia; Jan Maehlen; Anna-Elina Lehesjoki; Jaana Tyynelä
Journal:  Brain       Date:  2006-05-02       Impact factor: 13.501

3.  Probing ligand binding to thromboxane synthase.

Authors:  Wei-Chih Chao; Jyh-Feng Lu; Jinn-Shyan Wang; Hsiao-Ching Yang; Tai-An Pan; Steven Chun-Wei Chou; Lee-Ho Wang; Pi-Tai Chou
Journal:  Biochemistry       Date:  2013-01-29       Impact factor: 3.162

Review 4.  Validating therapeutic targets through human genetics.

Authors:  Robert M Plenge; Edward M Scolnick; David Altshuler
Journal:  Nat Rev Drug Discov       Date:  2013-07-19       Impact factor: 84.694

Review 5.  Reducing attrition in drug development: smart loading preclinical safety assessment.

Authors:  Ruth A Roberts; Stefan L Kavanagh; Howard R Mellor; Christopher E Pollard; Sally Robinson; Stefan J Platz
Journal:  Drug Discov Today       Date:  2013-11-21       Impact factor: 7.851

6.  De novo loss- or gain-of-function mutations in KCNA2 cause epileptic encephalopathy.

Authors:  Steffen Syrbe; Ulrike B S Hedrich; Erik Riesch; Tania Djémié; Stephan Müller; Rikke S Møller; Bridget Maher; Laura Hernandez-Hernandez; Matthis Synofzik; Hande S Caglayan; Mutluay Arslan; José M Serratosa; Michael Nothnagel; Patrick May; Roland Krause; Heidrun Löffler; Katja Detert; Thomas Dorn; Heinrich Vogt; Günter Krämer; Ludger Schöls; Primus E Mullis; Tarja Linnankivi; Anna-Elina Lehesjoki; Katalin Sterbova; Dana C Craiu; Dorota Hoffman-Zacharska; Christian M Korff; Yvonne G Weber; Maja Steinlin; Sabina Gallati; Astrid Bertsche; Matthias K Bernhard; Andreas Merkenschlager; Wieland Kiess; Michael Gonzalez; Stephan Züchner; Aarno Palotie; Arvid Suls; Peter De Jonghe; Ingo Helbig; Saskia Biskup; Markus Wolff; Snezana Maljevic; Rebecca Schüle; Sanjay M Sisodiya; Sarah Weckhuysen; Holger Lerche; Johannes R Lemke
Journal:  Nat Genet       Date:  2015-03-09       Impact factor: 38.330

7.  The SIDER database of drugs and side effects.

Authors:  Michael Kuhn; Ivica Letunic; Lars Juhl Jensen; Peer Bork
Journal:  Nucleic Acids Res       Date:  2015-10-19       Impact factor: 16.971

8.  DrugBank: a knowledgebase for drugs, drug actions and drug targets.

Authors:  David S Wishart; Craig Knox; An Chi Guo; Dean Cheng; Savita Shrivastava; Dan Tzur; Bijaya Gautam; Murtaza Hassanali
Journal:  Nucleic Acids Res       Date:  2007-11-29       Impact factor: 16.971

Review 9.  The Size of the Human Proteome: The Width and Depth.

Authors:  Elena A Ponomarenko; Ekaterina V Poverennaya; Ekaterina V Ilgisonis; Mikhail A Pyatnitskiy; Arthur T Kopylov; Victor G Zgoda; Andrey V Lisitsa; Alexander I Archakov
Journal:  Int J Anal Chem       Date:  2016-05-19       Impact factor: 1.885

10.  Reduction of low density lipoprotein-cholesterol and cardiovascular events with proprotein convertase subtilisin-kexin type 9 (PCSK9) inhibitors and statins: an analysis of FOURIER, SPIRE, and the Cholesterol Treatment Trialists Collaboration.

Authors:  Brian A Ference; Christopher P Cannon; Ulf Landmesser; Thomas F Lüscher; Alberico L Catapano; Kausik K Ray
Journal:  Eur Heart J       Date:  2018-07-14       Impact factor: 29.983

View more
  7 in total

1.  In silico approaches in organ toxicity hazard assessment: current status and future needs in predicting liver toxicity.

Authors:  Arianna Bassan; Vinicius M Alves; Alexander Amberg; Lennart T Anger; Scott Auerbach; Lisa Beilke; Andreas Bender; Mark T D Cronin; Kevin P Cross; Jui-Hua Hsieh; Nigel Greene; Raymond Kemper; Marlene T Kim; Moiz Mumtaz; Tobias Noeske; Manuela Pavan; Julia Pletz; Daniel P Russo; Yogesh Sabnis; Markus Schaefer; David T Szabo; Jean-Pierre Valentin; Joerg Wichard; Dominic Williams; David Woolley; Craig Zwickl; Glenn J Myatt
Journal:  Comput Toxicol       Date:  2021-09-09

2.  Current status and future directions for a neurotoxicity hazard assessment framework that integrates in silico approaches.

Authors:  Kevin M Crofton; Arianna Bassan; Mamta Behl; Yaroslav G Chushak; Ellen Fritsche; Jeffery M Gearhart; Mary Sue Marty; Moiz Mumtaz; Manuela Pavan; Patricia Ruiz; Magdalini Sachana; Rajamani Selvam; Timothy J Shafer; Lidiya Stavitskaya; David T Szabo; Steven T Szabo; Raymond R Tice; Dan Wilson; David Woolley; Glenn J Myatt
Journal:  Comput Toxicol       Date:  2022-03-17

3.  In silico approaches in organ toxicity hazard assessment: Current status and future needs for predicting heart, kidney and lung toxicities.

Authors:  Arianna Bassan; Vinicius M Alves; Alexander Amberg; Lennart T Anger; Lisa Beilke; Andreas Bender; Autumn Bernal; Mark T D Cronin; Jui-Hua Hsieh; Candice Johnson; Raymond Kemper; Moiz Mumtaz; Louise Neilson; Manuela Pavan; Amy Pointon; Julia Pletz; Patricia Ruiz; Daniel P Russo; Yogesh Sabnis; Reena Sandhu; Markus Schaefer; Lidiya Stavitskaya; David T Szabo; Jean-Pierre Valentin; David Woolley; Craig Zwickl; Glenn J Myatt
Journal:  Comput Toxicol       Date:  2021-09-13

Review 4.  Using human genetics to improve safety assessment of therapeutics.

Authors:  Keren J Carss; Aimee M Deaton; Alberto Del Rio-Espinola; Dorothée Diogo; Mark Fielden; Diptee A Kulkarni; Jonathan Moggs; Peter Newham; Matthew R Nelson; Frank D Sistare; Lucas D Ward; Jing Yuan
Journal:  Nat Rev Drug Discov       Date:  2022-10-19       Impact factor: 112.288

5.  Exploration of the DARTable Genome- a Resource Enabling Data-Driven NAMs for Developmental and Reproductive Toxicity Prediction.

Authors:  Elzbieta I Janowska-Sejda; Yeyejide Adeleye; Richard A Currie
Journal:  Front Toxicol       Date:  2022-01-19

6.  A proteomic platform to identify off-target proteins associated with therapeutic modalities that induce protein degradation or gene silencing.

Authors:  Xin Liu; Ye Zhang; Lucas D Ward; Qinghong Yan; Tanggis Bohnuud; Rocio Hernandez; Socheata Lao; Jing Yuan; Fan Fan
Journal:  Sci Rep       Date:  2021-08-04       Impact factor: 4.379

7.  GWAS of serum ALT and AST reveals an association of SLC30A10 Thr95Ile with hypermanganesemia symptoms.

Authors:  Lucas D Ward; Ho-Chou Tu; Chelsea B Quenneville; Shira Tsour; Alexander O Flynn-Carroll; Margaret M Parker; Aimee M Deaton; Patrick A J Haslett; Luca A Lotta; Niek Verweij; Manuel A R Ferreira; Aris Baras; Gregory Hinkle; Paul Nioi
Journal:  Nat Commun       Date:  2021-07-27       Impact factor: 14.919

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.