Literature DB >> 27610414

Modeling Reactivity to Biological Macromolecules with a Deep Multitask Network.

Tyler B Hughes¹, Na Le Dang¹, Grover P Miller², S Joshua Swamidass¹.

Abstract

Most small-molecule drug candidates fail before entering the market, frequently because of unexpected toxicity. Often, toxicity is detected only late in drug development, because many types of toxicities, especially idiosyncratic adverse drug reactions (IADRs), are particularly hard to predict and detect. Moreover, drug-induced liver injury (DILI) is the most frequent reason drugs are withdrawn from the market and causes 50% of acute liver failure cases in the United States. A common mechanism often underlies many types of drug toxicities, including both DILI and IADRs. Drugs are bioactivated by drug-metabolizing enzymes into reactive metabolites, which then conjugate to sites in proteins or DNA to form adducts. DNA adducts are often mutagenic and may alter the reading and copying of genes and their regulatory elements, causing gene dysregulation and even triggering cancer. Similarly, protein adducts can disrupt their normal biological functions and induce harmful immune responses. Unfortunately, reactive metabolites are not reliably detected by experiments, and it is also expensive to test drug candidates for potential to form DNA or protein adducts during the early stages of drug development. In contrast, computational methods have the potential to quickly screen for covalent binding potential, thereby flagging problematic molecules and reducing the total number of necessary experiments. Here, we train a deep convolution neural network-the XenoSite reactivity model-using literature data to accurately predict both sites and probability of reactivity for molecules with glutathione, cyanide, protein, and DNA. On the site level, cross-validated predictions had area under the curve (AUC) performances of 89.8% for DNA and 94.4% for protein. Furthermore, the model separated molecules electrophilically reactive with DNA and protein from nonreactive molecules with cross-validated AUC performances of 78.7% and 79.8%, respectively. On both the site- and molecule-level, the model's performances significantly outperformed reactivity indices derived from quantum simulations that are reported in the literature. Moreover, we developed and applied a selectivity score to assess preferential reactions with the macromolecules as opposed to the common screening traps. For the entire data set of 2803 molecules, this approach yielded totals of 257 (9.2%) and 227 (8.1%) molecules predicted to be reactive only with DNA and protein, respectively, and hence those that would be missed by standard reactivity screening experiments. Site of reactivity data is an underutilized resource that can be used to not only predict if molecules are reactive, but also show where they might be modified to reduce toxicity while retaining efficacy. The XenoSite reactivity model is available at http://swami.wustl.edu/xenosite/p/reactivity.

Entities: Chemical Disease Gene Species

Year: 2016 PMID： 27610414 PMCID： PMC4999971 DOI： 10.1021/acscentsci.6b00162

Source DB: PubMed Journal: ACS Cent Sci ISSN： 2374-7943 Impact factor: 14.553

Introduction

Most small-molecule drug candidates fail before entering the market,[1] frequently because of unexpected toxicity.[1,2] Often, toxicity is detected only late in drug development, because many types of toxicities, especially idiosyncratic adverse drug reactions (IADRs), are particularly hard to predict and detect.[3,4] Moreover, drug-induced liver injury (DILI) is the most frequent reason drugs are withdrawn from the market and causes 50% of acute liver failure cases in the United States.[5] A common mechanism often underlies many types of drug toxicity, including both DILI and IADRs. Drugs are bioactivated by drug-metabolizing enzymes into reactive metabolites, which then conjugate to sites in proteins or DNA to form adducts. DNA adducts are often mutagenic and may alter the reading and copying of genes and their regulatory elements, causing gene dysregulation and even triggering cancer.[5−7] Similarly, protein adducts can disrupt their normal biological functions and induce harmful immune responses.[5,8,9] Metabolites are reactive because of their chemical properties and are often generally classified as soft or hard electrophiles based on polarizability and their preferential reaction with targeted nucleophiles.[10] Soft electrophiles such as epoxides or Michael acceptors have low electron density at multiple sites, while hard electrophiles such as carbocations or saturated aldehydes have a localized site with low electron density.[10−12] Soft electrophiles tend to react with soft nucleophiles, such as cysteine residues within glutathione (GSH) or protein, whereas hard electrophiles generally react with hard nucleophiles, such as purine and pyrimidine bases in DNA or lysine and histidine residues within protein.[5,10,11,13−15] Despite these general rules, it remains a challenge to predict the reactivity of small molecules and their likelihood for modifying DNA and proteins. Conjugation between small molecules and nucleophilic GSH (soft) or cyanide (hard) is commonly used in screening studies to identify molecules capable of forming adducts. Detecting GSH or cyanide adducts is easier than detecting protein or DNA adducts, and serves as a proxy for reactivity to macromolecules in experimental studies.[16,17] Moreover, GSH is a physiologically relevant trapping agent, which reaches millimolar levels in cells and protectively conjugates with many reactive molecules.[18] However, GSH and cyanide are only imperfect proxies for protein and DNA. Proteins and DNA are structurally complex macromolecules and likely have correspondingly complex reactivities with a diverse set of soft and hard nucleophilic sites. Therefore, we expect some molecules will react with protein or DNA, but not efficiently react with GSH or cyanide. These molecules are of special concern, because they do not react with GSH or cyanide, and consequently are likely to be missed by many standard reactivity experiments. Computational modeling could provide a complementary strategy for detecting molecules likely to be reactive, and therefore prone to causing DILI or IADRs, including those molecules missed by standard screening assays. Others have proposed QSAR models to predict GSH reactivity, yet these models are of limited value, as they are focused on very limited structural groups.[19] In contrast, we recently published a computational model that predicted GSH reactivity toward a diverse set of chemicals at both the site and molecule level.[20] Here, we extended our previously published GSH reactivity model to also predict reactivity with cyanide, protein, and DNA. First, we extracted a structurally diverse, literature-derived database of molecules known to bind DNA and protein, as well as the simple nucleophilic traps cyanide and GSH (Figure ). Second, we labeled the site of conjugation on each molecule, known as its site of reactivity (SOR). Third, we used a deep convolution neural network to accurately predict these SORs in cross-validated experiments. Fourth, we transformed SOR scores to accurate molecule-level electrophilic reactivity scores that accurately predict whether molecules will conjugate to DNA or protein. Fifth, we applied these molecule reactivity scores to calculate DNA and protein selectivity scores to estimate the fraction of molecules that are reactive to DNA and protein but not cyanide or GSH.

Figure 1

Examples of the four types of sites of electrophilic reactivity modeled. Predictions by the XenoSite reactivity model are indicated on the left with a colored shading gradient and white circle for each known site of reactivity. These predictions range from zero to one, indicating the probability that an atom is reactive with each of the four nucleophilic targets. From top to bottom, a cyanide conjugation of a nefazodone metabolite,[21] a DNA conjugation of N-acetylbenzidine,[22] a glutathione (GSH) conjugation of menadione,[23] and a protein conjugation of a butylated hydroxytoluene metabolite.[24] DNA and protein are inherently structurally diverse, and thus cartoonized macromolecules are depicted, with the rest of each macromolecule represented by “Xx”. Of course, ultimately, the XenoSite reactivity model will be connected to models of drug metabolism to be most useful. Although out of the scope of this study, combined metabolism and reactivity models would be able to predict both bioactivation and subsequent toxicity of metabolites. Nonetheless, this study takes a significant step toward effectively managing the IADR and DILI risk of new medications with computational modeling.

Results and Discussion

In the initial section, we summarize a systematic effort to optimize the structure and training of the model, with the goal of choosing the best method for predicting reactivity. The following sections then investigate the performance of the final optimized model. First, we evaluated the model’s cross-validated atom reactivity scores (ARS) by testing SOR classification performance within reactive molecules. Second, we compared ARS performance to that of atom-level quantum chemical reactivity indices. Third, we assessed ARS performance on an external test set. Fourth, we calculated the accuracy of the model’s cross-validated molecule reactivity scores (MRS) at predicting molecule reactivity. Fifth, we compared MRS performance to molecule-level quantum chemical reactivity indices. Sixth, we use the model to estimate the number of high throughput screening molecules that are reactive with macromolecules (DNA and protein) but are not flagged by small-molecule trapping agents (GSH and cyanide).

Model Optimization

Several experiments demonstrate how each of the innovations in our modeling approach improves performance. These experiments are briefly discussed here, and further details and data are available in Supporting Information. First, we hypothesized that jointly modeling several types of reactivity in a multitask learning model would improve predictions on the smaller data sets.[25,26] Indeed, the multitask model outperformed the individual modeling approach at predicting cyanide and protein SOR (Figure S2). This is likely because the cyanide and protein reactivity tasks are the most difficult and, therefore, benefit most from integrated modeling. The cyanide data set is difficult because it is small, and the protein data set is both small and includes the most diverse mechanisms. The data reported herein reflects modeling all four types of reactivity together in a multitask network, instead of building separate models for each task. Second, a modular input layer was used to group related descriptors (such as the identities of all atoms a certain depth away), rather than a traditional three-layer neural network structure. We hypothesized that building explicit chemical knowledge into the model structure could reduce the total number of parameters in the model, thereby creating a simpler model with better generalizability. This possibility was inspired by several examples of modular neural networks in the literature.[27−32] In fact, the modular structure did enable reduction of the total number of weights in the model by 50%, while retaining the same performance (Figure S3). This weight-reduced, modularly structured model outperformed a traditionally structured model with the same number of weights. Third, we found that including quantum chemical descriptors did not improve performance compared to a topological-descriptor-only model (Figure S4), so we did not include the quantum chemical descriptors in the final model. Construction of this topological-descriptor-only model was inspired by our previous model of GSH reactivity, which primarily relied upon topological descriptors rather than quantum chemical descriptors.[20] A common critique of neural networks is their opacity compared to more transparent methods with easily interpretable weights, like a logistic regressor. In response to this critique and to gain insight into the inner workings of our model, a permutation sensitivity analysis was performed (Figure S5). We have previously used this approach to expose the structure of similar models.[20,33,34] Details of the permutation sensitivity analysis are available in the Supporting Information. Fourth and finally, we found that including the negative epoxides in the training data substantially improved the model’s ability to identify reactive epoxides (Figure S6).

Accuracy at Predicting Sites of Reactivity

A central objective of this study was to accurately predict SORs: the specific atom(s) within reactive molecules that covalently bind to nucleophilic sites within DNA and protein. The XenoSite reactivity model gives four reactivity scores to each atom (ARS) within a test molecule, each of which ranged from zero to one, and represented the probability that an atom is reactive with cyanide, DNA, GSH, or protein, respectively. Within a reactive molecule, a well-performing model should assign reactive atoms with higher scores than nonreactive atoms. The magnitude of the SOR values and identity of the target(s) of the reactive molecule sheds light on preferential chemistries leading to adduction and structures of the adducts. Such knowledge is relevant for reactive metabolite identification under experimental conditions and further study on the consequences of adducts on normal biological processes. Furthermore, SOR predictions indicate the probability of reactive hot spots leading to unfavorable adductions, and such knowledge could be leveraged to engineer rational modifications to reduce a molecule’s reactivity and potentially its toxicity. The accuracy of the ARS scores was assessed using 10 replicates of 10-fold cross-validation, where groups of related molecules are held out in the same fold. Each replicate yielded very similar results, so for brevity we reported the results from the first one. Performance was quantified by two metrics. First, we calculated the average site area under the curve (AUC) by computing the area under the receiver operating characteristic (ROC) curve (AUC) for each molecule and averaging the AUCs for each molecule in the data set.[20,34] Second, we calculated the top-two metric, which is a standard for site of metabolism predictions. This approach considers a molecule correctly predicted if any of its SOR are predicted in the first or second rank positions.[34−38] Reactivity indices drawn from the quantum modeling literature are another method for predicting atom reactivity,[39−43] and thus, they provide an important point of comparison to our ARS for predicting SOR. XenoSite’s cross-validated ARS predicted reactive atoms with average site AUC accuracies of 96.6%, 89.8%, 92.8%, and 94.4%, and top-two accuracies of 83.9%, 80.6%, 80.9%, and 84.2%, for cyanide, DNA, GSH, and protein, respectively (Figure ). These performances are very accurate, especially when compared to the current standard of atom-level reactivity indices (listed in Table ) from quantum simulations.[39−43] Consistent with our previous model of GSH reactivity, ARS outperformed all reactivity indices tested across all four nucleophilic targets.[20] For example, for predicting DNA SOR, the best performing descriptor by both metrics was πS(r), with an average site AUC of 60.9% and a top-two accuracy of 27.6%, which are both significantly lower than ARS’s corresponding performances of 89.8% and 80.6%. The machine learning approach we adopt here is strikingly more accurate than quantum chemical measures of reactivity.

Figure 2

Table 1

Quantum Chemical Reactivity Descriptors

Atom-Level Descriptors
D^N(r)	nucleophilic delocalizability
D^E(r)	electrophilic delocalizability
π^S(r)	self-polarizability

Atom reactivity scores (ARS) accurately identified sites of reactivity (SORs). The average site AUC (top left) and top-two (top center) metrics were computed for each of the four target nucleophiles and used to assess the cross-validated ARS for 1364 reactive molecules extracted from the Accelrys Metabolite Database (AMD) with their SORs labeled. ARS outperformed all quantum chemical descriptors tested for each target nucleophile. Bottom, from the latest AMD release, an external test of 14 molecules was extracted and SOR(s) labeled for GSH. Performance is equivalent to that of the cross-validated predictions as measured by both the average site AUC (bottom left) and the top-two (bottom center) metrics. Right, two example molecules from the external test set are visualized with their predictions, indicated by the colored shading. Each site of reactivity is labeled with a white circle. Top right, an antimalarial drug candidate metabolite,[44] and bottom right, gefitinib.[45] As further evidence of generalizability, the model was applied to an external test of 14 molecules reactive with GSH that were collected from a newer version of the AMD than that used in training (Figure ). The model predicted SOR on this test set with an average site AUC accuracy of 93.6%, matching the cross-validated performance of 92.8%. The top-two external test set performance was 64.3%, somewhat lower than the cross-validated performance of 80.9%. The lower performance is well explained by the high variance of the estimate, because there are only a few molecules in the set. Moreover, the performance of the model is still substantially better than the quantum chemical descriptors, which are indistinguishable from zero performance on the test molecules. Depictions of all 14 external test set molecules with their GSH ARS and SOR are available in the Supporting Information, alongside their most closely similar molecule from the training set (Figure S7).

Accuracy at Identifying Reactive Molecules

Another central goal is to accurately discriminate between reactive and nonreactive molecules. XenoSite predicted four reactivity scores for each molecule (MRS), each of which ranged from zero to one, and represented the probability that the molecule is reactive with cyanide, DNA, GSH, or protein, respectively. The accuracies of MRS were measured by 10-fold cross-validation, with performance quantified by the area under the ROC curve (molecule AUC). The MRS scores were reasonably accurate in separating reactive and nonreactive molecules (90.3%, 78.7%, 77.7%, and 79.8% for cyanide, DNA, GSH, and protein, respectively, Figure ). In contrast, the performances of the best reactivity indices using traditional quantum descriptors were much lower: 84.9% (−EHOMO for cyanide), 73.7% (max[DN(r)] for DNA), 62.9% (πS(r) for GSH), and 65.1% (max[DN(r)]) for protein). For each nucleophilic target, MRS outperformed all quantum chemical descriptors.

Figure 4

Molecule reactivity scores (MRS) predict some molecules significantly more reactive with biological macromolecules than nucleophilic traps used in standard screening assays. Discordant predictions between the protein and GSH models (left), and between the DNA and cyanide models (right), are visualized. As indicated both by the colored shading (corresponding to atom reactivity scores) and by the MRS listed below, for each molecule the biological macromolecule (protein or DNA) prediction is significantly greater than the corresponding standard trapping agent (GSH or cyanide) prediction. Experimentally known sites of reactivity are labeled with white circles. Upper left pair: a metabolite of 6-nitrochrysene (MRSPRO: 0.42, MRSGSH: 0.29), an environmental pollutant.[50] Bottom left pair: N-acetoxy-sulfamethoxazole (MRSPRO: 0.51, MRSGSH: 0.38), a metabolite that may mediate the toxicity of the antibiotic sulfamethoxazole.[51] Upper right pair: a metabolite of lucidin-3-O-primeveroside (MRSDNA: 0.68, MRSCN: 0.26), a component of a food dye known to be carcinogenic in rats.[52] Bottom right pair: 7H-dibenzo(c,g)carbazole-3,4-dione (MRSDNA: 0.64, MRSCN: 0.30), an environmental multispecies carcinogen.[53]

The model’s MRS scores were superior to other methods, yet still were lower accuracy than the ARS scores. This discrepancy was likely due to the presence of more noise in the molecule-level data than in the atom-level data. The ascertainment bias in the literature means that many of the negative molecules are actually reactive. When extracting nonreactive molecules, we assumed that molecules were not reactive to a nucleophile if they were not reported to be reactive to that nucleophile by any reactions in the AMD. This assumption was not solid evidence of molecules’ nonreactivity, because many studies do not test for reactivity. By contrast, the atom-level reactivity data were all drawn from experiments that tested for reactivity and thus were much less noisy with many fewer false negatives. A similar drop from atom- to molecule-level accuracy was observed in our previous studies with the literature derived data, including the GSH reactivity model that was a precursor to this work.[20,34] In the future, we plan to improve the data by experimentally testing the reactivity of “nonreactive” molecules that nonetheless receive high MRS. We expect that some of these false positives will be shown to be in fact reactive and that this effort will validate the models and improve the training data. Molecule reactivity scores (MRS) accurately identified reactive molecules. A number of prediction methods were compared by their performance at identifying reactive molecules. Each nonreactive molecule is structurally similar to a reactive molecule. To measure performance, the area under the ROC curve was calculated (molecule AUC) . For each target nucleophile, MRS outperformed all quantum chemical descriptors tested. Right, four example molecules are visualized with their protein and GSH reactivity predictions indicated by colored shading. From top to bottom, a TRPV1 antagonist metabolite,[46] a nitroso metabolite of 2-amino-1-methyl-6-phenylimidazo[4,5-b]pyridine (formed during cooking of meat),[47] a nicotine iminium ion metabolite (reactive with cyanide),[48] and a benoxaprofen acyl glucuronide metabolite (a nonsteroidal anti-inflammatory drug withdrawn due to hepatotoxicity).[49] Sites of reactivity are indicated by white circles.

Molecules Missed by Standard Screening Assays

Experimental studies with cyanide and GSH are traditional screening tools to trap hard and soft electrophilic reactive molecules, respectively, and thus provide potential insights on possible reactions between the reactive molecules and biological macromolecules.[16,17] Importantly, cyanide and GSH possess only a single type of nucleophilic site, while biologically relevant macromolecules often contain both hard and soft nucleophiles with an array of different structures and hence different chemistries. Consequently, nucleophilic trapping assays using cyanide and GSH may not adequately reflect all possible reactions observed within biological macromolecules and thus fail to detect potentially toxic electrophiles. We estimated how frequently molecules will react with macromolecules (protein and DNA) but not with trapping agents (GSH and cyanide) commonly used in experimental screens. We calculated the probability that each molecule will form adducts to either DNA or protein, but neither cyanide nor GSH. The DNA probability was computed by taking the product of its probabilities of reacting with DNA (MRSDNA) and not reacting with cyanide (1 – MRSCN) or GSH (1 – MRSGSH). The protein probability was computed by taking the product of its probabilities of reacting with protein (MRSPRO) and not reacting with cyanide (1 – MRSCN) or GSH (1 – MRSGSH). Next, we summed up the probabilities for all molecules with respect to each macromolecule. These scores are well scaled probabilities, so this sum is a valid estimate of the total number of molecules selectively reacting with each macromolecule and not the traditional nucleophilic traps, and hence those that would be missed by standard screening approaches. For the entire data set of 2803 molecules, this approach yielded totals of 257 and 227 molecules predicted to be reactive only with DNA and protein, respectively, and not cyanide or GSH. This finding suggests that in our data set 9.2% of DNA reactive molecules and 8.1% of protein reactive molecules would be missed by standard screening assays with cyanide and GSH. Though not the majority of molecules, this result suggests there are shortcomings of using small-molecule trapping agents to infer reactivity with macromolecules. Computationally modeling could fill a gap here, by identifying these problematic molecules when screening cannot (Figure ). The experimental validation of specific missed reactive molecules and these estimates is outside the scope of this study but is planned in our future work. Molecule reactivity scores (MRS) predict some molecules significantly more reactive with biological macromolecules than nucleophilic traps used in standard screening assays. Discordant predictions between the protein and GSH models (left), and between the DNA and cyanide models (right), are visualized. As indicated both by the colored shading (corresponding to atom reactivity scores) and by the MRS listed below, for each molecule the biological macromolecule (protein or DNA) prediction is significantly greater than the corresponding standard trapping agent (GSH or cyanide) prediction. Experimentally known sites of reactivity are labeled with white circles. Upper left pair: a metabolite of 6-nitrochrysene (MRSPRO: 0.42, MRSGSH: 0.29), an environmental pollutant.[50] Bottom left pair: N-acetoxy-sulfamethoxazole (MRSPRO: 0.51, MRSGSH: 0.38), a metabolite that may mediate the toxicity of the antibiotic sulfamethoxazole.[51] Upper right pair: a metabolite of lucidin-3-O-primeveroside (MRSDNA: 0.68, MRSCN: 0.26), a component of a food dye known to be carcinogenic in rats.[52] Bottom right pair: 7H-dibenzo(c,g)carbazole-3,4-dione (MRSDNA: 0.64, MRSCN: 0.30), an environmental multispecies carcinogen.[53]

Model Limitations

For predicting reactive metabolites, this study’s approach is limited by not modeling their metabolic activation, which may precede reactions with nucleophilic traps. Nevertheless, models for metabolic reactions are rapidly maturing as evidenced by our work[34,35,54−56] and others,[57−59] and it is conceivable that metabolism and reactivity models could be combined for a more biologically relevant prediction of risks poised by drugs and other compounds. An additional caveat is that some reactions in the database are likely missing reactive intermediates. For example, acyl glucuronides are predicted somewhat reactive, yet are known not to be reactive themselves. Instead, only after acyl migration and ring opening do acyl glucuronides form short-lived, reactive open-chain aldehyde intermediates that can lead to covalent binding to molecules.[60] There are likely other uncertainties where intermediate reactive metabolites are not present in the database, as they may be so reactive that they are too short-lived to be observed experimentally. Missing intermediates is a limitation of the data, but is both a strength and weakness of our method. On the plus side, we implicitly model some types of rearrangements that yield reactive species, and in doing so, expand the utility of the model. Nevertheless, this process leads to motifs that are not reactive themselves being predicted reactive. Therefore, we are actually modeling a combination of intrinsic reactivity and potential for covalent binding. A final caveat is that the domain of applicability of XenoSite is likely limited to drug-like molecules. We do not expect the model to generalize well to molecules outside of this domain, such as to inorganic molecules. Precise assessment of the domain of applicability is a critical question for future work. We plan to directly test this with prospective experiments, which is much more convincing than computational strategies for assessing the applicability domain.

Conclusion

This study establishes that SOR modeling accurately predicts a key driver of drug toxicity: covalent binding to DNA and protein. The XenoSite reactivity model identifies SOR within reactive molecules with AUC performances of 96.6%, 89.8%, 92.8%, and 94.4%, for cyanide, DNA, GSH, and protein, respectively. Compared to building separate models, collectively modeling reactivity for all four nucleophilic targets enabled knowledge transfer between tasks, improving SOR predictions for cyanide and protein. For cyanide, DNA, GSH, and protein, the model separates reactive and nonreactive molecules with, respectively, 90.3%, 78.7%, 77.7%, and 79.8% AUC. These predictions of molecular reactivity can highlight potentially toxic molecules that might be missed in the early stages of drug development. Ultimately, to accurately predict reactive metabolites, both metabolism and reactivity must be modeled. While there has been significant progress on metabolism modeling,[34,35,54−59,61] reactivity modeling has lagged far behind. By accurately modeling reactivity across a broad range of chemical space for biologically relevant macromolecules, such as DNA and protein, this study contributes a fundamental component of an integrative model of metabolism and reactivity.

Methods

Site of Reactivity Training Data

We assembled a training data set from the December 2014 release of the literature-derived Accelrys Metabolite Database (AMD). From 2489 total reactions, molecules were extracted based on reported reactions with cyanide, DNA, GSH, or protein. For each target nucleophile, the reactive atom(s) within each reactive molecule were marked based on the structures of the starting and product molecules, using an automated algorithm constructed using the RDKit python library.[62] Topologically equivalent atoms were found using the Pybel python library, and atoms equivalent to reactive atoms were themselves labeled as reactive.[63] The final data set included, in total, 1364 electrophilic molecules with their reactive atoms and SORs marked. This data set was composed of 51, 145, 1059, and 120 molecules known to be reactive with cyanide, DNA, GSH, or protein, respectively. For each of the four reactive nucleophilic targets, the atoms across the whole data set were labeled as reactive or nonreactive. Some atoms were marked reactive with more than one nucleophile. Metabolically related but nonreactive molecules were extracted from the reaction network of each reactive molecule. Metabolic sibling and parent molecules were extracted from this network, while excluding molecules themselves known to be reactive. We also included 63 naturally occurring[64] and known nonreactive[65] epoxides as nonreactive molecules. While epoxides are generally quite reactive, they can be stable in certain cases; however, a shortcoming of our previous method for predicting GSH reactivity was that it predicted all epoxides reactive.[20] These molecules were added to mitigate this shortcoming. A total of 1439 molecules were labeled nonreactive, with 103, 248, 1269, and 255 labeled nonreactive to cyanide, DNA, GSH, and protein, respectively. At the molecule-level, for each of the four reactive nucleophilic targets, molecules across the entire data set were labeled as reactive or nonreactive. Some molecules were marked reactive or nonreactive with more than one nucleophile. To enable replication of our work, we included all AMD molecule registry numbers in the Supporting Information, in addition to the SMILES strings of the nonreactive epoxides. Unfortunately, the AMD license precluded us from disseminating the rest of the molecule structures themselves.

External Site of Reactivity Test Data

After the training data were assembled, a new version of the AMD was released (June 2015) with several new reactions, which was used as an external data set. We filtered out all molecules already labeled as reactive in the training data. This process left only 14 new molecules that reacted with GSH and no new molecules reacting with cyanide, DNA, or protein. These 14 molecules were completely withheld from model training and optimization decisions, and were only tested with the final model.

Quantum Chemical Reactivity Indices

Several quantum chemical descriptors were calculated from self-consistent field computations with MOPAC, a quantum chemistry package, using the semiempirical method PM7 and the COSMO implicit solvent model.[66,67] These included descriptors that have been previously proposed as reactivity indices, on both the atom- and molecule-level (Table ).[68−70] Atom-level descriptors include the nucleophilic (DN(r)) and electrophilic (DE(r)) delocalizabilities, which are also known as fukui reactivity indices, as well as self-polarizability (πS(r)). The maximum of each of these (max[DN(r)], max[DE(r)], and max[πS(r)]) were used as molecule descriptors, which also included the energies of the lowest unoccupied and highest occupied molecular orbitals (ELUMO and EHOMO). The performances of the final reactivity model—which only used topological descriptors—were compared to those of the quantum chemical indices.

Topological Descriptors

The reactivity model used 15 molecule-level and 194 atom-level topological descriptors, each of which describes a chemical property. An in-house python script calculated these descriptors from the structure of each molecule. This script derives these descriptors from several sources, such as each molecule’s connectivity distance matrix, periodic table properties, or motif patterns defined by Pybel.[63] The majority of our topological descriptors have been shown to be useful for the XenoSite metabolism, reactivity, and epoxidation models.[20,34,35] In this study, we used an expanded set of topological descriptors, which slightly improved performance in comparison to the previous set of topological descriptors (Figure S1). The full list of descriptors is available in the Supporting Information (Tables S1 and S2), as well as a list of new topological descriptors added for this project.

Combined Atom- and Molecule-Level Reactivity Model

We collectively modeled reactivity to cyanide, DNA, GSH, and protein at both the atom- and molecule-level by building a deep neural network using an in-house machine learning python library. The architecture included a molecule layer, an input layer, two hidden layers, and two output layers (Figure ). The atom output layer computed atom-level SOR predictions against each nucleophile, while the molecule output layer computed molecule reactivity scores (MRS) for each nucleophile. The corresponding scores predicted the chances that a test molecule is reactive against that nucleophile.

Figure 5

The structure of the XenoSite reactivity model. This diagram illustrates how information flowed through the model, which consisted of one input layer, two hidden layers, and two output layers. By simultaneously modeling all types of reactivity, the model was able to transfer knowledge between related tasks, thereby improving performance substantially over independent models. The model computed atom-level predictions for reactivity to each of four nucleophilic targets: cyanide (ARSCN), DNA (ARSDNA), GSH (ARSGSH), and protein (ARSPRO), collectively referred to as atom reactivity scores (ARS). Additionally, the model computed molecule reactivity scores (MRS): MRSCN, MRSDNA, MRSGSH, and MRSPRO, which predicted the chances of a molecule’s reactivity to each of the four nucleophilic targets, respectively. From the structure of an input model χ, 15 molecule-level and 194 atom-level descriptors were calculated. Some chemically related descriptors, such as neighbor atom identities, were grouped in the first hidden layer (with 30 nodes). Grouped and ungrouped nodes were inputted into the second hidden layer (with 17 nodes), which outputted four atom-level scores. Finally, for each of the four nucleophilic targets, the respective MRS was computed from the top five ARS for each of the four nucleophilic targets, corresponding to the scores of the five atoms predicted within a molecule to be the most reactive to each nucleophile, as well as all molecule-level descriptors. The diagram is condensed and displays one representative molecule input node, five atom input nodes, and two nodes for each hidden layer. The molecule input node is a chemical structure; all other nodes are vectors of real numbers computed from nodes or layers from which there are incoming connections. We trained this network in two stages. First, the atom-level network was trained to produce atom reactivity scores (ARS). For this process, each atom was considered a possible SOR. A numerical vector was associated with each atom, which contained all of the descriptors for that atom. A binary target vector indicated if each atom was a SOR for each of the nucleophilic targets. For molecules of unknown reactivity against each nucleophilic target, the appropriate values were empty, neither one nor zero. Using gradient descent on the cross-entropy error, the weights of the network were trained such that SOR scored higher ARS than other atoms. Four ARS were produced for each atom, each ranging from zero to one, and representing the probability that the atom will be reactive with cyanide, DNA, GSH, or protein. Second, the molecule-output nodes were trained to compute MRS. For this stage, each molecule was considered possibly reactive, and a numerical vector of descriptors was associated with each molecule. Descriptors included all molecule-level descriptors, as well as the top five ARS for each of the four nucleophilic targets, corresponding to the scores of the five atoms predicted within a molecule to be the most reactive to each nucleophile. For each molecule, a binary target vector (with unknown components empty) indicated if the molecule was reactive to each of the nucleophilic targets. For each nucleophile, a logistic regressor found a scoring function that gave reactive molecules high MRS and nonreactive molecules low MRS, ranging from zero to one.

57 in total

1. Development of a peptide reactivity assay for screening contact allergens.

Authors: G Frank Gerberick; Jeff D Vassallo; Ruth E Bailey; Joel G Chaney; Steve W Morrall; Jean-Pierre Lepoittevin
Journal: Toxicol Sci Date: 2004-07-14 Impact factor: 4.849

2. Exposition and reactivity optimization to predict sites of metabolism in chemicals.

Authors: Gabriele Cruciani; Massimo Baroni; Paolo Benedetti; Laura Goracci; Cosimo Gianluca Fortuna
Journal: Drug Discov Today Technol Date: 2013

3. EaMEAD: Activation energy prediction of cytochrome P450 mediated metabolism with effective atomic descriptors.

Authors: Doo Nam Kim; Kwang-Hwi Cho; Won Seok Oh; Chang Joon Lee; Sung Kwang Lee; Jihoon Jung; Kyoung Tai No
Journal: J Chem Inf Model Date: 2009-07 Impact factor: 4.956

4. Peptide-based in vitro assay for the detection of reactive metabolites.

Authors: Michael D Mitchell; Mollisa M Elrick; Jennie L Walgren; Richard A Mueller; Dale L Morris; David C Thompson
Journal: Chem Res Toxicol Date: 2008-03-28 Impact factor: 3.739

5. An ion exchange liquid chromatography/mass spectrometry method for the determination of reduced and oxidized glutathione and glutathione conjugates in hepatocytes.

Authors: A F Loughlin; G L Skiles; D W Alberts; W H Schaefer
Journal: J Pharm Biomed Anal Date: 2001-08 Impact factor: 3.935

Review 6. The role of metabolic activation in drug-induced hepatotoxicity.

Authors: B Kevin Park; Neil R Kitteringham; James L Maggs; Munir Pirmohamed; Dominic P Williams
Journal: Annu Rev Pharmacol Toxicol Date: 2005 Impact factor: 13.820

7. Determination of lucidin-specific DNA adducts by liquid chromatography with tandem mass spectrometry in the livers and kidneys of rats given lucidin-3-O-primeveroside.

Authors: Yuji Ishii; Kaoru Inoue; Shinji Takasu; Meilan Jin; Kohei Matsushita; Ken Kuroda; Kiyoshi Fukuhara; Akiyoshi Nishikawa; Takashi Umemura
Journal: Chem Res Toxicol Date: 2012-04-30 Impact factor: 3.739

Review 8. The generation, detection, and effects of reactive drug metabolites.

Authors: Andrew V Stachulski; Thomas A Baillie; B Kevin Park; R Scott Obach; Deepak K Dalvie; Dominic P Williams; Abhishek Srivastava; Sophie L Regan; Daniel J Antoine; Christopher E P Goldring; Alvin J L Chia; Neil R Kitteringham; Laura E Randle; Hayley Callan; J Luis Castrejon; John Farrell; Dean J Naisbitt; Martin S Lennard
Journal: Med Res Rev Date: 2012-10-22 Impact factor: 12.944

9. Relationships of DNA adduct formation, K-ras activating mutations and tumorigenic activities of 6-nitrochrysene and its metabolites in the lungs of CD-1 mice.

Authors: E E Li; R H Heflich; T J Bucci; M G Manjanatha; B S Blaydes; K B Delclos
Journal: Carcinogenesis Date: 1994-07 Impact factor: 4.944

Modeling Reactivity to Biological Macromolecules with a Deep Multitask Network.

Introduction

Results and Discussion

Model Optimization

Accuracy at Predicting Sites of Reactivity

Accuracy at Identifying Reactive Molecules

Molecules Missed by Standard Screening Assays

Model Limitations

Conclusion

Methods

Site of Reactivity Training Data

External Site of Reactivity Test Data

Quantum Chemical Reactivity Indices

Topological Descriptors

Combined Atom- and Molecule-Level Reactivity Model

1. Development of a peptide reactivity assay for screening contact allergens.

2. Exposition and reactivity optimization to predict sites of metabolism in chemicals.

3. EaMEAD: Activation energy prediction of cytochrome P450 mediated metabolism with effective atomic descriptors.

4. Peptide-based in vitro assay for the detection of reactive metabolites.

5. An ion exchange liquid chromatography/mass spectrometry method for the determination of reduced and oxidized glutathione and glutathione conjugates in hepatocytes.

Review 6. The role of metabolic activation in drug-induced hepatotoxicity.

7. Determination of lucidin-specific DNA adducts by liquid chromatography with tandem mass spectrometry in the livers and kidneys of rats given lucidin-3-O-primeveroside.

Review 8. The generation, detection, and effects of reactive drug metabolites.

9. Relationships of DNA adduct formation, K-ras activating mutations and tumorigenic activities of 6-nitrochrysene and its metabolites in the lungs of CD-1 mice.

10. Predicting drug metabolism: a site of metabolism prediction tool applied to the cytochrome P450 2C9.

1. Deep Learning to Predict the Formation of Quinone Species in Drug Metabolism.

2. Computationally Assessing the Bioactivation of Drugs by N-Dealkylation.

3. Computational Approach to Structural Alerts: Furans, Phenols, Nitroaromatics, and Thiophenes.

4. A Time-Embedding Network Models the Ontogeny of 23 Hepatic Drug Metabolizing Enzymes.

Review 5. Deep Learning for Drug Design: an Artificial Intelligence Paradigm for Drug Discovery in the Big Data Era.

6. Pesticide application inhibit the microbial carbonic anhydrase-mediated carbon sequestration in a soil microcosm.

7. Site-Level Bioactivity of Small-Molecules from Deep-Learned Representations of Quantum Chemistry.

8. The Metabolic Rainbow: Deep Learning Phase I Metabolism in Five Colors.

9. Metabolic Forest: Predicting the Diverse Structures of Drug Metabolites.

10. XenoNet: Inference and Likelihood of Intermediate Metabolite Formation.