| Literature DB >> 35328472 |
Domenico Gadaleta1, Nicoleta Spînu2, Alessandra Roncaglioni1, Mark T D Cronin2, Emilio Benfenati1.
Abstract
Developmental and adult/ageing neurotoxicity is an area needing alternative methods for chemical risk assessment. The formulation of a strategy to screen large numbers of chemicals is highly relevant due to potential exposure to compounds that may have long-term adverse health consequences on the nervous system, leading to neurodegeneration. Adverse Outcome Pathways (AOPs) provide information on relevant molecular initiating events (MIEs) and key events (KEs) that could inform the development of computational alternatives for these complex effects. We propose a screening method integrating multiple Quantitative Structure-Activity Relationship (QSAR) models. The MIEs of existing AOP networks of developmental and adult/ageing neurotoxicity were modelled to predict neurotoxicity. Random Forests were used to model each MIE. Predictions returned by single models were integrated and evaluated for their capability to predict neurotoxicity. Specifically, MIE predictions were used within various types of classifiers and compared with other reference standards (chemical descriptors and structural fingerprints) to benchmark their predictive capability. Overall, classifiers based on MIE predictions returned predictive performances comparable to those based on chemical descriptors and structural fingerprints. The integrated computational approach described here will be beneficial for large-scale screening and prioritisation of chemicals as a function of their potential to cause long-term neurotoxic effects.Entities:
Keywords: QSAR; adverse outcome pathways; molecular initiating events; neurotoxicity
Mesh:
Year: 2022 PMID: 35328472 PMCID: PMC8954925 DOI: 10.3390/ijms23063053
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
External validation of QSAR models for MIEs based on ChEMBL data. For each MIE predicting QSAR the average number of true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN) were reported. The metrics for evaluating the predictivity of the models were sensitivity (SEN), specificity (SPE), balanced accuracy (BA), Matthew’s correlation coefficient (MCC) and area under the ROC curve (AUC). Performance is the average of metrics obtained over 100 different training-test splits.
| MIE | TP | FP | TN | FN | SEN | SPE | BA | MCC | AUC |
|---|---|---|---|---|---|---|---|---|---|
| AChE | 555.6 | 63.0 | 887.2 | 68.0 | 0.89 | 0.93 | 0.91 | 0.83 | 0.96 |
| AMPAR | 13.4 | 20.0 | 651.0 | 1.2 | 0.92 | 0.97 | 0.94 | 0.60 | 0.99 |
| CAR | 9.0 | 6.8 | 668.6 | 1.2 | 0.88 | 0.99 | 0.94 | 0.72 | 0.95 |
| CYP2E1 | 4.0 | 35.2 | 645.2 | 1.0 | 0.80 | 0.95 | 0.87 | 0.29 | 0.90 |
| GABAR | 20.0 | 11.0 | 649.0 | 6.0 | 0.77 | 0.98 | 0.88 | 0.69 | 0.96 |
| KAR | 4.4 | 17.4 | 663.0 | 0.6 | 0.88 | 0.97 | 0.93 | 0.42 | 0.97 |
| NADHOX | 15.2 | 4.4 | 665.4 | 0.4 | 0.97 | 0.99 | 0.98 | 0.87 | 1.00 |
| NIS | 11.0 | 0.6 | 673.4 | 0.4 | 0.97 | 1.00 | 0.98 | 0.96 | 1.00 |
| NMDAR | 50.0 | 27.8 | 604.4 | 3.4 | 0.94 | 0.96 | 0.95 | 0.75 | 0.98 |
| PXR | 35.8 | 35.8 | 601.8 | 13.0 | 0.73 | 0.94 | 0.84 | 0.57 | 0.92 |
| RYR | 11.0 | 0.6 | 673.6 | 0.2 | 0.98 | 1.00 | 0.99 | 0.96 | 0.99 |
| THRα | 60.0 | 23.4 | 599.2 | 2.8 | 0.96 | 0.96 | 0.96 | 0.81 | 0.99 |
| THRβ | 110.2 | 37.8 | 500.0 | 38.4 | 0.74 | 0.93 | 0.84 | 0.67 | 0.93 |
| TTR | 14.8 | 44.0 | 624.0 | 3.8 | 0.80 | 0.93 | 0.87 | 0.42 | 0.94 |
| VGSC | 28.4 | 12.8 | 639.2 | 5.0 | 0.85 | 0.98 | 0.92 | 0.76 | 0.97 |
Performance of the three classifiers (kNN, RF, NNET) using MIE predictions, chemical descriptors, and extended fingerprints as independent variables. For each method, the average number of true positives (TP), false positives (FP), true negatives (TN) false negatives (FN) and not classified (NC) were reported. The metrics to evaluate the predictivity of the models were sensitivity (SEN), specificity (SPE), balanced accuracy (BA), Matthew’s correlation coefficient (MCC), and area under the ROC curve (AUC). Performance is the average of five-fold cross-validation results obtained over 500 iterations (100 fold-splitting procedures and five parameter combinations).
| Classifier | Variable | TP | FP | TN | FN | NC | SEN | SPE | BA | MCC | AUC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| K-NN | MIE predictions | 30.5 | 11.4 | 19.6 | 7.5 | 0.0 | 0.80 | 0.63 | 0.72 | 0.44 | 0.76 |
| Descriptors | 29.5 | 11.5 | 19.5 | 8.5 | 0.0 | 0.78 | 0.63 | 0.70 | 0.41 | 0.76 | |
| Fingerprints | 14.6 | 2.3 | 28.5 | 22.2 | 1.4 | 0.40 | 0.92 | 0.66 | 0.37 | 0.75 | |
| MLP-NNET | MIE predictions | 29.4 | 9.4 | 21.6 | 8.6 | 0.0 | 0.77 | 0.70 | 0.74 | 0.47 | 0.78 |
| Descriptors | 30.2 | 9.8 | 21.2 | 7.8 | 0.0 | 0.79 | 0.68 | 0.74 | 0.48 | 0.79 | |
| Fingerprints | 28.1 | 12.4 | 18.6 | 9.9 | 0.0 | 0.74 | 0.60 | 0.67 | 0.34 | 0.69 | |
| RF | MIE predictions | 31.1 | 11.6 | 19.2 | 6.4 | 0.7 | 0.83 | 0.62 | 0.73 | 0.47 | 0.77 |
| Descriptors | 32.9 | 6.4 | 24.4 | 4.9 | 0.4 | 0.87 | 0.79 | 0.83 | 0.66 | 0.91 | |
| Fingerprints | 32.9 | 11.9 | 18.8 | 4.8 | 0.5 | 0.87 | 0.61 | 0.74 | 0.51 | 0.80 |
Figure 1Distribution of balanced accuracies calculated among the various QSARs developed to predict neurotoxic potential. Balanced accuracies are grouped based on the algorithm used: (a) k-Nearest Neighbours; (b) Random Forest; (c) Neural Network. Blue bars refer to models developed based on MIE predictions, red bars refer to models based on DRAGON descriptors, and yellow bars refer to models based on Extended Fingerprints. Dashed lines indicate the mean accuracy value achieved by each group of models.
Molecular Initiating Events associated with Developmental Neurotoxicity, adapted from Spînu et al. [45] and Li et al. [27].
| ID | MIE | Target | Reference |
|---|---|---|---|
| A | Binding of agonist, Ionotropic glutamate receptors | Glutamate [NMDA] receptor | [ |
| A | Binding of agonist, Ionotropic glutamate receptors | Glutamate receptor ionotropic kainate | [ |
| A | Binding of agonist, Ionotropic glutamate receptors | Glutamate receptor ionotropic AMPA | [ |
| B | Binding of antagonist, NMDA receptors | Glutamate [NMDA] receptor | [ |
| C | Binding of inhibitor, NADH-ubiquinone oxidoreductase (complex I) | Mitochondrial complex I (NADH dehydrogenase) | [ |
| D | Binding, SH/SeH proteins involved in protection against oxidative stress | Aspecific1 | [ |
| E | CYP2E1 Activation | Cytochrome P450 2E1 | [ |
| F | Inhibition, Na+/I− symporter (NIS) | Sodium/iodide cotransporter | [ |
| G | Thyroperoxidase, Inhibition | Thyroid peroxidase 1 | [ |
| H | Protein Adduct Formation | Aspecific 2 | [ |
| I | Binding of inhibitors to acetylcholinesterase (AChE) | Acetylcholinesterase | [ |
| L | Binding of non-dioxin-like polychlorinated biphenyls with ryanodine receptor (RyR) | Ryanodine receptors 1, 2 and 3 | [ |
| M | Interaction uncouplers with oxidative phosphorylation | Aspecific 3 | [ |
| N | Binding of redox cycling chemicals with NADH-quinone oxidoreductase | Mitochondrial complex I (NADH dehydrogenase) | [ |
| O | Binding of redox cycling chemicals with NADH cytochrome b5 reductase | NADH-cytochrome b5 reductase | [ |
| P | Xenobiotic nuclear receptor activation | Pregnane X receptor | [ |
| P | Xenobiotic nuclear receptor activation | Nuclear receptor subfamily 1 group I member 3 (Constitutive Androstane Receptor) | [ |
| Q | Interference with thyroid serum binding protein | Transthyretin | [ |
| R | Deiodinase inhibition | Deiodinase 4 | [ |
| S | Thyroid receptor binding | Thyroid hormone receptor beta | [ |
| S | Thyroid receptor binding | Thyroid hormone receptor alpha | [ |
| T | Thyroid hormone transporter interference | Monocarboxylate transporter 8 4 | [ |
| T | Thyroid hormone transporter interference | Monocarboxylate transporter 10 4 | [ |
| T | Thyroid hormone transporter interference | Solute carrier organic anion transporter family member 1C1 4 | [ |
| U | Binding of pyrethroids to voltage-gated sodium channels (VGSC) | Sodium channel protein type N alpha subunit | [ |
| V | Binding of antagonist to γ-aminobutyric acid receptor GABAAR | GABA-A receptor; alpha-1/beta-2/gamma-2 | [ |
1 No data found in ChEMBL, QSAR from Gadaleta et al. [68] was used. 2 Replaced with the use of reactivity SMARTS [69]. 3 No specific targets, not considered for modelling. 4 No data found in ChEMBL, not considered for modelling.
List of endpoints modelled using ChEMBL data. For each endpoint, the reference MIE, ChEMBL ID relative to the molecular target, species, and composition of the Training and Test sets are reported; ACT is the number of active compounds, while INA is the number of inactive compounds.
| Target | Code | CheMBL ID | Species | MIE | ACT | INA |
|---|---|---|---|---|---|---|
| Acetylcholinesterase | AChE | CHEMBL220 | Human | I | 3076 | 4793 |
| Glutamate receptor ionotropic AMPA | AMPAR | CHEMBL2096670 | Human | A | 73 | 3355 |
| Nuclear receptor subfamily 1 group I member 3 (Constitutive Androstane Receptor) | CAR | CHEMBL5503 | Human | P | 51 | 3377 |
| Cytochrome P450 2E1 | CYP2E1 | CHEMBL5281 | Human | E | 25 | 3402 |
| GABA-A receptor; alpha-1/beta-2/gamma-2 | GABAR | CHEMBL2095172 | Human | V | 129 | 3298 |
| Glutamate receptor ionotropic kainate | KAR | CHEMBL2109241 | Human | A | 25 | 3402 |
| Mitochondrial complex I (NADH dehydrogenase) | NADHOX | CHEMBL614865 | Bos taurus | C, N | 78 | 3349 |
| Sodium/iodide cotransporter | NIS | CHEMBL2331047 | Human | F | 56 | 3371 |
| Glutamate [NMDA] receptor | NMDAR | CHEMBL2094124 | Human | A, B | 267 | 3161 |
| Pregnane X receptor | PXR | CHEMBL3401 | Human | P | 244 | 3188 |
| Ryanodine receptors 1 | RYR | CHEMBL2062 | Human | L | 56 | 3371 |
| Thyroid hormone receptor alpha | THRα | CHEMBL1860 | Human | S | 311 | 3116 |
| Thyroid hormone receptor beta | THRβ | CHEMBL1947 | Human | S | 728 | 2704 |
| Transthyretin | TTR | CHEMBL3194 | Human | Q | 93 | 3340 |
| Sodium channel protein type N alpha subunit 2 | VGSC | CHEMBL1845 | Human | U | 167 | 3260 |
1 All the three isoforms of RYR were considered. 2 Isoforms 1, 2, 3 and 6 were considered.
Figure 2Modeling workflow. The colours of the various blocks refer to the paragraph in Materials and Methods that describes the specific steps of the workflow. Data from ChEMBL for 15 targets relevant for the MIEs of neurotoxicity (red) were classified based on the threshold pChEMBL = 5; negative samples were enriched with data using “>” and “≥” qualifiers and with chemicals from other MIE data that were treated as decoys. QSARs for MIEs (blue) were developed from these datasets using the BRF method. Datasets were iteratively partitioned into training and test sets and their external performance was calculated as the average of the various iterations; then, the models were retrained on the whole datasets. Neurotoxicity data (green) were retrieved from [76] and curated at the SMILES level. Predictions from the thyroperoxidase model (violet) by [72] and reactivity SMARTS (cyan) by [75] were combined with the predictions from the 15 MIE modes and used as independent variables to develop neurotoxicity QSAR models (orange). kNN, RF, and NNET were used to develop models. The use of MIE predictions as independent variables was benchmarked with fingerprints and DRAGON descriptors; then, the performance of the obtained models was compared with five-fold cross validation.