| Literature DB >> 28316646 |
Prachi Pradeep1, Richard J Povinelli2, Shannon White3, Stephen J Merrill4.
Abstract
Quantitative structure activity relationships (QSARs) are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR predictions can be used for chemical risk assessment for protection of human and environmental health, which makes them interesting to regulators, especially in the absence of experimental data. For compatibility with regulatory use, QSAR models should be transparent, reproducible and optimized to minimize the number of false negatives. In silico QSAR tools are gaining wide acceptance as a faster alternative to otherwise time-consuming clinical and animal testing methods. However, different QSAR tools often make conflicting predictions for a given chemical and may also vary in their predictive performance across different chemical datasets. In a regulatory context, conflicting predictions raise interpretation, validation and adequacy concerns. To address these concerns, ensemble learning techniques in the machine learning paradigm can be used to integrate predictions from multiple tools. By leveraging various underlying QSAR algorithms and training datasets, the resulting consensus prediction should yield better overall predictive ability. We present a novel ensemble QSAR model using Bayesian classification. The model allows for varying a cut-off parameter that allows for a selection in the desirable trade-off between model sensitivity and specificity. The predictive performance of the ensemble model is compared with four in silico tools (Toxtree, Lazar, OECD Toolbox, and Danish QSAR) to predict carcinogenicity for a dataset of air toxins (332 chemicals) and a subset of the gold carcinogenic potency database (480 chemicals). Leave-one-out cross validation results show that the ensemble model achieves the best trade-off between sensitivity and specificity (accuracy: 83.8 % and 80.4 %, and balanced accuracy: 80.6 % and 80.8 %) and highest inter-rater agreement [kappa (κ): 0.63 and 0.62] for both the datasets. The ROC curves demonstrate the utility of the cut-off feature in the predictive ability of the ensemble model. This feature provides an additional control to the regulators in grading a chemical based on the severity of the toxic endpoint under study.Entities:
Keywords: Computational toxicology; Ensemble models; Hybrid QSAR models; In silico QSAR tools; Risk assessment
Year: 2016 PMID: 28316646 PMCID: PMC5034616 DOI: 10.1186/s13321-016-0164-0
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Prediction combination table with posterior probability, , for each combination number, , which represents a prediction combination from each of the four QSAR tools
| Combination number | Tool 1 | Tool 2 | Tool 3 | Tool 4 | Posterior probability |
|---|---|---|---|---|---|
|
| 0 | 0 | 0 | 0 |
|
|
| 0 | 0 | 0 | 1 |
|
|
| 0 | 0 | 1 | 0 |
|
|
| 0 | 0 | 1 | 1 |
|
|
| 0 | 1 | 0 | 0 |
|
|
| 0 | 1 | 0 | 1 |
|
|
| 0 | 1 | 1 | 0 |
|
|
| 0 | 1 | 0 | 1 |
|
|
| 0 | 1 | 1 | 1 |
|
|
| 1 | 0 | 0 | 0 |
|
|
| 1 | 0 | 0 | 1 |
|
|
| 1 | 0 | 1 | 0 |
|
|
| 1 | 0 | 1 | 1 |
|
|
| 1 | 1 | 0 | 0 |
|
|
| 1 | 1 | 0 | 1 |
|
|
| 1 | 1 | 1 | 1 |
|
Fig. 1Bayesian classifier ensemble for predicting carcinogenicity. The posterior probability, , as determined from Table 1 is compared with a variable cut-off between 0 and 1
Performance metrics for air toxins dataset
| Model | Accuracy (%) | SN (%) | SP (%) | BA (%) | PPV (%) | NPV (%) | Kappa ( |
|---|---|---|---|---|---|---|---|
| Toxtree | 75.56 | 68.18 | 79.51 | 73.85 | 64.10 | 82.32 | 0.47 |
| Lazar | 75.24 | 74.55 | 75.61 | 75.08 | 62.12 | 84.70 | 0.48 |
| Danish QSAR | 74.29 | 80.91 | 70.73 | 75.82 | 59.73 | 87.35 | 0.48 |
| OECD toolbox | 76.19 | 69.09 | 80.00 | 74.55 | 64.96 | 82.83 | 0.48 |
| Bayes ensemble | 83.81 | 70.00 | 91.22 | 80.61 | 81.05 | 85.00 | 0.63 |
| Bayes ensemble | 83.81 | 70.00 | 91.22 | 80.61 | 81.05 | 85.00 | 0.63 |
| Bayes ensemble | 82.22 | 65.45 | 91.22 | 78.34 | 80.00 | 83.11 | 0.59 |
Performance metrics for the CPDB dataset
| Model | Accuracy (%) | SN (%) | SP (%) | BA (%) | PPV (%) | NPV (%) | Kappa ( |
|---|---|---|---|---|---|---|---|
| Toxtree | 66.04 | 84.50 | 44.59 | 64.55 | 63.93 | 71.22 | 0.30 |
| Lazar | 80.63 | 86.05 | 74.32 | 80.19 | 79.57 | 82.09 | 0.61 |
| Danish QSAR | 65.00 | 91.09 | 34.68 | 62.89 | 61.84 | 77.00 | 0.27 |
| OECD toolbox | 64.79 | 84.50 | 41.89 | 63.20 | 62.82 | 69.93 | 0.27 |
| Bayes ensemble | 81.04 | 83.33 | 75.23 | 79.28 | 80.14 | 82.27 | 0.62 |
| Bayes ensemble | 80.21 | 84.50 | 75.23 | 79.87 | 79.85 | 80.68 | 0.60 |
| Bayes ensemble | 80.42 | 84.50 | 77.03 | 80.77 | 80.83 | 79.91 | 0.61 |
Fig. 2Receiver operator characteristics (ROC) curve of Bayes ensemble model as compared to other QSAR tools. The Bayes model at different thresholds is depicted by red points, at 0.5 cut-off by green point and the base QSAR tools by blue points. The ROC plot for the Bayes ensemble model is depicted by the red dotted line. a Air toxins dataset, b CPDB dataset