| Literature DB >> 35216217 |
Salvatore Galati1, Miriana Di Stefano1,2, Elisa Martinelli1, Marco Macchia1, Adriano Martinelli1, Giulio Poli1, Tiziano Tuccinardi1,3.
Abstract
The use of in silico toxicity prediction methods plays an important role in the selection of lead compounds and in ADMET studies since in vitro and in vivo methods are often limited by ethics, time, budget and other resources. In this context, we present our new web tool VenomPred, a user-friendly platform for evaluating the potential mutagenic, hepatotoxic, carcinogenic and estrogenic effects of small molecules. VenomPred platform employs several in-house Machine Learning (ML) models developed with datasets derived from VEGA QSAR, a software that includes a comprehensive collection of different toxicity models and has been used as a reference for building and evaluating our ML models. The results showed that our models achieved equal or better performance than those obtained with the reference models included in VEGA QSAR. In order to improve the predictive performance of our platform, we adopted a consensus approach combining the results of different ML models, which was able to predict chemical toxicity better than the single models. This improved method was thus implemented in the VenomPred platform, a freely accessible webserver that takes the SMILES (Simplified Molecular-Input Line-Entry System) strings of the compounds as input and sends the prediction results providing a probability score about their potential toxicity.Entities:
Keywords: artificial intelligence; carcinogenicity; estrogenicity; hepatoxicity; in silico toxicity; machine learning; mutagenicity
Mesh:
Substances:
Year: 2022 PMID: 35216217 PMCID: PMC8877213 DOI: 10.3390/ijms23042105
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Total number of molecules present in each training and test set, including their classification according to experimental value, was employed for model building and evaluation.
|
| |||
|
|
|
|
|
| Training | 3367 | 1883 | 1484 |
| Test | 798 | 446 | 352 |
|
| |||
|
|
|
|
|
| Training | 645 | 333 | 312 |
| Test | 161 | 89 | 72 |
|
| |||
|
|
|
|
|
| Training | 656 | 234 | 422 |
| Test | 150 | 54 | 96 |
|
| |||
|
|
|
|
|
| Training | 760 | 408 | 352 |
| Test | 157 | 97 | 60 |
Figure 1Evaluation results, expressed in terms of MCC, obtained for all different models developed for the (A) mutagenicity, (B) carcinogenicity, (C) estrogenicity and (D) hepatotoxicity endpoints. The 5 top-scored models for each endpoint are separated from the others by a black dashed line.
Figure 2Performance evaluation results, based on test set prediction, obtained for the 5 top-scored models of the (A) mutagenicity, (B) carcinogenicity, (C) estrogenicity and (D) hepatotoxicity endpoint, in comparison with the reference models included in VEGA.
Figure 3Statistical values obtained with the consensus strategy for the (A) mutagenicity, (B) carcinogenicity, (C) estrogenicity and (D) hepatotoxicity endpoints, compared with the reference models included in VEGA.
Figure 4Non-hepatotoxic compounds sharing a steroid scaffold misclassified by the reference model from VEGA and correctly predicted by our consensus approach.
Figure 5The graphical representation provided by VenomPred related to the confidence level of each prediction, derived from the Probability value.