| Literature DB >> 27120770 |
Sergii Novotarskyi1, Ahmed Abdelaziz2,3, Yurii Sushko1, Robert Körner1, Joachim Vogt1, Igor V Tetko4,5.
Abstract
The ToxCast EPA challenge was managed by TopCoder in Spring 2014. The goal of the challenge was to develop a model to predict the lowest effect level (LEL) concentration based on in vitro measurements and calculated in silico descriptors. This article summarizes the computational steps used to develop the Rank-I model, which calculated the lowest prediction error for the secret test data set of the challenge. The model was developed using the publicly available Online CHEmical database and Modeling environment (OCHEM), and it is freely available at http://ochem.eu/article/68104 . Surprisingly, this model does not use any in vitro measurements. The logic of the decision steps used to develop the model and the reason to skip inclusion of in vitro measurements is described. We also show that inclusion of in vitro assays would not improve the accuracy of the model.Entities:
Mesh:
Year: 2016 PMID: 27120770 PMCID: PMC5413193 DOI: 10.1021/acs.chemrestox.5b00481
Source DB: PubMed Journal: Chem Res Toxicol ISSN: 0893-228X Impact factor: 3.739
Number of Descriptors and Models’ Accuracy for the Prediction of the Test Set Compounds
| RMSE | ||||
|---|---|---|---|---|
| descriptor set | number of selected descriptors | whole
test set ( | inside of AD | outside of AD ( |
| CDK | 159 | 1.13 | 1.01 | 2.4 |
| Dragon | 1824 | 1.15 | 1.05 | 2.4 |
| Fragmentor | 631 | 1.18 | 1.04 | 2.7 |
| GSFrag | 202 | 1.1 | 0.97 | 2.5 |
| Mera, Mersy | 242 | 1.04 | 0.96 | 2.1 |
| Chemaxon | 97 | 1.16 | 1.06 | 2.4 |
| Inductive | 39 | 1.17 | 1.03 | 2.7 |
| Adriana | 133 | 1.14 | 1.01 | 2.5 |
| QNPR | 381 | 1.12 | 1.02 | 2.7 |
| E-state | 185 | 1.16 | 1 | 2.8 |
| 143 | 1.21 | 1.11 | 2.5 | |
| Consensus | 4036 | 1.08 | 0.96 | 2.5 |
AD is the applicability domain of the model as defined by OCHEM[8] (see also ref (20)).
Summary of the Performance of the Top-Ranked Models of the EPA ToxCast Challenge
| test
set | ||||||||
|---|---|---|---|---|---|---|---|---|
| training
set ( | provisional
subset ( | final
subset ( | full, | |||||
| model | RMSE | RMSE | rank | RMSE | rank | RMSE | ||
| novserj | 0.88 ± 0.04 | 0.27 ± 0.04 | 1.03 ± 0.08 | 8 | 1.12 ± 0.08 | 0.31 | 1 | 1.08 ± 0.07 |
| NobuMiu | 1.03 | 9 | 1.13 | 0.30 | 2 | 1.09 | ||
| a9108tc | 1.05 | 16 | 1.13 | 0.29 | 3 | 1.10 | ||
| klo86 min | 1.09 | 27 | 1.14 | 0.29 | 4 | 1.12 | ||
| 0.97 ± 0.04 | 0.11 ± 0.03 | 1.24 ± 0.09 | ||||||
| MW + NC | 0.97 ± 0.04 | 0.11 ± 0.03 | 1.18 ± 0.08 | |||||
Prediction accuracy for the “out-of-the-bag” samples.
Confidence intervals were estimated using the subsets, which were sampled from the training set, and each had the same size as the respective test set (see for more details ref (23)).
Best model based on the in vitro assay descriptors developed using the LibSVM method (see also Table S1).
Model based on molecular weight (MW) and number of carbon atoms (NC) developed using the same approach as the above in vitro model.
Performances of Models Developed Using Different Descriptor Selection Proceduresa
| unsupervised
selection | neural
network pruning | |||||
|---|---|---|---|---|---|---|
| RMSE | RMSE | |||||
| descriptor set | training | test | training | test | ||
| CDK | 159 | 0.93 | 1.13 | 6 | 0.89 | 1.2 |
| Dragon | 1824 | 0.93 | 1.15 | 18 | 0.87 | 1.19 |
| Fragmentor | 631 | 0.98 | 1.18 | 12 | 0.92 | 1.21 |
| GSFrag | 202 | 0.97 | 1.1 | 24 | 0.97 | 1.18 |
| Mera, Mersy | 242 | 0.93 | 1.04 | 10 | 0.93 | 1.18 |
| Chemaxon | 97 | 0.93 | 1.16 | 11 | 0.92 | 1.16 |
| Inductive | 39 | 0.94 | 1.17 | 21 | 0.93 | 1.16 |
| Adriana | 133 | 0.93 | 1.14 | 8 | 0.92 | 1.1 |
| QNPR | 381 | 0.95 | 1.12 | 74 | 0.89 | 1.13 |
| E-state | 185 | 0.96 | 1.16 | 11 | 0.9 | 1.24 |
| Consensus | 4036 | 0.88 | 1.08 | 186 | 0.85 | 1.13 |
N is the number of descriptors selected to develop the respective model. RMSE is the root mean squared error calculated for the training (n = 483) and full test set (n = 143).