| Literature DB >> 31459245 |
Kunal Roy1, Pravin Ambure1, Supratik Kar2.
Abstract
Quantitative structure-activity relationship (QSAR) models have long been used for making predictions and data gap filling in diverse fields including medicinal chemistry, predictive toxicology, environmental fate modeling, materials science, agricultural science, nanoscience, food science, and so forth. Usually a QSAR model is developed based on chemical information of a properly designed training set and corresponding experimental response data while the model is validated using one or more test set(s) for which the experimental response data are available. However, it is interesting to estimate the reliability of predictions when the model is applied to a completely new data set (true external set) even when the new data points are within applicability domain (AD) of the developed model. In the present study, we have categorized the quality of predictions for the test set or true external set into three groups (good, moderate, and bad) based on absolute prediction errors. Then, we have used three criteria [(a) mean absolute error of leave-one-out predictions for 10 most close training compounds for each query molecule; (b) AD in terms of similarity based on the standardization approach; and (c) proximity of the predicted value of the query compound to the mean training response] in different weighting schemes for making a composite score of predictions. It was found that using the most frequently appearing weighting scheme 0.5-0-0.5, the composite score-based categorization showed concordance with absolute prediction error-based categorization for more than 80% test data points while working with 5 different datasets with 15 models for each set derived in three different splitting techniques. These observations were also confirmed with true external sets for another four endpoints suggesting applicability of the scheme to judge the reliability of predictions for new datasets. The scheme has been implemented in a tool "Prediction Reliability Indicator" available at http://dtclab.webs.com/software-tools and http://teqip.jdvu.ac.in/QSAR_Tools/DTCLab/, and the tool is presently valid for multiple linear regression models only.Entities:
Year: 2018 PMID: 31459245 PMCID: PMC6645132 DOI: 10.1021/acsomega.8b01647
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Figure 1Snapshot of the developed software Prediction Reliability Indicator.
Figure 2Schematic diagram of workflow of the analysis.
Results for CDK Dataset (Model Dataset 1) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| model | division method | MAE95%Train | MAE95%Test | correct prediction in % | training set range | number
of compounds outside AD | |||
|---|---|---|---|---|---|---|---|---|---|
| 1 | sorted response | 0.74 | 0.71 | 0.15 | 0.73 | 0.14 | 87.84 | 2.396 | 0 |
| 2 | 0.71 | 0.69 | 0.14 | 0.69 | 0.13 | 86.49 | 0 | ||
| 3 | 0.68 | 0.66 | 0.14 | 0.68 | 0.13 | 85.14 | 1 (122) | ||
| 4 | 0.70 | 0.67 | 0.14 | 0.67 | 0.13 | 85.14 | 4 (118, 119, 122, 123) | ||
| 5 | 0.63 | 0.61 | 0.17 | 0.65 | 0.17 | 77.02 | 0 | ||
| 6 | Kennard–Stone | 0.78 | 0.75 | 0.14 | 0.63 | 0.13 | 85.29 | 2.342 | 0 |
| 7 | 0.77 | 0.75 | 0.14 | 0.59 | 0.14 | 79.41 | 1 (195) | ||
| 8 | 0.74 | 0.71 | 0.14 | 0.54 | 0.13 | 89.71 | 0 | ||
| 9 | 0.72 | 0.70 | 0.14 | 0.56 | 0.12 | 83.82 | 0 | ||
| 10 | 0.71 | 0.68 | 0.14 | 0.54 | 0.13 | 86.76 | 0 | ||
| 11 | modified- | 0.74 | 0.71 | 0.13 | 0.70 | 0.17 | 86.67 | 2.397 | 0 |
| 12 | 0.69 | 0.66 | 0.13 | 0.65 | 0.14 | 89.33 | 3 (30, 38, 39) | ||
| 13 | 0.69 | 0.66 | 0.13 | 0.65 | 0.14 | 88 | 0 | ||
| 14 | 0.70 | 0.68 | 0.14 | 0.68 | 0.13 | 92 | 0 | ||
| 15 | 0.64 | 0.61 | 0.16 | 0.65 | 0.14 | 88 | 0 |
AD using standardization technique is used and compound ID mentioned under parenthesis.
NTraining = 154, NTest = 74.
NTraining = 156, NTest = 68.
NTraining = 149, NTest = 75.
Results for PBT Index of Chemicals (Model Dataset 5) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| model | division method | MAE95%Train | MAE95%Test | correct prediction in % | training set range | number
of compounds outside AD | |||
|---|---|---|---|---|---|---|---|---|---|
| 1 | sorted response | 0.88 | 0.87 | 0.36 | 0.92 | 0.37 | 91.67 | 8.1 | 1 (189) |
| 2 | 0.90 | 0.89 | 0.33 | 0.95 | 0.27 | 100 | 2 (41, 189) | ||
| 3 | 0.89 | 0.88 | 0.35 | 0.94 | 0.30 | 100 | 2 (14, 189) | ||
| 4 | 0.89 | 0.88 | 0.34 | 0.93 | 0.34 | 91.67 | 1 (14) | ||
| 5 | 0.89 | 0.88 | 0.34 | 0.94 | 0.32 | 88.89 | 4 (14, 41, 189, 206) | ||
| 6 | Kennard–Stone | 0.91 | 0.90 | 0.35 | 0.87 | 0.34 | 95.56 | 8.1 | 2 (211, 212) |
| 7 | 0.90 | 0.89 | 0.34 | 0.88 | 0.29 | 95.56 | 0 | ||
| 8 | 0.91 | 0.90 | 0.34 | 0.88 | 0.34 | 95.56 | 0 | ||
| 9 | 0.91 | 0.90 | 0.34 | 0.88 | 0.31 | 95.56 | 0 | ||
| 10 | 0.92 | 0.91 | 0.33 | 0.88 | 0.31 | 95.56 | 0 | ||
| 11 | modified- | 0.91 | 0.90 | 0.32 | 0.84 | 0.45 | 88.64 | 7.24 | 3 (11, 25, 189) |
| 12 | 0.90 | 0.89 | 0.32 | 0.87 | 0.40 | 86.36 | 2 (11, 25) | ||
| 13 | 0.92 | 0.91 | 0.31 | 0.86 | 0.43 | 93.18 | 2 (11, 25) | ||
| 14 | 0.90 | 0.89 | 0.32 | 0.88 | 0.39 | 93.18 | 0 | ||
| 15 | 0.92 | 0.91 | 0.31 | 0.86 | 0.43 | 93.18 | 2 (11, 25) |
AD using standardization technique is used and compound ID mentioned under parenthesis.
NTraining = 144, NTest = 36.
NTraining = 135, NTest = 45.
NTraining = 136, NTest = 44.
Results for Refractive Index of Polymers Dataset (True External Dataset 1) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| training
set | test
set | true
external test set | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| model | MAE95%Train | MAE95%Test | correct prediction in % | number of compounds outside AD | MAE95%Test | correct prediction in % | number of compounds
outside AD | ||||
| 1 | 0.90 | 0.88 | 0.01 | 0.88 | 0.01 | 98.51 | 1 (143) | 0.87 | 0.01 | 94.90 | 0 |
| 2 | 0.91 | 0.90 | 0.01 | 0.89 | 0.01 | 97.01 | 1 (1) | 0.87 | 0.01 | 91.84 | 5 (319, 333, 334, 339, 340) |
| 3 | 0.90 | 0.89 | 0.01 | 0.89 | 0.01 | 97.01 | 2 (1, 143) | 0.87 | 0.01 | 94.90 | 6 (319, 331, 333, 334, 339, 340) |
| 4 | 0.90 | 0.88 | 0.01 | 0.90 | 0.01 | 97.01 | 2 (1, 185) | 0.88 | 0.01 | 94.90 | 7 (319, 331, 333, 334, 339, 341) |
| 5 | 0.90 | 0.89 | 0.01 | 0.90 | 0.01 | 98.51 | 2 (1, 143) | 0.88 | 0.01 | 93.88 | 0 |
AD using standardization technique is used and compound ID mentioned under parenthesis; division method: Kennard–Stone; NTraining = 154, NTest = 67, NTrue-External-Test = 98.
Results forf Sweetness Potency of Organic Molecules (True External Dataset 4) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| training
set | test
set | true
external test set | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| model | MAE95%Train | MAE95%Test | correct prediction in % | number of compounds outside AD | MAE95%Test | correct prediction in % | number of compounds
outside AD | ||||
| 1 | 0.84 | 0.82 | 0.40 | 0.87 | 0.40 | 93.75 | 1 (228) | 0.74 | 0.59 | 80 | 0 |
| 2 | 0.85 | 0.83 | 0.43 | 0.87 | 0.42 | 95 | 2 (137, 228) | 0.75 | 0.63 | 83.33 | 0 |
| 3 | 0.71 | 0.69 | 0.59 | 0.75 | 0.57 | 73.75 | 2 (137, 228) | 0.64 | 0.79 | 75 | 0 |
| 4 | 0.86 | 0.85 | 0.39 | 0.83 | 0.48 | 93.75 | 1 (228) | 0.69 | 0.66 | 73.33 | 0 |
| 5 | 0.85 | 0.83 | 0.39 | 0.85 | 0.40 | 92.5 | 1 (228) | 0.79 | 0.56 | 86.67 | 0 |
AD using standardization technique is used and compound ID mentioned under parenthesis; division method: activity sorted; NTraining = 160, NTest = 80, NTrue-External-Test = 60.
Results for AChE Dataset (Model Dataset 2) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| model | division method | MAE95%Train | MAE95%Test | correct prediction in % | training set range | number
of compounds outside AD | |||
|---|---|---|---|---|---|---|---|---|---|
| 1 | sorted response | 0.68 | 0.65 | 0.49 | 0.58 | 0.55 | 87.32 | 7.82 | 10 (13, 336–338, 340, 342–344, 347, 360) |
| 2 | 0.67 | 0.64 | 0.49 | 0.61 | 0.52 | 85.92 | 5 (242–244, 323, 360) | ||
| 3 | 0.64 | 0.60 | 0.46 | 0.52 | 0.53 | 86.62 | 11 (32, 33, 35, 336–338, 340, 342–344, 347) | ||
| 4 | 0.52 | 0.47 | 0.48 | 0.56 | 0.52 | 85.21 | 15 (32, 33, 242–244, 323, 336–338, 340, 342–344, 347, 368) | ||
| 5 | 0.64 | 0.60 | 0.46 | 0.53 | 0.55 | 86.62 | 14 (32, 33, 35, 174, 218, 323, 336–338, 340, 342–344, 347) | ||
| 6 | Kennard–Stone | 0.71 | 0.69 | 0.49 | 0.48 | 0.58 | 84.38 | 7.82 | 1 (201) |
| 7 | 0.69 | 0.67 | 0.51 | 0.48 | 0.58 | 91.41 | 1 (201) | ||
| 8 | 0.74 | 0.71 | 0.47 | 0.53 | 0.56 | 91.41 | 1 (1) | ||
| 9 | 0.69 | 0.67 | 0.51 | 0.50 | 0.59 | 85.94 | 1 (1) | ||
| 10 | 0.70 | 0.68 | 0.48 | 0.53 | 0.57 | 88.26 | 0 | ||
| 11 | modified- | 0.68 | 0.65 | 0.48 | 0.63 | 0.51 | 86.62 | 7.76 | 8 (177–179, 203, 219, 246, 310, 311) |
| 12 | 0.68 | 0.65 | 0.49 | 0.61 | 0.52 | 87.32 | 8 (177–179, 203, 219, 246, 310, 311) | ||
| 13 | 0.66 | 0.63 | 0.49 | 0.60 | 0.53 | 85.92 | 7 (177–179, 203, 219, 310, 311) | ||
| 14 | 0.67 | 0.65 | 0.50 | 0.55 | 0.58 | 85.21 | 8 (177–179, 203, 219, 246, 310, 311) | ||
| 15 | 0.65 | 0.62 | 0.50 | 0.58 | 0.56 | 87.32 | 1 (246) |
AD using standardization technique is used and compound ID mentioned under parenthesis.
NTraining = 284, NTest = 142.
NTraining = 284, NTest = 142.
NTraining = 284, NTest = 142.
Results for C60 Solubility in Organic Solvents Dataset (Model Dataset 3) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| model | division method | MAE95%Train | MAE95%Test | correct prediction in % | training set range | number of compounds outside AD | |||
|---|---|---|---|---|---|---|---|---|---|
| 1 | sorted response | 0.87 | 0.85 | 0.34 | 0.85 | 0.33 | 97.87 | 6.97 | 1 (128) |
| 2 | 0.88 | 0.85 | 0.33 | 0.84 | 0.34 | 95.74 | 1 (128) | ||
| 3 | 0.88 | 0.87 | 0.33 | 0.81 | 0.33 | 91.49 | 1 (85) | ||
| 4 | 0.88 | 0.86 | 0.34 | 0.85 | 0.32 | 95.74 | 1 (21) | ||
| 5 | 0.87 | 0.86 | 0.33 | 0.81 | 0.36 | 95.74 | 1 (128) | ||
| 6 | Kennard–Stone | 0.85 | 0.84 | 0.33 | 0.88 | 0.35 | 100 | 6.97 | 0 |
| 7 | 0.86 | 0.85 | 0.33 | 0.87 | 0.35 | 97.87 | 0 | ||
| 8 | 0.86 | 0.85 | 0.32 | 0.86 | 0.34 | 95.74 | 2 (87, 148) | ||
| 9 | 0.79 | 0.77 | 0.37 | 0.84 | 0.39 | 95.74 | 0 | ||
| 10 | 0.86 | 0.85 | 0.33 | 0.87 | 0.34 | 95.74 | 0 | ||
| 11 | modified- | 0.84 | 0.83 | 0.36 | 0.82 | 0.32 | 93.62 | 6.97 | 0 |
| 12 | 0.84 | 0.83 | 0.37 | 0.80 | 0.34 | 93.62 | 2 (64, 144) | ||
| 13 | 0.89 | 0.88 | 0.34 | 0.78 | 0.34 | 89.36 | 0 | ||
| 14 | 0.88 | 0.85 | 0.33 | 0.82 | 0.33 | 95.74 | 1 (128) | ||
| 15 | 0.89 | 0.88 | 0.34 | 0.79 | 0.33 | 89.36 | 0 |
AD using standardization technique is used and compound ID mentioned under parenthesis.
NTraining = 109, NTest = 47.
NTraining = 109, NTest = 47.
NTraining = 109, NTest = 47.
Results for Bioluminescent Repression of the Bacterium Genus Pseudomonas Dataset (Model Dataset 4) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| model | division method | MAE95%Train | MAE95%Test | correct prediction in % | training set range | number
of compounds outside AD | |||
|---|---|---|---|---|---|---|---|---|---|
| 1 | sorted response | 0.78 | 0.73 | 0.32 | 0.42 | 0.32 | 80.65 | 4.06 | 0 |
| 2 | 0.73 | 0.69 | 0.35 | 0.45 | 0.31 | 80.65 | 2 (45, 47) | ||
| 3 | 0.80 | 0.74 | 0.28 | 0.64 | 0.30 | 87.10 | 1 (47) | ||
| 4 | 0.78 | 0.73 | 0.29 | 0.55 | 0.30 | 83.87 | 1 (47) | ||
| 5 | 0.80 | 0.74 | 0.29 | 0.66 | 0.31 | 83.87 | 1 (47) | ||
| 6 | Kennard–Stone | 0.67 | 0.61 | 0.32 | 0.62 | 0.40 | 81.25 | 4.06 | 0 |
| 7 | 0.72 | 0.60 | 0.32 | 0.66 | 0.39 | 84.38 | 0 | ||
| 8 | 0.68 | 0.65 | 0.32 | 0.62 | 0.39 | 78.13 | 0 | ||
| 9 | 0.67 | 0.60 | 0.33 | 0.62 | 0.40 | 81.25 | 0 | ||
| 10 | 0.69 | 0.63 | 0.32 | 0.63 | 0.39 | 78.13 | 0 | ||
| 11 | modified- | 0.75 | 0.70 | 0.30 | 0.63 | 0.32 | 80.65 | 4.06 | 0 |
| 12 | 0.72 | 0.68 | 0.31 | 0.65 | 0.34 | 83.87 | 0 | ||
| 13 | 0.72 | 0.68 | 0.32 | 0.62 | 0.32 | 77.42 | 0 | ||
| 14 | 0.71 | 0.67 | 0.31 | 0.63 | 0.31 | 80.65 | 1 (64) | ||
| 15 | 0.69 | 0.65 | 0.32 | 0.62 | 0.29 | 87.10 | 1 (64) |
AD using standardization technique is used and compound ID mentioned under parenthesis.
NTraining = 73, NTest = 31.
NTraining = 72, NTest = 32.
NTraining = 73, NTest = 31.
Results for BACE1 Dataset (True External Dataset 2) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| training
set | test
set | true
external test set | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| model | MAE95%Train | MAE95%Test | correct prediction in % | number of compounds outside AD | MAE95%Test | correct prediction in % | number of compounds
outside AD | ||||
| 1 | 0.83 | 0.76 | 0.37 | 0.75 | 0.32 | 86.36 | 0 | 0.90 | 0.31 | 82.35 | 2 (81, 91) |
| 2 | 0.80 | 0.75 | 0.37 | 0.79 | 0.32 | 91.30 | 0 | 0.72 | 0.37 | 88.24 | 2 (81, 89) |
| 3 | 0.80 | 0.76 | 0.35 | 0.91 | 0.24 | 95.65 | 0 | 0.83 | 0.34 | 88.24 | 1 (81) |
| 4 | 0.76 | 0.71 | 0.38 | 0.77 | 0.27 | 91.30 | 0 | 0.75 | 0.37 | 88.24 | 2 (81, 89) |
| 5 | 0.79 | 0.75 | 0.38 | 0.79 | 0.28 | 91.30 | 0 | 0.86 | 0.33 | 82.35 | 1 (81) |
AD using standardization technique is used and compound ID mentioned under parenthesis; division method: Kennard–Stone; NTraining = 51, NTest = 23, NTrue-External-Test = 17.
Results for Glass Transition Temperature of Polymers Dataset (True External Dataset 3) with Best Weighting (0.5–0–0.5) Combination as Obtained from the Retrospective Study
| training
set | test
set | true
external test set | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| model | MAE95%Train | MAE95%Test | correct prediction in % | number of compounds outside AD | MAE95%Test | correct prediction in % | number of compounds
outside AD | ||||
| 1 | 0.74 | 0.68 | 0.04 | 0.74 | 0.04 | 86.54 | 3 (2, 16, 37) | 0.75 | 0.06 | 68.42 | 2 (14, 39) |
| 2 | 0.75 | 0.71 | 0.04 | 0.73 | 0.05 | 88.46 | 3 (2, 16, 324) | 0.77 | 0.05 | 84.21 | 1 (39) |
| 3 | 0.76 | 0.72 | 0.04 | 0.80 | 0.04 | 88.46 | 3 (2, 16, 37) | 0.85 | 0.04 | 84.21 | 1 (39) |
| 4 | 0.71 | 0.66 | 0.04 | 0.70 | 0.04 | 90.38 | 3 (2, 16, 37) | 0.80 | 0.04 | 84.21 | 1 (14) |
| 5 | 0.76 | 0.70 | 0.05 | 0.72 | 0.04 | 82.69 | 4 (2, 16, 37, 324) | 0.81 | 0.04 | 81.58 | 1 (39) |
AD using standardization technique is used and compound ID mentioned under parenthesis; division method: activity sorted; NTraining = 154, NTest = 52, NTrue-External-Test = 38.
Figure 3Radar plots showing occurrence (in fractions of all cases) of weighting 0.5–0–0.5 for maximum % correct predictions for individual datasets and combined datasets.