| Literature DB >> 18282294 |
Edward O Cannon1, Florian Nigsch, John B O Mitchell.
Abstract
BACKGROUND: We have introduced a new Hybrid descriptor composed of the MACCS key descriptor encoding topological information and Ballester and Richards' Ultrafast Shape Recognition (USR) descriptor. The latter one is calculated from the moments of the distribution of the interatomic distances, and in this work we also included higher moments than in the original implementation.Entities:
Year: 2008 PMID: 18282294 PMCID: PMC2275267 DOI: 10.1186/1752-153X-2-3
Source DB: PubMed Journal: Chem Cent J ISSN: 1752-153X Impact factor: 4.215
WADA class & number of molecules
| WADA Class | Number of Molecules |
| P2 | 239 |
| S1 | 47 |
| S2 | 272 |
| S3 | 367 |
| S4 | 928 |
| S5 | 1,000 |
| S6 | 804 |
| S7 | 195 |
| S8 | 1,000 |
| S9 | 26 |
| Allowed | 367 |
| Total | 5,245 |
Figure 1Mesokurtic, leptokurtic and platykurtic.
Performance measures. Percentage of actives recalled in the top 1% and top 5% of the ranked validation sets, precision of predicted positives, area under the Receiver Operating Characteristic curve, F-measure and the Matthews Correlation Coefficient. All results are calculated over ten different runs and are based on the validation sets.
| Descriptor | P2 | S1 | S2 | S3 | S4 | S5 | S6 | S7 | S8 | S9 | Average | |
| Recall 1% | Hybrid | 100.00 | 76.36 | 90.29 | 94.73 | 85.86 | 57.60 | 91.74 | 88.96 | 40.92 | 55.00 | 78.15 |
| MACCS | 96.17 | 50.83 | 51.03 | 43.08 | 82.49 | 63.96 | 90.50 | 80.41 | 41.88 | 0.00 | 60.04 | |
| USR | 34.58 | 70.91 | 51.03 | 43.08 | 46.98 | 13.28 | 32.39 | 25.00 | 8.76 | 6.67 | 33.27 | |
| UF4 | 42.88 | 70.00 | 62.94 | 52.53 | 54.05 | 16.72 | 30.80 | 27.87 | 8.40 | 16.67 | 38.29 | |
| UF5 | 69.06 | 41.45 | 32.17 | 25.59 | 15.78 | 4.07 | 12.34 | 21.05 | 3.22 | 0.00 | 22.47 | |
| Recall 5% | Hybrid | 100.00 | 85.45 | 95.44 | 97.47 | 95.82 | 78.12 | 96.37 | 93.13 | 69.80 | 63.33 | 87.49 |
| MACCS | 96.67 | 59.17 | 76.47 | 59.45 | 93.27 | 80.40 | 94.63 | 85.92 | 65.64 | 4.17 | 71.58 | |
| USR | 61.19 | 80.91 | 76.47 | 59.45 | 69.87 | 32.72 | 54.88 | 45.42 | 28.56 | 23.33 | 53.28 | |
| UF4 | 73.39 | 77.27 | 83.82 | 65.82 | 74.01 | 38.76 | 53.98 | 42.19 | 29.04 | 41.67 | 58.00 | |
| UF5 | 84.21 | 49.35 | 48.20 | 34.93 | 24.55 | 11.39 | 22.41 | 38.30 | 11.39 | 3.75 | 32.85 | |
| Precision | Hybrid | 0.94 | 0.80 | 0.91 | 0.80 | 0.79 | 0.62 | 0.90 | 0.78 | 0.58 | 0.42 | 0.75 |
| MACCS | 0.93 | 0.07 | 0.87 | 0.77 | 0.67 | 0.62 | 0.89 | 0.71 | 0.53 | 0.00 | 0.61 | |
| USR | 0.12 | 0.55 | 0.23 | 0.38 | 0.50 | 0.13 | 0.36 | 0.41 | 0.03 | 0.00 | 0.27 | |
| UF4 | 0.32 | 0.69 | 0.17 | 0.51 | 0.71 | 0.08 | 0.51 | 0.49 | 0.27 | 0.00 | 0.38 | |
| UF5 | 0.21 | 0.52 | 0.37 | 0.44 | 0.64 | 0.16 | 0.49 | 0.27 | 0.04 | 0.00 | 0.32 | |
| AUC | Hybrid | 1.00 | 0.89 | 0.96 | 0.87 | 0.79 | 0.67 | 0.94 | 0.90 | 0.70 | 0.68 | 0.84 |
| MACCS | 1.00 | 0.55 | 0.96 | 0.75 | 0.67 | 0.75 | 0.91 | 0.89 | 0.74 | 0.83 | 0.81 | |
| USR | 1.00 | 0.69 | 0.65 | 0.61 | 0.58 | 0.54 | 0.56 | 0.59 | 0.54 | 0.57 | 0.63 | |
| UF4 | 1.00 | 0.77 | 0.64 | 0.72 | 0.60 | 0.55 | 0.57 | 0.63 | 0.53 | 0.55 | 0.66 | |
| UF5 | 1.00 | 0.94 | 0.84 | 0.79 | 0.81 | 0.71 | 0.76 | 0.76 | 0.71 | 0.53 | 0.78 | |
| F-measure | Hybrid | 0.91 | 0.54 | 0.82 | 0.72 | 0.71 | 0.45 | 0.83 | 0.70 | 0.27 | 0.22 | 0.62 |
| MACCS | 0.91 | 0.11 | 0.79 | 0.67 | 0.63 | 0.51 | 0.84 | 0.63 | 0.29 | 0.00 | 0.54 | |
| USR | 0.11 | 0.27 | 0.17 | 0.23 | 0.35 | 0.06 | 0.22 | 0.08 | 0.04 | 0.00 | 0.15 | |
| UF4 | 0.16 | 0.49 | 0.23 | 0.36 | 0.42 | 0.07 | 0.19 | 0.09 | 0.03 | 0.00 | 0.20 | |
| UF5 | 0.10 | 0.31 | 0.20 | 0.23 | 0.36 | 0.07 | 0.22 | 0.07 | 0.03 | 0.00 | 0.16 | |
| MCC | Hybrid | 0.91 | 0.58 | 0.83 | 0.73 | 0.71 | 0.47 | 0.83 | 0.71 | 0.32 | 0.26 | 0.63 |
| MACCS | 0.91 | 0.13 | 0.79 | 0.68 | 0.63 | 0.52 | 0.84 | 0.64 | 0.32 | 0.00 | 0.55 | |
| USR | 0.12 | 0.32 | 0.18 | 0.25 | 0.37 | 0.08 | 0.24 | 0.13 | 0.08 | 0.00 | 0.18 | |
| UF4 | 0.19 | 0.51 | 0.25 | 0.38 | 0.46 | 0.09 | 0.24 | 0.15 | 0.08 | 0.02 | 0.24 | |
| UF5 | 0.13 | 0.34 | 0.23 | 0.26 | 0.40 | 0.09 | 0.26 | 0.10 | 0.06 | 0.00 | 0.19 | |
The standard error of the mean. Percentage of actives recalled in the top 1% and top 5% of the ranked validation sets, precision of predicted positives, area under the Receiver Operating Characteristic curve, F-measure and the Matthews Correlation Coefficient. All results are calculated over ten different runs and are based on the validation sets. The standard error values at the 95% confidence level were calculated over the ten different runs and ten classes.
| Descriptor | P2 | S1 | S2 | S3 | S4 | S5 | S6 | S7 | S8 | S9 | SE 95% | |
| Recall 1% | Hybrid | 0.000 | 3.090 | 1.319 | 1.020 | 0.812 | 1.010 | 0.566 | 1.585 | 0.962 | 4.339 | 3.924 |
| MACCS | 0.255 | 1.944 | 3.142 | 1.092 | 4.183 | 1.000 | 0.381 | 1.330 | 4.379 | 0.000 | 5.679 | |
| USR | 2.652 | 4.242 | 1.612 | 1.369 | 0.972 | 0.710 | 0.837 | 2.106 | 0.665 | 2.722 | 4.025 | |
| UF4 | 2.115 | 5.080 | 1.934 | 1.482 | 0.764 | 0.616 | 0.765 | 1.374 | 0.637 | 5.556 | 4.171 | |
| UF5 | 0.015 | 0.018 | 0.017 | 0.015 | 0.005 | 0.002 | 0.006 | 0.011 | 0.002 | 0.000 | 4.131 | |
| Recall 5% | Hybrid | 0.000 | 2.010 | 0.774 | 0.614 | 0.450 | 0.949 | 0.386 | 1.164 | 0.794 | 3.333 | 2.520 |
| MACCS | 0.000 | 2.307 | 1.373 | 0.442 | 2.836 | 0.912 | 0.220 | 1.037 | 6.719 | 2.668 | 5.395 | |
| USR | 1.722 | 3.442 | 1.886 | 1.231 | 1.403 | 0.749 | 1.169 | 1.195 | 0.966 | 6.667 | 3.959 | |
| UF4 | 1.805 | 3.105 | 1.404 | 1.609 | 0.733 | 0.917 | 1.023 | 2.633 | 1.040 | 5.693 | 3.772 | |
| UF5 | 0.012 | 0.027 | 0.010 | 0.019 | 0.004 | 0.003 | 0.007 | 0.016 | 0.003 | 0.027 | 4.677 | |
| Precision | Hybrid | 0.004 | 0.020 | 0.059 | 0.099 | 0.005 | 0.009 | 0.034 | 0.009 | 0.012 | 0.037 | 0.033 |
| MACCS | 0.010 | 0.014 | 0.018 | 0.043 | 0.026 | 0.022 | 0.008 | 0.030 | 0.036 | 0.000 | 0.064 | |
| USR | 0.005 | 0.034 | 0.015 | 0.018 | 0.010 | 0.015 | 0.013 | 0.033 | 0.002 | 0.000 | 0.038 | |
| UF4 | 0.017 | 0.017 | 0.007 | 0.019 | 0.009 | 0.005 | 0.020 | 0.033 | 0.028 | 0.000 | 0.048 | |
| UF5 | 0.066 | 0.104 | 0.056 | 0.061 | 0.022 | 0.028 | 0.035 | 0.068 | 0.015 | 0.000 | 0.042 | |
| AUC | Hybrid | 0.000 | 0.009 | 0.002 | 0.005 | 0.003 | 0.002 | 0.001 | 0.003 | 0.004 | 0.019 | 0.024 |
| MACCS | 0.000 | 0.002 | 0.002 | 0.004 | 0.002 | 0.002 | 0.001 | 0.003 | 0.003 | 0.015 | 0.027 | |
| USR | 0.000 | 0.011 | 0.004 | 0.003 | 0.001 | 0.001 | 0.001 | 0.006 | 0.001 | 0.011 | 0.027 | |
| UF4 | 0.000 | 0.010 | 0.004 | 0.004 | 0.002 | 0.000 | 0.001 | 0.007 | 0.001 | 0.015 | 0.028 | |
| UF5 | 0.000 | 0.011 | 0.008 | 0.013 | 0.004 | 0.004 | 0.006 | 0.028 | 0.007 | 0.025 | 0.025 | |
| F-measure | Hybrid | 0.003 | 0.016 | 0.035 | 0.052 | 0.003 | 0.002 | 0.027 | 0.006 | 0.004 | 0.019 | 0.047 |
| MACCS | 0.006 | 0.021 | 0.008 | 0.015 | 0.012 | 0.009 | 0.007 | 0.017 | 0.010 | 0.000 | 0.061 | |
| USR | 0.003 | 0.014 | 0.009 | 0.004 | 0.003 | 0.003 | 0.003 | 0.007 | 0.001 | 0.000 | 0.022 | |
| UF4 | 0.006 | 0.015 | 0.006 | 0.010 | 0.003 | 0.003 | 0.003 | 0.006 | 0.001 | 0.001 | 0.033 | |
| UF5 | 0.016 | 0.059 | 0.020 | 0.019 | 0.016 | 0.008 | 0.013 | 0.019 | 0.006 | 0.000 | 0.024 | |
| MCC | Hybrid | 0.914 | 0.580 | 0.828 | 0.727 | 0.711 | 0.468 | 0.830 | 0.708 | 0.316 | 0.257 | 0.044 |
| MACCS | 0.006 | 0.025 | 0.008 | 0.015 | 0.011 | 0.009 | 0.007 | 0.016 | 0.010 | 0.000 | 0.060 | |
| USR | 0.116 | 0.315 | 0.183 | 0.252 | 0.372 | 0.081 | 0.237 | 0.130 | 0.075 | 0.001 | 0.023 | |
| UF4 | 0.188 | 0.511 | 0.251 | 0.383 | 0.455 | 0.085 | 0.244 | 0.150 | 0.077 | 0.019 | 0.033 | |
| UF5 | 0.019 | 0.064 | 0.013 | 0.018 | 0.010 | 0.007 | 0.011 | 0.025 | 0.010 | 0.001 | 0.026 | |