| Literature DB >> 19665859 |
M Jalali-Heravi1, M Asadollahi-Baboli, A Mani-Varnosfaderani.
Abstract
In this work, the inhibitory activity of pyridine N-oxide derivatives against human severe acute respiratory syndrome (SARS) is predicted in terms of quantitative structure-activity relationship (QSAR) models. These models were developed with the aid of multivariate adaptive regression spline (MARS) and adaptive neuro-fuzzy inference system (ANFIS) combined with shuffling cross-validation technique. A shuffling MARS algorithm is utilized to select the most important variables in QSAR modeling and then these variables were used as inputs of ANFIS to predict SARS inhibitory activities of pyridine N-oxide derivatives. A data set of 119 drug-like compounds was coded with over hundred calculated meaningful molecular descriptors. The best descriptors describing the inhibition mechanism were solvation connectivity index, length to breadth ratio, relative negative charge, harmonic oscillator of aromatic index, average molecular weight and total path count. These parameters are among topological, electronic, geometric, constitutional and aromaticity descriptors. The statistical parameters of R2 and root mean square error (RMSE) are 0.884 and 0.359, respectively. The accuracy and robustness of shuffling MARS-ANFIS model in predicting inhibition behavior of pyridine N-oxide derivatives (pIC50) was illustrated using leave-one-out and leave-multiple-out cross-validation techniques and also by Y-randomization. Comparison of the results of the proposed model with those of GA-PLS-ANFIS shows that the shuffling MARS-ANFIS model is superior and can be considered as a tool for predicting the inhibitory behavior of SARS drug-like molecules.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19665859 PMCID: PMC7126869 DOI: 10.1016/j.jpba.2009.07.009
Source DB: PubMed Journal: J Pharm Biomed Anal ISSN: 0731-7085 Impact factor: 3.935
Experimental and calculated inhibitor data using shuffling MARS–ANFIS model for pyridine N-oxide derivatives.
| No. | Subset | X1 | X2 | X3 | X4 | X5 | Z1 | Z2 | Y1 | Y2 | Y3 | Y4 | Exp. | MARS–ANFIS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | A | H | H | H | H | H | H | O | O | H | H | H | H | 3.840 | 3.866 |
| 2 | C | H | Me | H | H | H | H | O | O | H | H | H | H | 4.060 | 4.272 |
| 3 | B | H | H | Me | H | H | H | O | O | H | H | H | H | 4.059 | 4.327 |
| 4 | D | H | H | Me | H | H | H | O | O | H | H | H | H | 3.825 | 3.952 |
| 5 | A | H | H | H | Me | H | H | O | O | H | H | H | H | 4.040 | 4.232 |
| 6 | B | H | H | H | Me | H | H | O | – | H | H | H | H | 4.352 | 4.845 |
| 7 | E | H | Me | H | Me | H | H | O | O | H | H | H | H | 4.239 | 4.082 |
| 8 | D | H | Me | H | H | Me | H | O | O | H | H | H | H | 3.938 | 4.359 |
| 9 | C | H | Me | H | H | Me | H | O | – | H | H | H | H | 3.736 | 3.325 |
| 10 | F | H | Me | Me | H | H | Me | O | O | H | H | H | H | 4.423 | 4.721 |
| 11 | A | H | Me | Me | H | H | Me | O | – | H | H | H | H | 5.294 | 5.128 |
| 12 | E | H | H | H | Et | H | H | O | O | H | H | H | H | 4.361 | 4.150 |
| 13 | D | H | H | H | iProp | H | H | O | O | H | H | H | H | 4.239 | 4.633 |
| 14 | C | H | iProp | H | H | iProp | H | O | O | H | H | H | H | 4.491 | 4.699 |
| 15 | B | H | iProp | H | H | iProp | H | O | – | H | H | H | H | 4.717 | 4.327 |
| 16 | E | H | H | H | tBut | H | H | O | O | H | H | H | H | 4.502 | 4.663 |
| 17 | D | H | H | H | tPent | H | H | O | O | H | H | H | H | 4.652 | 4.626 |
| 18 | A | H | H | H | OMe | H | H | O | O | H | H | H | H | 4.756 | 4.874 |
| 19 | F | H | H | H | OMe | H | H | O | – | H | H | H | H | 3.955 | 3.910 |
| 20 | C | H | OMe | H | H | OMe | H | O | O | H | H | H | H | 5.098 | 5.199 |
| 21 | E | H | H | OMe | OMe | H | H | O | O | H | H | H | H | 3.712 | 3.738 |
| 22 | B | H | H | OMe | OMe | H | H | O | – | H | H | H | H | 3.878 | 4.083 |
| 23 | C | H | H | OMe | OMe | OMe | H | O | O | H | H | H | H | 5.321 | 5.680 |
| 24 | F | H | H | OMe | OMe | OMe | H | O | – | H | H | H | H | 3.823 | 3.498 |
| 25 | E | H | OMe | H | H | Me | H | O | O | H | H | H | H | 3.811 | 4.002 |
| 26 | A | H | OMe | H | H | Me | H | – | – | H | H | H | H | 3.834 | 3.752 |
| 27 | E | H | OEt | H | H | H | H | O | O | H | H | H | H | 4.037 | 3.952 |
| 28 | D | H | OEt | H | H | H | H | O | – | H | H | H | H | 3.899 | 3.905 |
| 29 | C | H | H | F | H | H | H | O | O | H | H | H | H | 3.651 | 3.250 |
| 30 | B | H | H | H | F | H | H | O | – | H | H | H | H | 4.148 | 4.250 |
| 31 | F | H | Cl | H | H | H | H | O | O | H | H | H | H | 3.461 | 3.138 |
| 32 | A | H | Cl | H | Cl | H | H | O | O | H | H | H | H | 4.174 | 4.007 |
| 33 | E | H | Cl | H | H | H | Cl | O | O | H | H | H | H | 4.710 | 4.872 |
| 34 | B | H | H | Cl | Cl | H | H | O | O | H | H | H | H | 4.327 | 4.140 |
| 35 | D | H | Cl | Cl | H | H | Cl | O | O | H | H | H | H | 5.040 | 4.892 |
| 36 | F | H | Cl | Cl | Cl | Cl | Cl | O | O | H | H | H | H | 4.734 | 4.809 |
| 37 | A | H | Cl | Cl | Me | Cl | Cl | O | O | H | H | H | H | 5.546 | 5.770 |
| 38 | E | H | Cl | H | NO2 | H | H | O | O | H | H | H | H | 5.302 | 5.119 |
| 39 | A | H | H | Br | H | H | H | O | O | H | H | H | H | 4.732 | 4.567 |
| 40 | C | H | Br | H | H | OMe | H | O | O | H | H | H | H | 4.647 | 4.170 |
| 41 | F | H | iProp | H | Br | iProp | H | O | O | H | H | H | H | 4.378 | 4.190 |
| 42 | B | H | I | H | H | H | H | O | O | H | H | H | H | 4.972 | 4.892 |
| 43 | F | H | NO2 | H | H | H | H | O | O | H | H | H | H | 4.176 | 4.103 |
| 44 | C | H | H | H | NO2 | H | H | O | O | H | H | H | H | 4.355 | 4.349 |
| 45 | E | H | H | NO2 | H | NO2 | H | O | O | H | H | H | H | 4.588 | 4.404 |
| 46 | B | H | H | NO2 | Me | H | H | O | O | H | H | H | H | 4.887 | 4.438 |
| 47 | F | H | H | Me | NO2 | H | H | O | O | H | H | H | H | 4.726 | 4.421 |
| 48 | C | H | Me | H | H | NO2 | H | O | O | H | H | H | H | 4.799 | 4.849 |
| 49 | A | H | OMe | H | H | NO2 | H | O | O | H | H | H | H | 3.733 | 3.717 |
| 50 | D | H | H | NO2 | Cl | H | H | O | O | H | H | H | H | 4.281 | 4.439 |
| 51 | F | H | CN | H | H | H | H | O | O | H | H | H | H | 4.883 | 4.672 |
| 52 | B | H | H | H | CN | H | H | O | O | H | H | H | H | 4.262 | 4.069 |
| 53 | E | H | H | H | Phe | H | H | O | O | H | H | H | H | 4.359 | 4.188 |
| 54 | D | H | OPhe | H | H | H | H | O | O | H | H | H | H | 4.850 | 4.615 |
| 55 | A | H | H | OMe | OBz | H | H | O | O | H | H | H | H | 4.533 | 4.783 |
| 56 | B | H | H | CF3 | H | H | H | O | – | H | H | H | H | 4.667 | 4.873 |
| 57 | C | H | OH | H | H | NO2 | H | – | – | H | H | H | H | 3.747 | 3.250 |
| 58 | F | Me | H | H | H | H | H | O | – | H | H | H | H | 3.876 | 3.994 |
| 59 | B | Me | H | H | Me | H | H | O | – | H | H | H | H | 5.420 | 5.237 |
| 60 | C | Me | Me | H | H | Me | H | – | – | H | H | H | H | 5.646 | 5.860 |
| 61 | D | Me | H | H | F | H | H | O | – | H | H | H | H | 3.545 | 3.407 |
| 62 | A | Me | Cl | H | H | Me | H | O | – | H | H | H | H | 5.905 | 5.390 |
| 63 | E | Et | H | H | H | H | H | O | O | H | H | H | H | 6.192 | 5.717 |
| 64 | C | Et | Me | H | H | Me | H | O | O | H | H | H | H | 3.658 | 3.525 |
| 65 | F | Prop | H | H | H | H | H | O | O | H | H | H | H | 3.582 | 3.633 |
| 66 | B | Prop | H | H | H | H | H | – | – | H | H | H | H | 3.831 | 3.934 |
| 67 | D | Prop | Me | H | H | Me | H | – | – | H | H | H | H | 3.622 | 3.473 |
| 68 | A | Hept | H | Me | Me | Me | H | – | – | H | H | H | H | 4.495 | 4.380 |
| 69 | F | Hept | Me | H | H | Me | H | O | O | H | H | H | H | 4.252 | 4.272 |
| 70 | D | Undec | Me | H | H | Me | H | O | O | H | H | H | H | 5.043 | 4.767 |
| 71 | C | Isobut | Me | H | H | Me | H | O | O | H | H | H | H | 3.691 | 3.399 |
| 72 | E | C3H6 | Me | H | H | Me | H | O | O | H | H | H | H | 4.731 | 4.549 |
| 73 | B | C6H5 | Me | H | H | H | H | O | O | H | H | H | H | 3.922 | 3.603 |
| 74 | C | C6H5 | Me | H | H | Me | H | O | O | H | H | H | H | 3.859 | 3.781 |
| 75 | D | C6H5 | Me | H | H | Me | H | – | – | H | H | H | H | 3.915 | 3.749 |
| 76 | F | CH2Ph | Me | H | H | Me | H | – | – | H | H | H | H | 3.976 | 4.003 |
| 77 | A | CN | Me | H | H | Me | H | – | – | H | H | H | H | 5.111 | 5.216 |
| 78 | D | CH2CO2H | Me | H | H | Me | H | O | O | H | H | H | H | 3.900 | 3.617 |
| 79 | F | Br | Me | H | H | Me | H | O | O | H | H | H | H | 3.567 | 3.598 |
| 80 | B | CO2CH3 | Me | H | H | H | H | O | – | H | H | H | H | 3.643 | 3.576 |
| 81 | C | CO2CH3 | Me | H | H | Me | H | O | O | H | H | H | H | 3.769 | 3.392 |
| 82 | D | CO2CH3 | H | OPh | H | H | H | O | O | H | H | H | H | 3.969 | 3.860 |
| 83 | F | CF3 | Me | H | H | Me | H | O | O | H | H | H | H | 4.186 | 4.199 |
| 84 | A | CH2OMe | Me | H | H | Me | H | O | O | H | H | H | H | 3.926 | 3.480 |
| 85 | E | Me, Cl | H | H | H | H | H | O | O | H | H | H | H | 4.143 | 3.895 |
| 86 | C | Me, Cl | Me | H | H | Me | H | O | O | H | H | H | H | 4.474 | 4.177 |
| 87 | B | Me | H | H | Me | H | Me | O | O | H | H | Me | H | 4.151 | 3.881 |
| 88 | D | H | H | H | H | H | H | O | O | H | H | H | Me | 4.087 | 4.356 |
| 89 | F | Me | H | H | Me | H | Me | O | O | H | H | H | Me | 3.973 | 3.970 |
| 90 | A | Me | H | H | Me | H | H | O | – | H | H | H | Me | 3.905 | 3.804 |
| 91 | E | Me | H | H | H | H | H | O | O | H | H | H | Me | 3.817 | 3.616 |
| 92 | D | Me | H | Me | H | H | H | O | O | H | H | H | Me | 4.213 | 4.120 |
| 93 | B | Me | H | H | Me | H | Et | O | O | H | H | H | Me | 4.423 | 4.289 |
| 94 | F | Cl | H | H | H | Cl | H | – | – | H | H | H | Me | 4.203 | 4.127 |
| 95 | C | H | H | H | H | H | Me | O | O | H | H | H | Me | 3.986 | 3.957 |
| 96 | E | Cl | H | H | H | H | H | O | O | H | H | H | Me | 3.665 | 3.634 |
| 97 | A | Me | NO2 | H | H | H | H | O | O | H | H | H | Me | 4.152 | 4.113 |
| 98 | B | Me | H | Me | H | H | Me | O | O | H | H | H | Me | 4.526 | 4.321 |
| 99 | F | Cl | H | H | H | H | Me | O | O | H | H | H | Me | 3.832 | 3.577 |
| 100 | D | Me | NO2 | H | H | H | Me | O | O | H | H | H | Me | 3.914 | 3.719 |
| 101 | E | Me | H | H | Me | H | H | O | O | H | H | H | OMe | 3.545 | 3.801 |
| 102 | A | Me | H | H | Me | H | H | O | – | H | H | H | OMe | 4.585 | 4.150 |
| 103 | C | Me | H | H | Me | H | H | – | – | H | H | H | OMe | 3.672 | 3.983 |
| 104 | E | Me | H | H | Me | H | H | – | – | H | H | H | OH | 3.595 | 3.088 |
| 105 | B | H | H | OMe | H | H | H | O | O | H | H | t-Bu | H | 3.604 | 3.629 |
| 106 | D | H | H | OMe | H | H | H | – | – | H | H | t-Bu | H | 3.946 | 4.304 |
| 107 | E | H | H | H | H | H | H | – | – | Cl | H | H | H | 3.801 | 3.871 |
| 108 | A | Me | H | H | Me | H | H | O | O | Cl | H | H | H | 3.720 | 3.324 |
| 109 | B | Me | H | H | Me | H | Me | O | O | Cl | H | H | H | 4.892 | 4.610 |
| 110 | F | H | H | H | H | H | H | – | – | H | Cl | H | H | 4.860 | 4.651 |
| 111 | C | Me | H | H | Me | H | H | O | O | H | Cl | H | H | 3.633 | 3.495 |
| 112 | E | Me | H | H | Me | H | H | O | – | H | Cl | H | H | 4.709 | 4.628 |
| 113 | A | H | H | H | H | H | Cl | O | O | H | Cl | H | H | 4.325 | 4.674 |
| 114 | D | H | H | H | H | H | H | O | O | H | H | H | Cl | 5.141 | 4.740 |
| 115 | F | H | H | H | H | H | H | O | – | H | H | H | Cl | 5.277 | 5.216 |
| 116 | C | Me | H | H | Me | H | H | – | – | H | H | H | Cl | 4.860 | 4.638 |
| 117 | B | Me | H | H | Me | H | Me | O | O | H | H | H | Cl | 3.824 | 4.007 |
| 118 | D | Me | H | H | Me | H | Cl | O | O | H | H | H | Cl | 5.283 | 5.216 |
| 119 | A | H | H | H | H | H | H | – | – | H | H | H | NO2 | 4.363 | 4.643 |
A–F subsets.
Substituted groups in pyridine N-oxide derivatives is shown in Fig. 2.
Fig. 2Main skeleton with different functional positions of pyridine N-oxide derivatives.
Fig. 1A typical ANFIS structure.
Selecting the important variables using shuffling MARS method.
| Run | Calibration set | RMSECal | Validation set | RMSEVal | ||
|---|---|---|---|---|---|---|
| 1 | A + B + C + D | 0.834 | 0.241 | E + F | 0.767 | 0.458 |
| 2 | A + B + C + E | 0.820 | 0.268 | D + F | 0.810 | 0.372 |
| 3 | A + B + D + E | 0.831 | 0.279 | C + F | 0.805 | 0.367 |
| 4 | A + C + D + E | 0.835 | 0.253 | B + F | 0.751 | 0.476 |
| 5 | B + C + D + E | 0.803 | 0.226 | A + F | 0.740 | 0.450 |
| 6 | A + B + C + F | 0.833 | 0.240 | D + E | 0.802 | 0.393 |
| 7 | A + B + D + F | 0.819 | 0.273 | C + E | 0.783 | 0.422 |
| 8 | A + C + D + F | 0.843 | 0.226 | B + E | 0.745 | 0.449 |
| 9 | B + C + D + F | 0.825 | 0.282 | A + E | 0.730 | 0.470 |
| 10 | A + B + E + F | 0.839 | 0.228 | C + D | 0.804 | 0.418 |
| 11 | A + C + E + F | 0.813 | 0.265 | B + D | 0.806 | 0.416 |
| 12 | B + C + E + F | 0.826 | 0.250 | A + D | 0.784 | 0.464 |
| 13 | A + D + E + F | 0.837 | 0.242 | B + C | 0.787 | 0.466 |
| 14 | B + D + E + F | 0.821 | 0.255 | A + C | 0.769 | 0.471 |
| 15 | C + D + E + F | 0.818 | 0.235 | A + B | 0.750 | 0.483 |
| Mean | 0.827 | 0.251 | 0.776 | 0.438 | ||
Fig. 3The selected descriptors and the frequency of each one in the shuffling MARS models.
Fig. 4Plot of the shuffling MARS–ANFIS calculated pIC50 values against the experimental ones for the training, test and validation sets.
Fig. 5Plot of residuals versus experimental values of pIC50 for the shuffling MARS–ANFIS.
Statistics using LOO-CV and LMO-CV methods for comparing the results of shuffling MARS–ANFIS method with GA-PLS-ANFIS method.
| Method | LOO | L12O | L18O | |||
|---|---|---|---|---|---|---|
| RMSEp | RMSEp | RMSEp | ||||
| Shuffling MARS–ANFIS | 0.892 | 0.331 | 0.884 | 0.359 | 0.870 | 0.380 |
| GA-PLS-ANFIS | 0.813 | 0.446 | 0.787 | 0.489 | 0.785 | 0.494 |
Calculation of was based on 1000 random selections of groups of 12 and 18 samples.
All R2 are adjusted coefficient regression.
Selected variables: X3sol, TPC, RNCG and AROM.
Mean values of and after performing 100 Y-randomization tests.
| Method | Mean of | Mean of |
|---|---|---|
| Shuffling MARS–ANFIS | 0.185 | 0.096 |
| GA-PLS-ANFIS | 0.236 | 0.143 |
Fig. 6Selected variables using GA-PLS method after 3000 runs.