| Literature DB >> 19330084 |
Razieh Sabet1, Afshin Fassihi1.
Abstract
A series of 3-hydroxypyridine-4-one and 3-hydroxypyran-4-one derivatives were subjected to quantitative structure-antimicrobial activity relationships (QSAR) analysis. A collection of chemometrics methods, including factor analysis-based multiple linear regression (FA-MLR), principal component regression (PCR) and partial least squares combined with genetic algorithm for variable selection (GA-PLS) were employed to make connections between structural parameters and antimicrobial activity. The results revealed the significant role of topological parameters in the antimicrobial activity of the studied compounds against S. aureus and C. albicans. The most significant QSAR model, obtained by GA-PLS, could explain and predict 96% and 91% of variances in the pIC(50) data (compounds tested against S. aureus) and predict 91% and 87% of variances in the pIC(50) data (compounds tested against C. albicans), respectively.Entities:
Keywords: 3-Hydroxypyridine-4-one; 3-hydroxypyran-4-one; Chemometrics; QSAR
Year: 2008 PMID: 19330084 PMCID: PMC2635637 DOI: 10.3390/ijms9122407
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Chemical structure of the compounds used in QSAR analysis.
| Compound | X | R2 | R3 | R5 | R6 |
|---|---|---|---|---|---|
| NH | CH3 | OH | CH2-Ra | H | |
| NH | C2H5 | OH | CH2-Ra | H | |
| NH | CH3 | OH | CH2-N(CH3)2 | H | |
| NH | C2H5 | OH | CH2-N(CH3)2 | H | |
| NH | CH3 | OH | CH2-N(C2H5)2 | H | |
| NH | C2H5 | OH | CH2-N(C2H5)2 | H | |
| N-Ph | CH3 | OH | H | H | |
| N- | CH3 | OH | H | H | |
| N-C3H7 | CH3 | OH | H | H | |
| N-C4H9 | CH3 | OH | H | H | |
| O | CH2Cl | H | OH | H | |
| O | CH3 | H | OH | H | |
| O | CH2OH | OH | H | CH3 | |
| O | CH2OH | OCH2Ph | H | CH3 | |
| O | CHO | OCH2Ph | H | CH3 | |
| O | COOH | OCH2Ph | H | CH3 | |
| O | CONHRb | OCH2Ph | H | CH3 | |
| O | CONHRc | OCH2Ph | H | CH3 | |
| O | CONHRd | OCH2Ph | H | CH3 | |
| O | CONHRb | OH | H | CH3 | |
| O | CONHRc | OH | H | CH3 | |
| O | CONHRd | OH | H | CH3 | |
| O | CH2OH | H | OCH2Ph | H | |
| O | COOH | H | OCH2Ph | H | |
| O | CONHPh | H | OCH2Ph | H | |
| N-CH3 | CONHPh | H | OCH2Ph | H | |
| N-CH3 | CONHPh | H | OH | H | |
| O | CONH-Re | H | OCH2Ph | H | |
| N-CH3 | CONH-Re | H | OCH2Ph | H | |
| N-CH3 | CONH-Re | H | OH | H | |
| O | CH2OH | H | OH | H |
Brief description of some descriptors used in this study.
| Descriptor Type | Molecular Description |
|---|---|
| Constitutional | Mean atomic van der Waals volume (Mv) (scaled on Carbon atom), no. of heteroatoms, no. of multiple bonds (nBM), no. of rings, no. of circuits, no of H-bond donors, no of H-bond acceptors, no. of Nitrogen atoms (nN), chemical composition, sum of Kier-Hall electrotopological states (Ss), mean atomic polarizability (Mp), number of rotable bonds (RBN), mean atomic Sanderson electronegativity (Me), etc.
|
| Topological | Narumi harmonic topological index (HNar), Total structure connectivity index (Xt), information content index (IC), mean information content on the distance degree equality (IDDE), total walk count, path/walk-Randic shape indices (PW3, PW4, PW5, Zagreb indices, Schultz indices, Balaban J index (such as MSD) Wiener indices, Information content index (neighborhood symmetry of 2-order) (IC2), Ratio of multiple path count to path counts (PCR), Lovasz-Pelikan index (leading eigenvalue) (LP1), total information content index (neighborhood symmetry of 1-order) (TIC1), reciprocal hyper-detour index (Rww), Average connectivity index chi-5 (X5A), piID (conventional bond-order ID number), etc.
|
| Geometrical | 3D Petijean shape index (PJI3), Asphericity (ASP), Gravitational index, Balaban index, Wiener index, Length-to-breadth ratio by WHIM (L/Bw), etc.
|
| Quantum | Highest occupied Molecular Orbital Energy (HOMO), Lowest Unoccupied Molecular Orbital Energy (LUMO), Most positive charge (MPC), Sum of square of positive charges (SSPC), Sum of square of negative charges (SSNC), Sum of positive charges (SUMPC), Sum of negative charges (SUMNC), Sum of absolute of charges (SAC), Standard deviation (Std), Total dipole moment (DMt), Molecular dipole moment at X-direction (DMX), Molecular dipole moment at Y-direction (DMY), Molecular dipole moment at Z-direction (DMZ), Electronegativity (χ= −0.5 (HOMO-LUMO)), Electrophilicity (ω= χ2/2 η), Hardness (η = 0.5 (HOMO+LUMO)), Softness (S=1/ η).
|
| Functional group | Number of total secondary C(sp3) (nCs), Number of total tertiary carbons (nCt), Number of H-bond acceptor atoms (nHAcc), Number of secondary amides (aliphatic) (nCONHR), Number of unsubstituted aromatic C (nCaH), Number of ethers (aromatic) (nRORPh), Number of ketones (aliphatic) (nCO), Number of tertiary amines (aliphatic) (nNR2), Number of phenols (nOHPh), Number of total primary C(sp3) (nCp), etc.
|
| Chemical | LogP (Octanol-water partition coefficient), Hydration Energy (HE), Polarizability (Pol), Molar refractivity (MR), Molecular volume (V), Molecular surface area (SA).
|
Figure 1.PLS regression coefficients for the variables used in GA-PLS model (against S. aureus).
Experimental and predicted activity of compounds against Staphylococcus aureus.
| Compound | Experimental pMIC | Predicted pMIC | REP |
|---|---|---|---|
| 3.29 | 3.3205 | 0.9173 | |
| 3.29 | 3.3007 | 0.3242 | |
| 3.29 | 3.2266 | −1.9664 | |
| 3.29 | 3.3976 | 3.1675 | |
| 4.19 | 3.7498 | −11.740 | |
| 3.29 | 3.3205 | 0.9173 | |
| 3.89 | 3.8255 | −1.6850 | |
| 3.29 | 3.2698 | −0.6172 | |
| 3.29 | 3.2886 | −0.0440 | |
| 3.89 | 3.9283 | 0.9738 | |
| 3.59 | 3.6207 | 0.8470 | |
| 3.59 | 3.7254 | 3.6340 | |
| 3.59 | 3.5063 | −2.3883 | |
| 3.59 | 3.6212 | 0.8627 | |
| 4.19 | 4.1563 | −0.8119 | |
| 3.59 | 3.5611 | −0.8123 | |
| 3.59 | 3.6177 | 0.7647 | |
| 3.59 | 3.5548 | −0.9915 | |
| 3.89 | 3.8950 | 0.1293 | |
| 4.19 | 4.0995 | −2.2079 | |
| 3.59 | 3.7117 | 3.2787 | |
| 5.10 | 5.0840 | −0.3141 | |
| 3.59 | 3.5533 | −1.0318 | |
| 3.59 | 3.7223 | 3.5534 | |
| 3.89 | 3.9222 | 0.8214 | |
| 3.89 | 3.9779 | 2.2092 | |
| 4.80 | 4.8022 | 0.0453 | |
| 3.89 | 3.8591 | −0.8011 | |
| 3.59 | 3.4907 | −2.8470 | |
| 4.49 | 4.5105 | 0.4549 | |
| 3.59 | 3.4728 | −3.3746 |
a pMIC= −log (MIC),
b REP = Relative Error Prediction
*Compounds used as prediction set
Figure 2.PLS regression coefficients for the variables used in GA-PLS model (against C. albicans).
Experimental and predicted activity of compounds against Candida albicans.
| Compd. | Experimental pMIC | Predicted pMIC | REP(%) |
|---|---|---|---|
| 3.29 | 3.4139 | 3.6304 | |
| 3.29 | 3.3893 | 2.9303 | |
| 3.89 | 3.8920 | 0.0514 | |
| 3.29 | 3.3591 | 2.0577 | |
| 3.29 | 3.3835 | 2.7631 | |
| 3.59 | 3.6477 | 1.5813 | |
| 3.29 | 3.3208 | 0.9272 | |
| 3.59 | 3.6196 | 0.8175 | |
| 3.89 | 3.9567 | 1.6857 | |
| 3.89 | 3.7481 | −3.7870 | |
| 3.89 | 3.9092 | 0.4922 | |
| 3.89 | 3.7076 | −4.9191 | |
| 3.89 | 3.8892 | −0.0203 | |
| 3.89 | 3.8422 | −1.2433 | |
| 4.49 | 4.3961 | −2.1360 | |
| 4.49 | 4.4476 | −0.9524 | |
| 3.89 | 3.7076 | −4.9191 | |
| 3.89 | 3.8014 | −2.3296 | |
| 3.89 | 3.9525 | 1.5813 | |
| 3.89 | 3.7450 | −3.8727 | |
| 3.89 | 3.9056 | 0.3994 | |
| 3.89 | 3.9969 | 2.6755 | |
| 3.89 | 3.8489 | −1.0691 | |
| 3.89 | 3.7573 | −3.5304 | |
| 3.89 | 3.9503 | 1.5262 | |
| 3.89 | 3.9964 | 2.6619 | |
| 3.89 | 3.8978 | 0.2006 | |
| 3.89 | 3.8732 | −0.4333 |
*Compounds used as prediction set
Numerical values of factor loading numbers 1–4 for some descriptors after VARIMAX rotation (against S. aureus).
| 1 | 2 | 3 | 4 | Commonality | |
|---|---|---|---|---|---|
| 0.588 | −0.105 | 0.587 | −0.313 | 0.799 | |
| 0.195 | −0.054 | 0.762 | 0.071 | 0.627 | |
| 0.059 | 0.637 | −0.013 | 0.620 | 0.794 | |
| −0.643 | −0.206 | −0.199 | −0.496 | 0.741 | |
| 0.751 | −0.413 | 0.362 | −0.259 | 0.934 | |
| 0.001 | −0.781 | 0.097 | −0.298 | 0.708 | |
| 0.087 | 0.902 | 0.068 | 0.003 | 0.826 | |
| 0.866 | 0.051 | 0.217 | −0.252 | 0.863 | |
| −0.645 | −0.505 | −0.307 | 0.081 | 0.772 | |
| 0.746 | 0.359 | 0.215 | 0.324 | 0.837 | |
| 0.667 | 0.460 | 0.368 | 0.292 | 0.877 | |
| 0.714 | 0.413 | 0.175 | 0.127 | 0.726 | |
| 0.375 | 0.611 | −0.315 | −0.276 | 0.689 | |
| −0.559 | 0.578 | −0.411 | 0.199 | 0.855 | |
| 0.894 | −0.140 | −0.143 | −0.079 | 0.845 | |
| 0.261 | 0.220 | 0.695 | −0.906 | 0.765 | |
| −0.082 | 0.081 | −0.214 | 0.853 | 0.787 | |
| 0.041 | −0.116 | 0.898 | −0.051 | 0.824 | |
| 29.87 | 20.10 | 17.15 | 12.12 | 79.24 |
Numerical values of factor loading numbers 1–5 for some descriptors after VARIMAX rotation (against C. albicans).
| 1 | 2 | 3 | 4 | 5 | Commonality | |
|---|---|---|---|---|---|---|
| −0.491 | −0.431 | −0.459 | −0.107 | 0.095 | 0.657 | |
| −0.007 | 0.102 | −0.209 | 0.860 | 0.322 | 0.898 | |
| 0.240 | 0.811 | −0.156 | −0.349 | 0.014 | 0.861 | |
| −0.706 | −0.389 | 0.142 | 0.323 | −0.310 | 0.871 | |
| −0.627 | −0.664 | −0.134 | −0.102 | 0.129 | 0.879 | |
| −0.166 | 0.594 | −0.377 | −0.158 | 0.893 | 0.584 | |
| 0.913 | −0.079 | 0.055 | 0.135 | −0.132 | 0.879 | |
| 0.579 | 0.272 | −0.164 | 0.210 | 0.584 | 0.820 | |
| 0.750 | −0.070 | −0.333 | −0.190 | −0.208 | 0.758 | |
| −0.075 | 0.087 | 0.866 | −0.198 | 0.322 | 0.905 | |
| 0.064 | 0.117 | 0.926 | −0.023 | 0.164 | 0.902 | |
| −0.206 | 0.754 | −0.224 | −0.097 | −0.325 | 0.777 | |
| −0.366 | 0.722 | 0.148 | 0.287 | −0.234 | 0.814 | |
| −0.191 | −0.415 | −0.165 | −0.447 | 0.356 | 0.562 | |
| 0.571 | −0.522 | 0.379 | 0.002 | −0.341 | 0.858 | |
| 0.628 | −0.627 | −0.277 | −0.107 | 0.602 | 0.872 | |
| 22.58 | 20.58 | 14.71 | 14.02 | 8.71 | 80.60 |
Figure 3.Plots of the cross-validated predicted activity against the experimental activity for the QSAR models obtained by different chemometrics methods (against S. aureus).
Figure 4.Plots of the cross-validated predicted activity against the experimental activity for the QSAR models obtained by different chemometrics methods (against C. albicans).