| Literature DB >> 21589807 |
M Shahlaei1, A Fassihi, A Nezami.
Abstract
In the present study, quantitative relationships between molecular structure and anti-tubercular activity of some 5-methyl/trifluoromethoxy-1H-indole-2,3-dione-3-thiosemicarbazone derivatives were discovered. The detailed application of an efficient linear method and principal component regression (PCR) for the evaluation of quantitative structure activity relationships of the studied compounds is demonstrated. Components produced by principal component analysis were used as the input for a linear model development. Results indicate a linear relationship between the principal components obtained from molecular descriptors and the inhibitory activity of this set of molecules. The maximum variance in the activity of the molecules in PCR method was 73%. The performance of the developed model was tested by several validation methods.Entities:
Keywords: Anti-tuberculosis activity; Principal component analysis; QSAR; Thiosemicarbazone derivaives
Year: 2009 PMID: 21589807 PMCID: PMC3093630
Source DB: PubMed Journal: Res Pharm Sci ISSN: 1735-5362
General structures and structural details of 5-methyl/trifluoromethoxy-1H-indole-2,3-dione-3-thiosemicarbazone derivatives used in this study.
| Compound | R1 | R2 | R3 | Type |
|---|---|---|---|---|
| 1 | CH3 | CH3CH=CH2 | H | A |
| 2 | CH3 | C4H9 | H | A |
| 3 | CH3 | C6H5CH2 | H | A |
| 4 | CH3 | 4-FC6H4 | H | A |
| 5 | CH3 | 2-BrC6H4 | H | A |
| 6 | CH3 | 3-BrC6H4 | H | A |
| 7 | CF3O | 4-NO2C6H4 | H | A |
| 8 | CF3O | CH3 | H | A |
| 9 | CF3O | C2H5 | H | A |
| 10 | CF3O | CH3CH=CH2 | H | A |
| 11 | CF3O | C4H9 | H | A |
| 12 | CF3O | H | A | |
| 13 | CF3O | C6H5CH2 | H | A |
| 14 | CF3O | C6H5 | H | A |
| 15 | CF3O | 4-CH3C6H4 | H | A |
| 16 | CF3O | 4-CH3OC6H4 | H | A |
| 17 | CF3O | 4-FC6H4 | H | A |
| 18 | CF3O | 4-ClC6H4 | H | A |
| 19 | CF3O | 4-BrC6H4 | H | A |
| 20 | CH3 | 2-BrC6H4 | CH3 | A |
| 21 | CF3O | C2H5 | CH3 | A |
| 22 | CF3O | CH3CH=CH2 | CH3 | A |
| 23 | CF3O | C4H9 | CH3 | A |
| 24 | CF3O | CH3 | A | |
| 25 | CF3O | C6H5CH2 | CH3 | A |
| 26 | CF3O | 4-CH3C6H4 | CH3 | A |
| 27 | CF3O | 4-FC6H4 | CH3 | A |
| 28 | CF3O | 4-ClC6H4 | CH3 | A |
| 29 | CF3O | 4-BrC6H4 | CH3 | A |
| 30 | - | CH3 | - | B |
| 31 | - | C2H5 | - | B |
| 32 | - | CH3CH=CH2 | - | B |
| 33 | - | C4H9 | - | B |
| 34 | - | - | B | |
| 35 | - | C6H5CH2 | - | B |
| 36 | - | C6H5 | - | B |
| 37 | - | 4-CH3C6H4 | - | B |
| 38 | - | 4-CH3OC6H4 | - | B |
| 39 | - | 4-FC6H4 | - | B |
| 40 | - | 4-ClC6H4 | - | B |
| 41 | - | 4-BrC6H4 | - | B |
| 42 | - | 4-NO2C6H4 | - | B |
| 43 | CH3 | C2H5 | H | A |
| 44 | CH3 | H | A | |
| 45 | CH3 | C6H5 | H | A |
| 46 | CH3 | 4-CH3C6H4 | H | A |
| 47 | CH3 | 4-ClC6H4 | H | A |
| 48 | CH3 | 4-BrC6H4 | H | A |
| 49 | CH3 | C6H5 | CH3 | A |
Molecules assigned by Kennard and Stone algorithm as test set
Some descriptors used in model building.
| Descriptor | Molecular Descriptor |
|---|---|
| Molecular weight, no. of atoms, no. of non-H atoms, no. of bonds, no. of heteroatoms, no. of multiple bonds (nBM), no. of aromatic bonds, no. of functional groups (hydroxyl, amine, aldehyde, carbonyl, nitro, nitroso, etc.), no. of rings, no. of circuits, no of H-bond donors, no of H-bond acceptors, no. of Nitrogen atoms (nN), chemical composition, sum of Kier-Hall electrotopological states (Ss), mean atomic polarizability (Mp), number of rotable bonds (RBN), etc. | |
| Molecular size index, molecular connectivity indices (X1A, X4A, X2v, X1Av, X2Av, X3Av, X4Av), information content index (IC), Kier Shape indices, total walk count, path/walk-Randic shape indices (PW3, PW4, Zagreb indices, Schultz indices, Balaban J index (such as MSD) Wiener indices, topological charge indices, Sum of topological distances between F..F (T(F..F)), Ratio of multiple path count to path counts (PCR), Mean information content vertex degree magnitude (IVDM), Eigenvalue sum of Z weighted distance matrix (SEigZ), reciprocal hyperdetour index (Rww), Eigenvalue coefficient sum from adjacency matrix (VEA1), radial centric information index, 2D petijean shape index (PJI2), etc. | |
| D petijean shape index (PJI3), Gravitational index, Balaban index, Wiener index, etc | |
| Number of total tertiary carbons (nCt), Number of H-bond acceptor atoms (nHAcc), number of total hydroxyl groups (nOH), number of unsubstituted aromatic C(nCaH), number of ethers (aromatic) (nRORPh), etc. |
Eigenvslues of calculated PCs, % of explained variances and cumulative variances.
| PC No. | Eigenvalue | % variance explained | cumulative variance |
|---|---|---|---|
| 1 | 195.0 | 60.70 | 60.70 |
| 2 | 39.46 | 12.26 | 72.96 |
| 3 | 27.92 | 8.672 | 81.63 |
| 4 | 12.28 | 3.814 | 85.45 |
| 5 | 8.755 | 2.719 | 88.16 |
| 6 | 8.281 | 2.572 | 90.73 |
| 7 | 5.879 | 1.826 | 92.56 |
| 8 | 4.508 | 1.400 | 93.96 |
| 9 | 3.887 | 1.207 | 95.17 |
Fig. 1The first two components (PC1, and PC2) from the principal component analysis of the 49 considered molecules.
Calculated activities by PCR and their relative error of prediction (REP).
| Compound | Experimental activity | Predicted activity | REP |
|---|---|---|---|
| 1 | 3.743 | 3.853 | 0.029 |
| 2 | 5.379 | 5.124 | -0.050 |
| 3 | 3.827 | 3.554 | -0.077 |
| 4 | 4.866 | 4.983 | 0.024 |
| 5 | 4.538 | 4.667 | 0.028 |
| 6 | 4.941 | 4.776 | -0.035 |
| 7 | 4.882 | 4.669 | -0.046 |
| 8 | 5.115 | 5.555 | 0.079 |
| 9 | 5.197 | 5.544 | 0.063 |
| 10 | 4.262 | 4.554 | 0.064 |
| 11 | 3.478 | 3.854 | 0.097 |
| 12 | 4.955 | 4.459 | -0.111 |
| 13 | 3.847 | 3.454 | -0.114 |
| 14 | 4.459 | 4.836 | 0.078 |
| 15 | 4.476 | 4.433 | -0.010 |
| 16 | 4.875 | 4.433 | -0.100 |
| 17 | 4.900 | 4.654 | -0.053 |
| 18 | 4.873 | 4.739 | -0.028 |
| 19 | 4.424 | 4.763 | 0.071 |
| 20 | 4.260 | 4.654 | 0.085 |
| 21 | 4.223 | 4.530 | 0.068 |
| 22 | 4.564 | 4.674 | 0.024 |
| 23 | 3.390 | 3.557 | 0.047 |
| 24 | 3.704 | 3.640 | -0.018 |
| 25 | 4.137 | 4.718 | 0.123 |
| 26 | 3.178 | 3.433 | 0.074 |
| 27 | 5.063 | 5.153 | 0.018 |
| 28 | 4.768 | 4.454 | -0.071 |
| 29 | 5.213 | 5.460 | 0.045 |
| 30 | 4.805 | 4.406 | -0.091 |
| 31 | 4.679 | 4.455 | -0.050 |
| 32 | 4.999 | 4.652 | -0.074 |
| 33 | 3.909 | 3.763 | -0.039 |
| 34 | 4.741 | 4.630 | -0.024 |
| 35 | 5.000 | 4.775 | -0.047 |
| 36 | 4.315 | 4.634 | 0.069 |
| 37 | 4.862 | 4.394 | -0.107 |
| 38 | 4.619 | 4.451 | -0.038 |
| 39 | 4.633 | 4.123 | -0.124 |
| 40 | 4.608 | 4.620 | 0.003 |
| 41 | 4.593 | 4.535 | -0.013 |
| 42 | 4.175 | 4.285 | 0.026 |
| 43 | 4.157 | 4.436 | 0.063 |
| 44 | 5.152 | 5.439 | 0.053 |
| 45 | 4.968 | 4.529 | -0.097 |
| 46 | 4.847 | 4.453 | -0.089 |
| 47 | 4.598 | 4.440 | -0.036 |
| 48 | 3.645 | 3.439 | -0.060 |
| 49 | 3.332 | 3.237 | -0.029 |
Fig. 2Plot of calculated vs. experimental activity of investigated compounds in training and test sets.
Statistical parameters obtained for the developed model for anti-tubrecular activity of the investigated compounds.
| Parameter | Training set | Test set |
|---|---|---|
| N | 40 | 9 |
| R2 | 0.735 | 0.762 |
| RMSE | 0.284 | 0.324 |
| PRESS | 3.274 | 0.946 |
| R2LOOCV | 0.792 | |
| RMSELOOCV | 0.207 | |
| R2L5O.CV | 0.746 | |
| RMSEL5OCV | 0.255 | |
| R2-R02/R2 | -0.008 | -0.041 |
| R2-R’02/R2 | -0.009 | -0.041 |
| k | 1.023 | 0.987 |
| k’ | 1.012 | 0.99 |
| Rm2 | 0.800 | 0.711 |
| R2 adjusted | 0.713 |
N: Number of objects in data set, R2: Correlation coefficient of experimental and predicted activities, RMSE: Root mean square error:, PRESS: Predicted error sum of square: , R2cv: Correlation coefficient of leave one out cross validation, RMSEcv: Root mean square error of cross validation