| Literature DB >> 29862081 |
Shola Elijah Adeniji1, Sani Uba1, Adamu Uzairu1.
Abstract
A quantitative structure-activity relationship (QSAR) study was performed to develop a model that relates the structures of 50 compounds to their activities against M. tuberculosis. The compounds were optimized by employing density functional theory (DFT) with B3LYP/6-31G⁎. The Genetic Function Algorithm (GFA) was used to select the descriptors and to generate the correlation model that relates the structural features of the compounds to their biological activities. The optimum model has squared correlation coefficient (R2) of 0.9202, adjusted squared correlation coefficient (Radj) of 0.91012, and leave-one-out (LOO) cross-validation coefficient (Qcv2) value of 0.8954. The external validation test used for confirming the predictive power of the built model has R2pred value of 0.8842. These parameters confirm the stability and robustness of the model. Docking analysis showed the best compound with high docking affinity of -14.6 kcal/mol which formed hydrophobic interaction and hydrogen bond with amino acid residues of M. tuberculosis cytochromes (Mtb CYP121). QSAR and molecular docking studies provide valuable approach for pharmaceutical and medicinal chemists to design and synthesize new anti-Mycobacterium tuberculosis compounds.Entities:
Year: 2018 PMID: 29862081 PMCID: PMC5971244 DOI: 10.1155/2018/1018694
Source DB: PubMed Journal: J Pathog ISSN: 2090-3057
Molecular structure of 1,2,4-Triazole derivatives and their activities.
| S/N | Molecules | Experimental |
|---|---|---|
|
|
| 4.925 |
|
| ||
| 2a |
| 5.0345 |
|
| ||
|
|
| 5.0064 |
|
| ||
|
|
| 5.7386 |
|
| ||
| 5a |
| 5.5994 |
|
| ||
| 6a |
| 5.4543 |
|
| ||
|
|
| 4.7441 |
|
| ||
|
|
| 6.1674 |
|
| ||
| 9a |
| 6.3456 |
|
| ||
|
|
| 7.4134 |
|
| ||
|
|
| 5.7441 |
|
| ||
|
|
| 5.9258 |
|
| ||
| 13a |
| 5.6754 |
|
| ||
|
|
| 6.3793 |
|
| ||
|
|
| 6.1667 |
|
| ||
| 16a |
| 5.8765 |
|
| ||
|
|
| 6.4171 |
|
| ||
|
|
| 5.9413 |
|
| ||
|
|
| 7.6397 |
|
| ||
|
|
| 8.0899 |
|
| ||
|
|
| 6.3981 |
|
| ||
|
|
| 5.8131 |
|
| ||
|
|
| 6.2878 |
|
| ||
|
|
| 5.7268 |
|
| ||
|
|
| 7.366 |
|
| ||
| 26a |
| 7.0123 |
|
| ||
|
|
| 6.5267 |
|
| ||
|
|
| 5.7405 |
|
| ||
| 29a |
| 5.6533 |
|
| ||
| 30a |
| 6.1923 |
|
| ||
|
|
| 7.3233 |
|
| ||
|
|
| 6.0097 |
|
| ||
|
|
| 6.0928 |
|
| ||
|
|
| 7.3279 |
|
| ||
|
|
| 6.8568 |
|
| ||
| 36a |
| 6.2234 |
|
| ||
|
|
| 7.3079 |
|
| ||
|
|
| 7.314 |
|
| ||
| 39a |
| 8.5854 |
|
| ||
|
|
| 8.0615 |
|
| ||
|
|
| 8.0615 |
|
| ||
| 42a |
| 6.8494 |
|
| ||
|
|
| 7.9432 |
|
| ||
| 44a |
| 7.4535 |
|
| ||
|
|
| 7.9759 |
|
| ||
|
|
| 7.9759 |
|
| ||
|
|
| 7.9294 |
|
| ||
| 48a |
| 6.1213 |
|
| ||
|
|
| 5.4406 |
|
| ||
| 50 |
| 4.9074 |
Superscript a represents the test set.
Minimum recommended value of validation parameters for a generally acceptable QSAR model.
| Validation parameter | Name | Value |
|---|---|---|
|
| Coefficient of determination | ≥0.6 |
|
| Confidence interval at 95% confidence level | <0.05 |
|
| Cross-validation coefficient | >0.5 |
|
| Difference between | ≤0.3 |
|
| Minimum number of external test sets | ≥5 |
|
| Coefficient of determination for external test set | ≥0.6 |
|
| Coefficient of determination for | >0.5 |
Figure 1(a) Prepared structure of Mtb CYP121. (b) 3D structures of the prepared ligands.
Experimental, predicted, and residual values for 1,2,4-Triazole derivatives.
| S/number | Experimental | Predicted | Residual |
|---|---|---|---|
|
| 4.925 | 4.8922 | 0.0328 |
|
| 5.0345 | 4.8716 | 0.1629 |
|
| 5.0064 | 5.0941 | −0.0877 |
|
| 5.7386 | 5.8308 | −0.0922 |
|
| 5.5994 | 5.5803 | 0.0191 |
|
| 5.4543 | 5.6969 | −0.2426 |
|
| 4.7441 | 4.8047 | −0.0606 |
|
| 6.1674 | 6.2999 | −0.1325 |
|
| 6.3456 | 6.5053 | −0.1597 |
|
| 7.4134 | 7.1548 | 0.2586 |
|
| 5.7441 | 6.0862 | −0.3421 |
|
| 5.9258 | 5.6383 | 0.2875 |
|
| 5.6754 | 5.4834 | 0.192 |
|
| 6.3793 | 6.3443 | 0.035 |
|
| 6.1667 | 6.5432 | −0.3765 |
|
| 5.8765 | 6.8765 | −1.000 |
|
| 6.4171 | 6.1354 | 0.2817 |
|
| 5.9413 | 6.02517 | −0.08387 |
|
| 7.6397 | 7.6055 | 0.0342 |
|
| 8.0899 | 7.8436 | 0.2463 |
|
| 6.3981 | 6.2094 | 0.1887 |
|
| 5.8131 | 6.4308 | −0.6177 |
|
| 6.2878 | 6.30457 | −0.01677 |
|
| 5.7268 | 5.9933 | −0.2665 |
|
| 7.366 | 7.5444 | −0.1784 |
|
| 7.0123 | 6.8471 | 0.1652 |
|
| 6.5267 | 5.9850 | 0.5417 |
|
| 5.7405 | 6.0962 | −0.3557 |
|
| 5.6533 | 6.4796 | −0.8263 |
|
| 6.1923 | 6.0426 | 0.1497 |
|
| 7.3233 | 6.5095 | 0.8138 |
|
| 6.0097 | 6.3151 | −0.3054 |
|
| 6.0928 | 5.9501 | 0.1427 |
|
| 7.3279 | 7.3990 | −0.0711 |
|
| 6.8568 | 6.8761 | −0.0193 |
|
| 6.2234 | 8.6487 | −2.4253 |
|
| 7.3079 | 7.2405 | 0.0674 |
|
| 7.314 | 7.5050 | −0.191 |
|
| 8.5854 | 5.6969 | 2.8885 |
|
| 8.0615 | 8.1009 | −0.0394 |
|
| 8.0615 | 7.8073 | 0.2542 |
|
| 6.8494 | 7.6746 | −0.8252 |
|
| 7.9432 | 7.9352 | 0.008 |
|
| 7.4535 | 7.6946 | −0.2411 |
|
| 7.9759 | 7.8569 | 0.119 |
|
| 7.9759 | 8.2103 | −0.2344 |
|
| 7.9294 | 7.9408 | −0.0114 |
|
| 6.1213 | 5.9165 | 0.2048 |
|
| 5.4406 | 5.2695 | 0.1711 |
|
| 4.9074 | 4.8495 | 0.0579 |
Validation of the genetic function approximation from Materials Studio.
| S/number | Equation | |
|---|---|---|
|
| Friedman LOF | 0.40847300 |
|
|
| 0.92023900 |
|
| Adjusted | 0.91017400 |
|
| Cross-validated | 0.89538600 |
|
| Significant regression | Yes |
|
| Significance-of-regression | 58.41835200 |
|
| Critical SOR | 2.45854700 |
|
| Replicate points | 0 |
|
| Computed experimental error | 0.00000000 |
|
| Lack-of-fit points | 28 |
|
| Min expt. error for nonsignificant LOF (95%) | 0.24688800 |
Descriptive statistics of the inhibition data.
| Statistical parameters | Activity | |
|---|---|---|
| Training set | Test set | |
| Number of sample points | 35 | 15 |
| Range | 3.3458 | 3.678 |
| Maximum | 8.0899 | 8.2854 |
| Minimum | 4.7441 | 4.9074 |
| Mean | 6.622234 | 6.498873 |
| Median | 6.3981 | 6.1213 |
| Variance | 0.924712 | 0.866467 |
| Standard deviation | 0.96162 | 0.93084 |
| Mean absolute deviation | 0.871588 | 0.703515 |
| Skewness | −8.48 | 0.87066 |
| Kurtosis | −1.24682 | 0.153415 |
List of some descriptors used in the QSAR optimization model.
| S/number | Descriptors symbols | Name of descriptor(s) | Class |
|---|---|---|---|
|
| AATS7s | Average Moreau-Broto Autocorrelation-lag 7/weighted by I-state | 2D |
|
| nHBint3 | Count of E-state descriptors of strength for potential hydrogen bonds of path length 3 | 2D |
|
| minHCsatu | Minimum atom-type H E-state: H on C sp3 bonded to unsaturated C | 2D |
|
| TDB9e | 3D topological distance based autocorrelation-lag 9/weighted by Sanderson electronegativities | 3D |
|
| RDF90i | Radial distribution function-090/weighted by relative first ionization potential | 3D |
|
| RDF110s | Radial distribution function-110/weighted by relative I-state | 3D |
Pearson's correlation matrix and statistics for descriptor used in the QSAR optimization model.
| Intercorrelation | Statistics | |||||||
|---|---|---|---|---|---|---|---|---|
| Descriptors | AATS7s | nHBint3 | minHCsatu | TDB9e | RDF90i | RDF110s |
| VIF |
| AATS7s | 1 | −3.9153 | 1.8931 | |||||
| nHBint3 | −0.29824 | 1 | 11.6469 | 1.2779 | ||||
| minHCsatu | 0.196097 | 0.269067 | 1 | 10.0386 | 3.6622 | |||
| TDB9e | 0.446768 | −0.19131 | −0.14868 | 1 | 5.66824 | 1.3493 | ||
| RDF90i | 0.097382 | −0.13902 | −0.39183 | 0.144839 | 1 | 9.45783 | 3.0968 | |
| RDF110s | 0.116862 | −0.25217 | −0.66819 | 0.208747 | 0.227911 | 1 | −5.5848 | 3.0275 |
Specification of entered descriptors in genetic algorithm-multiple regression model.
| Descriptors | Standard regression coefficient ( | Mean effect (ME) |
|
|---|---|---|---|
| AATS7s | −0.2769 | −0.31421 | 0.000527 |
| nHBint3 | 0.67675 | 0.153246 | 3 |
| minHCsatu | 0.987436 | 0.58264 | 8.84 |
| TDB9e | 0.338438 | 0.351968 | 4.48 |
| RDF90i | 1.097495 | 0.34097 | 3.25 |
| RDF110s | −0.49948 | −0.11461 | 5.62 |
Y- Randomization parameters test.
| Model |
|
|
|
|---|---|---|---|
| Original | 0.962302 | 0.926026 | 0.895386 |
| Random 1 | 0.387394 | 0.150074 | −0.28301 |
| Random 2 | 0.534646 | 0.285847 | −0.15518 |
| Random 3 | 0.357333 | 0.127687 | −0.43633 |
| Random 4 | 0.509588 | 0.25968 | −0.08884 |
| Random 5 | 0.231807 | 0.053735 | −0.60188 |
| Random 6 | 0.140884 | 0.019848 | −0.61556 |
| Random 7 | 0.513288 | 0.263465 | −0.11043 |
| Random 8 | 0.548099 | 0.300412 | −0.062 |
| Random 9 | 0.36673 | 0.134491 | −0.25601 |
| Random 10 | 0.505524 | 0.255554 | −0.12398 |
|
| |||
| Random models parameters | |||
| Average | 0.409529 | ||
| Average | 0.185079 | ||
| Average | −0.27332 | ||
| c | 0.837983 | ||
Figure 2Plot of predicted activity against experimental activity of training set.
Figure 3Plot of predicted activity against experimental activity of test set.
Figure 4Plot of standardized residual activity versus experimental activity.
Figure 5The Williams plot of the standardized residuals versus the leverage value.
Binding affinity of 1,2,4-Triazole derivatives with M. tuberculosis target (Mtb CYP121).
| Ligand | Target | Binding affinity (BA) |
|---|---|---|
|
| Mtb CYP121 | −7.3 |
|
| Mtb CYP121 | −7.8 |
|
| Mtb CYP121 | −8.5 |
|
| Mtb CYP121 | −9.1 |
|
| Mtb CYP121 | −9.6 |
|
| Mtb CYP121 | −9.8 |
|
| Mtb CYP121 | −10.3 |
|
| Mtb CYP121 | −14.6 |
|
| Mtb CYP121 | −9.6 |
|
| Mtb CYP121 | −9.6 |
|
| Mtb CYP121 | −9.2 |
|
| Mtb CYP121 | −11.2 |
|
| Mtb CYP121 | −11.2 |
|
| Mtb CYP121 | −5.1 |
|
| Mtb CYP121 | −9.9 |
|
| Mtb CYP121 | −5.3 |
|
| Mtb CYP121 | −6.1 |
|
| Mtb CYP121 | −7.9 |
|
| Mtb CYP121 | −7 |
|
| Mtb CYP121 | −7.8 |
|
| Mtb CYP121 | −5.5 |
|
| Mtb CYP121 | −5.7 |
|
| Mtb CYP121 | −5.5 |
|
| Mtb CYP121 | −6.9 |
|
| Mtb CYP121 | −6.6 |
|
| Mtb CYP121 | −6.7 |
|
| Mtb CYP121 | −5.4 |
|
| Mtb CYP121 | −5.1 |
|
| Mtb CYP121 | −5.4 |
|
| Mtb CYP121 | −7.5 |
|
| Mtb CYP121 | −7.3 |
|
| Mtb CYP121 | −6.6 |
|
| Mtb CYP121 | −5.6 |
|
| Mtb CYP121 | −6 |
|
| Mtb CYP121 | −6.3 |
|
| Mtb CYP121 | −7.8 |
|
| Mtb CYP121 | −7.8 |
|
| Mtb CYP121 | −8.4 |
|
| Mtb CYP121 | −5.7 |
|
| Mtb CYP121 | −6.3 |
|
| Mtb CYP121 | −6.3 |
|
| Mtb CYP121 | −5.9 |
|
| Mtb CYP121 | −5.6 |
|
| Mtb CYP121 | −5.5 |
|
| Mtb CYP121 | −6.2 |
|
| Mtb CYP121 | −5.7 |
|
| Mtb CYP121 | −5.9 |
|
| Mtb CYP121 | −5.7 |
|
| Mtb CYP121 | −5.2 |
|
| Mtb CYP121 | −7.8 |
Figure 6(7a) and (7b) show the 3D and 2D interactions between Mtb CYP121 and ligand 7. (8a) and (8b) show the 3D and 2D interactions between Mtb CYP121 and ligand 8. (13a) and (13b) show the 3D and 2D interactions between Mtb CYP121 and ligand 13. (14a) and (14b) show the 3D and 2D interactions between Mtb CYP121 and ligand 14.
Binding affinity, hydrogen bond, and hydrophobic bond of ligands 7, 8, 13, and 14 with M. tuberculosis target (Mtb CYP121).
| Ligand | Binding affinity (BA) | Target | Hydrogen bond | Hydrophobic interaction | |
|---|---|---|---|---|---|
| Amino acid | Bond length (Å) | Amino acid | |||
|
| −10.3 | Mtb CYP121 | GLN385 | 2.16131 | VAL83, PRO285, VAL78, VAL78, LA167 |
|
| −14.6 | Mtb CYP121 | ALA337 | 2.82894 | PHE280, ALA233, CYS345, MET86, ALA233, PRO346 |
|
| −11.2 | Mtb CYP121 | ASN74 | 2.34218 | VAL78, ALA233, PRO285, ALA233, PRO346 |
|
| −11.2 | Mtb CYP121 | ASN74 | 2.36479 | LEU164, VAL228, VAL78, ALA233,PRO285 |