| Literature DB >> 24971318 |
Yuqin Li1, Guirong You1, Baoxiu Jia1, Hongzong Si2, Xiaojun Yao3.
Abstract
Quantitative structure-activity relationships (QSAR) were developed to predict the inhibition ratio of pyrrolidine derivatives on matrix metalloproteinase via heuristic method (HM) and gene expression programming (GEP). The descriptors of 33 pyrrolidine derivatives were calculated by the software CODESSA, which can calculate quantum chemical, topological, geometrical, constitutional, and electrostatic descriptors. HM was also used for the preselection of 5 appropriate molecular descriptors. Linear and nonlinear QSAR models were developed based on the HM and GEP separately and two prediction models lead to a good correlation coefficient (R (2)) of 0.93 and 0.94. The two QSAR models are useful in predicting the inhibition ratio of pyrrolidine derivatives on matrix metalloproteinase during the discovery of new anticancer drugs and providing theory information for studying the new drugs.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24971318 PMCID: PMC4054925 DOI: 10.1155/2014/210672
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The common structure of compounds.
The experimental and predicted log(IC50) and their residues of pyrrolidine derivatives to matrix metalloproteinases in training and test sets with HM and GEP.
|
|
*The compounds of the test set.
aThe predicted log(IC50).
bResidue = log(Exp.) − log(Pred.).
All the parameters and selection of GEP.
| Parameters | Selection |
|---|---|
| Division | / |
| Addition | + |
| Square Root | Sqrt |
| Sine | Sin |
| Tangent | Tan |
| Multiplication | ∗ |
| Subtraction | − |
| Power | Pow |
| Natural logarithm | Ln |
| 10∧X | Pow10 |
| Chromosomes | 100 |
| Genes | 5 |
| Head size | 8 |
| Gene size | 26 |
| Linking function | Addition |
| Generations without change | 200 |
| Number of tries | 3 |
| Max. complexity | 5 |
| Error type | MSE |
| Precision | — |
| Selection range | — |
| 0/1 rounding threshold | — |
| Mutation rate | 0.044 |
| Inversion rate | 0.1 |
| IS transposition rate | 0.1 |
| RIS transposition rate | 0.1 |
| One-point recombination rate | 0.3 |
| Two-point recombination rate | 0.3 |
| Gene recombination rate | 0.1 |
| Gene transposition rate | 0.1 |
| Constants per gene | 10 |
| Data type | Floating-point |
| Lower bound | −10 |
| Upper bound | 10 |
| RNC mutation | 0.01 |
| Dc mutation | 0.044 |
| Dc inversion | 0.1 |
| Dc IS transposition | 0.1 |
Descriptors and their physical-chemical meanings, coefficient, error, and Student's t-test in HM.
| Number | Descriptor | Physical-chemical meanings | Coefficient | Error |
|
|---|---|---|---|---|---|
| 0 | Intercept | −1.9501 | 1.2612 | −1.5463 | |
| 1 | LUMO | LUMO energy | 5.0431 | 4.8720 | |
| 2 | MRECO | Min resonance energy for a C–O bond | −3.6715 | 6.7200 | −5.4635 |
| 3 | KSIND | Kier shape index (order 3) | −2.0681 | 7.7119 | −2.6816 |
| 4 | ZX | ZX Shadow/ZX Rectangle | −7.0757 | 2.1621 | −3.2726 |
| 5 | MASEOAT | Min atomic state energy for a O atom | 8.4808 | 4.3585 | 1.9458 |
Correlation matrix of the 5 descriptors.
| Descriptor | LUMO | MRECO | KSIND | ZX | MASEOAT |
|---|---|---|---|---|---|
| LUMO | 1.0000 | ||||
| MRECO | 0.1497 | 1.0000 | |||
| KSIND | −0.5319 | −0.4830 | 1.0000 | ||
| ZX | −0.0117 | 0.3261 | 0.1729 | 1.0000 | |
| MASEOAT | 0.1171 | 0.3261 | −0.5478 | 0.6954 | 1.0000 |
Figure 2Plot of predicted log (IC50) versus experimental values for the training and test sets by HM.
Figure 3Plot of predicted log (IC50) versus experimental values for the training sets by GEP.
Figure 4Plot of predicted log (IC50) versus experimental values for the test sets by GEP.