| Literature DB >> 22580401 |
Khac-Minh Thai1, Quang-Huynh Bui, Thanh-Dao Tran, Thi-Ngoc-Phuong Huynh.
Abstract
Benzo[c]phenanthridine (BCP) derivatives were identified as topoisomerase I (TOP-I) targeting agents with pronounced antitumor activity. In this study, hologram-QSAR, 2D-QSAR and 3D-QSAR models were developed for BCPs on topoisomerase I inbibitory activity and cytotoxicity against seven tumor cell lines including RPMI8402, CPT-K5, P388, CPT45, KB3-1, KBV-1and KBH5.0. The hologram, 2D, and 3D-QSAR models were obtained with the square of correlation coefficient R² = 0.58-0.77, the square of the crossvalidation coefficient q² = 0.41-0.60 as well as the external set's square of predictive correlation coefficient r² = 0.5-0.80. Moreover, the assessment method based on reliability test with confidence level of 95% was used to validate the predictive power of QSAR models and to prevent over-fitting phenomenon of classical QSAR models. Our QSAR model could be applied to design new analogues of BCPs with higher antitumor and topoisomerase I inhibitory activity.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22580401 PMCID: PMC6268722 DOI: 10.3390/molecules17055690
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Structure of nitidine and fagaronine.
Dataset and biological activities used in this study.
| Bioactivity | Number of compounds | Number of compounds on each skeleton | |||||
|---|---|---|---|---|---|---|---|
| G1 | G2 | G3 | G4 | G5 | G6 | ||
| RPMI | 133 | 65 | 9 | 13 | 6 | 10 | 30 |
| CPTk5 | 101 | 50 | 8 | 12 | 6 | 8 | 17 |
| P388 | 82 | 53 | 1 | 4 | 5 | 9 | 10 |
| CPT45 | 73 | 45 | 0 | 4 | 5 | 9 | 10 |
| U937 | 39 | 23 | 9 | 0 | 6 | 0 | 1 |
| U937rs | 33 | 20 | 7 | 0 | 5 | 0 | 1 |
| KB3-1 | 83 | 53 | 10 | 13 | 6 | 0 | 1 |
| KBV-1 | 81 | 52 | 9 | 13 | 6 | 0 | 1 |
| KBH | 60 | 46 | 0 | 13 | 0 | 0 | 1 |
| TOP-I | 94 | 52 | 8 | 12 | 5 | 9 | 8 |
Figure 2General structural skeletons of the BCPs dataset.
Dataset division.
| Bioactivity | RPMI 8402 | CPT-K5 | P388 | CPT45 | KB3-1 | KBV-1 | KBH5.0 | TOP-I |
|---|---|---|---|---|---|---|---|---|
| Number of compounds | 133 | 101 | 82 | 73 | 83 | 81 | 60 | 94 |
| Training set | 105 | 80 | 66 | 58 | 68 | 60 | 48 | 74 |
| External test set | 28 | 21 | 16 | 15 | 15 | 21 | 12 | 20 |
Figure 3The relationship between observed and predicted data from QSAR model and its 95% confidence interval of (A) RPMI8402 cell line from 3D QSAR with steric analysis fields and (B) CPT45 cell line from 2D QSAR. Compound of training set are in blue circle and test set in red triangle.
Results of 2D-QSAR.
| Model | RPMI | CPTk5 | P388 | CPT45 | KB3-1 | KBV | KBH | TOP-I |
|---|---|---|---|---|---|---|---|---|
| Number compounds in training set | 105 | 80 | 66 | 58 | 68 | 60 | 48 | 74 |
| Number compounds in external test set | 28 | 21 | 16 | 15 | 15 | 21 | 12 | 20 |
| R2 (Training set) | 0.584 | 0.452 | 0.655 | 0.472 | 0.627 | 0.632 | 0.536 | 0.602 |
| Standard Error (Training set) | 0.543 | 0.400 | 0.271 | 0.338 | 0.414 | 0.248 | 0.218 | 0.355 |
| q2 (L.O.O.) | 0.511 | 0.302 | 0.417 | 0.230 | 0.537 | 0.474 | 0.394 | 0.475 |
| Standard Error (L.O.O.) | 0.641 | 0.522 | 0.489 | 0.527 | 0.520 | 0.364 | 0.29 | 0.477 |
| rt2 (External set) | 0.514 | 0.248 | 0.334 | 0.043 | 0.514 | 0.314 | 0.053 | 0.657 |
| Standard Error (External set) | 0.803 | 0.434 | 0.799 | 0.943 | 0.665 | 0.858 | 0.78 | 0.417 |
| p-value of model | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Number of 2D molecular descriptor | 7 | 9 | 10 | 9 | 6 | 8 | 6 | 10 |
| Greates p-value of used descriptors | 0.039 | 0.018 | 0.014 | 0.000 | 0.026 | 0.002 | 0.026 | 0.008 |
| δ | 1.45 | 1.30 | 1.10 | 1.20 | 1.25 | 1.00 | 1.00 | 1.25 |
| Assessment | + | + | + | + | + | + | + | + |
| Range of prediction | −1.5 | −1.2 | 0 | −0.5 | −0.5 | −1.2 | 0.5 | −2 |
| 0.452 | 0.655 | 0.472 | 0.632 | 0.536 | ||||
| 0.374 | 0.424 | 0.284 | 0.506 | 0.264 | ||||
| 0.413 | 0.540 | 0.378 | 0.569 | 0.400 | ||||
| 0.078 | 0.231 | 0.188 | 0.126 | 0.272 | ||||
| 0.248 | 0.256 | 0.018 | 0.278 | 0.035 | ||||
| 0.169 | 0.187 | 0.012 | 0.180 | −0.007 | ||||
| 0.208 | 0.222 | 0.015 | 0.229 | 0.014 | ||||
| 0.079 | 0.068 | 0.006 | 0.099 | 0.042 |
Results of Hologram-QSAR.
| Model | RPMI | CPTk5 | P388 | CPT45 | KB3-1 | KBV | KBH | TOP-I |
|---|---|---|---|---|---|---|---|---|
| Number compounds in training set | 105 | 80 | 66 | 58 | 68 | 60 | 48 | 74 |
| Number compounds in external test set | 28 | 21 | 16 | 15 | 15 | 21 | 12 | 20 |
| R2 (Training set) | 0.765 | 0.573 | 0.439 | 0.483 | 0.622 | 0.501 | 0.624 | 0.616 |
| Standard Error (Training set) | 0.489 | 0.428 | 0.675 | 0.586 | 0.658 | 0.418 | 0.439 | 0.594 |
| q2 (L.O.O.) | 0.568 | 0.320 | 0.182 | 0.123 | 0.514 | 0.328 | 0.482 | 0.406 |
| Standard Error (L.O.O.) | 0.777 | 0.733 | 0.828 | 0.777 | 0.757 | 0.697 | 0.516 | 0.754 |
| rt2 (External set) | 0.525 | 0.285 | 0.302 | 0.010 | 0.541 | 0.708 | 0.439 | 0.690 |
| Standard Error (External set) | 0.567 | 0.382 | 0.941 | 0.635 | 0.752 | 0.306 | 0.339 | 0.433 |
| Hologram lengths | 151 | 53 | 353 | 199 | 61 | 71 | 59 | 307 |
| Principal components | 6 | 5 | 6 | 3 | 3 | 3 | 3 | 4 |
| Limitation of atoms in each fragment | 5–10 | 5–10 | 5–8 | 5–6 | 2–8 | 5–6 | 5–7 | 1–7 |
| p-value | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| δ | 1.10 | 1.20 | 1.30 | 1.20 | 1.35 | 1.2 | 0.80 | 1.20 |
| Assessment | + | + | + | + | + | + | + | + |
| Range of prediction | −1 | −1.5 | 0 | 0.2 | 0 | −0.5 | 0.5 | −2 |
| 0.572 | 0.439 | 0.482 | 0.624 | |||||
| 0.504 | 0.132 | 0.297 | 0.387 | |||||
| 0.538 | 0.286 | 0.389 | 0.506 | |||||
| 0.068 | 0.307 | 0.185 | 0.237 | |||||
| 0.276 | 0.289 | 0.006 | 0.367 | |||||
| 0.159 | −0.033 | 0.005 | −0.054 | |||||
| 0.217 | 0.128 | 0.005 | 0.154 | |||||
| 0.117 | 0.322 | 0.001 | 0.415 |
Results of 3D-QSAR.
| Model | RPMI-fs43 | RPMI-fs45 | CPTk5-sh44 | P388-s15 | CPT45-s53 | KB3-s34 | KB3-e12 | KB3-h34 | KB3-eh32 | KBV-s34 | KBH-fs43 | TOP-I -s34 | TOP-I -h54 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number compounds in training set | 105 | 105 | 80 | 66 | 58 | 68 | 68 | 68 | 68 | 60 | 48 | 74 | 74 |
| Number compounds in test set | 28 | 28 | 21 | 16 | 15 | 15 | 15 | 15 | 15 | 21 | 12 | 20 | 20 |
| R2 (Training set) | 0.734 | 0.812 | 0.650 | 0.667 | 0.496 | 0.721 | 0.698 | 0.768 | 0.731 | 0.629 | 0.696 | 0.701 | 0.700 |
| Standard Error (Training set) | 0.601 | 0.510 | 0.522 | 0.537 | 0.589 | 0.597 | 0.593 | 0.528 | 0.559 | 0.523 | 0.394 | 0.535 | 0.536 |
| q2 (L.O.O.) | 0.594 | 0.607 | 0.338 | 0.309 | 0.203 | 0.584 | 0.552 | 0.539 | 0.582 | 0.372 | 0.330 | 0.423 | 0.345 |
| Standard Error (L.O.O.) | 0.742 | 0.737 | 0.718 | 0.774 | 0.741 | 0.707 | 0.722 | 0.744 | 0.697 | 0.680 | 0.586 | 0.743 | 0.792 |
| rt2 (External set) | 0.685 | 0.701 | 0.471 | 0.570 | 0.202 | 0.661 | 0.496 | 0.636 | 0.620 | 0.436 | 0.282 | 0.795 | 0.836 |
| Standard Error (External set) | 0.742 | 0.724 | 0.545 | 0.739 | 0.570 | 0.647 | 0.789 | 0.670 | 0.686 | 0.857 | 0.796 | 0.518 | 0.338 |
| 3D-descriptor | Fs | fs | sh | S | S | S | e | h | eh | s | Fs | S | H |
| Column filter | 4 | 4 | 4 | 1 | 5 | 3 | 1 | 3 | 3 | 3 | 4 | 3 | 5 |
| Principal component | 3 | 5 | 4 | 5 | 3 | 4 | 2 | 4 | 2 | 4 | 3 | 4 | 4 |
| p-value | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| δ | 1.10 | 1.00 | 1.05 | 1.05 | 1.20 | 1.15 | 1.20 | 1.10 | 1.15 | 1.00 | 0.75 | 1.10 | 1.2 |
| Assessment | + | - | + | + | + | + | + | + | + | +/− | + | + | + |
| Range of prediction | −1.5 | − | −1.5 | −0.2 | −0.2 | −0.5 | 0 | −0.2 | −0.2 | 0 | −0.5 | −2.5 | −2.5 |
| 0.812 | 0.650 | 0.667 | 0.496 | 0.697 | 0.767 | 0.730 | 0.628 | 0.696 | 0.700 | ||||
| 0.703 | 0.592 | 0.442 | 0.312 | 0.531 | 0.634 | 0.579 | 0.501 | 0.494 | 0.577 | ||||
| 0.758 | 0.621 | 0.554 | 0.404 | 0.614 | 0.701 | 0.654 | 0.564 | 0.595 | 0.638 | ||||
| 0.109 | 0.058 | 0.225 | 0.183 | 0.166 | 0.133 | 0.151 | 0.127 | 0.202 | 0.123 | ||||
| 0.324 | 0.429 | 0.569 | 0.188 | 0.461 | 0.636 | 0.591 | 0.332 | 0.282 | 0.523 | ||||
| −0.451 | 0.294 | 0.332 | 0.063 | 0.162 | 0.409 | 0.349 | −0.014 | −0.009 | 0.318 | ||||
| 0.064 | 0.362 | 0.450 | 0.126 | 0.312 | 0.522 | 0.470 | 0.159 | 0.137 | 0.421 | ||||
| 0.775 | 0.135 | 0.237 | 0.125 | 0.299 | 0.278 | 0.242 | 0.346 | 0.291 | 0.206 |
Figure 4The summary for BCPs structures—antitumor activity relationship.
Figure 5Process of combined QSAR studies.
Chemical structure of 137 benzo[c]phenanthridine analogues.
Description of 35 molecular descriptors using to create the 2D QSAR models.
| No | Molecular descriptor | Description |
|---|---|---|
| 1 | a_acc | Number of hydrogen bond acceptor atoms |
| 1 | a_acc | Number of hydrogen bond acceptor atoms |
| 2 | a_aro | Number of aromatic atoms |
| 3 | a_ICM | Atom information content (mean). This is the entropy of the element distribution in the molecule (including implicit hydrogens but not lone pair pseudo-atoms). |
| 4 | a_nN | Number of nitrogen atoms: #{Zi | Zi = 7}. |
| 5 | a_nO | Number of oxygen atoms: #{Zi | Zi = 8}. |
| 6 | b_1rotR | Fraction of rotatable single bonds: b_1rotN divided by b_heavy. |
| 7 | BCUT_PEOE_0 | The BCUT descriptors [Pearlman 1998] are calculated from the eigenvalues of a modified adjacency matrix. |
| 8 | BCUT_PEOE_1 | |
| 9 | BCUT_PEOE_2 | |
| 10 | chi1v_C | Carbon valence connectivity index (order 1). |
| 11 | density | Molecular mass density: Weight divided by vdw_vol (amu/Å3). |
| 12 | diameter | Largest value in the distance matrix |
| 13 | GCUT_PEOE_1 | The GCUT descriptors are calculated from the eigenvalues of a modified graph distance adjacency matrix. |
| 14 | GCUT_SLOGP_0 | The GCUT descriptors using atomic contribution to logP instead of partial charge. |
| 15 | GCUT_SLOGP_1 | |
| 16 | GCUT_SMR_0 | The GCUT descriptors using atomic contribution to molar refractivity instead of partial charge. |
| 17 | opr_leadlike | Atom Counts and Bond Counts: One if and only if opr_violation < 2 otherwise zero. |
| 18 | PEOE_VSA_FHYD | Fractional hydrophobic van der Waals surface area. |
| 19 | PEOE_VSA_FNEG | Fractional negative van der Waals surface area. |
| 20 | PEOE_VSA_NEG | Total negative van der Waals surface area. |
| 21 | PEOE_VSA+0 | Sum of |
| 22 | PEOE_VSA+1 | PEOE: Sum of |
| 23 | PEOE_VSA+2 | PEOE: Sum of |
| 24 | PEOE_VSA+3 | PEOE: Sum of |
| 25 | PEOE_VSA-0 | PEOE: Sum of |
| 26 | PEOE_VSA-1 | PEOE: Sum of |
| 27 | petitjean | Largest value in the distance matrix |
| 28 | SlogP | Log of the octanol/water partition coefficient (including implicit hydrogens). |
| 29 | SlogP_VSA1 | Subdivided Surface Areas: Sum of |
| 30 | SlogP_VSA5 | Subdivided Surface Areas: Sum of |
| 31 | SlogP_VSA9 | Subdivided Surface Areas: Sum of |
| 32 | VDistMa | Adjacency and Distance Matrix Descriptors: If |
| 33 | vsa_acc | Approximation to the sum of VDW surface areas (Å2) of pure hydrogen bond acceptors |
| 34 | vsa_other | Approximation to the sum of VDW surface areas (Å2) of atoms typed as “other”. |
| 35 | vsa_pol | Approximation to the sum of VDW surface areas (Å2) of polar atoms (atoms that are both hydrogen bond donors and acceptors), such as -OH. |
Figure 6Structure of BMC_05_6782_9b and 3D alignment of 137 BCPs chemical structures.
Cross-validation results of 3D-QSAR with KB3-1 cells.
| 3D descriptor field | q2 for each column filter values | ||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | |
| S | 0.583 | 0.583 | 0.584 | 0.579 | 0.575 |
| E | 0.552 | 0.551 | 0.553 | 0.540 | 0.542 |
| H | 0.542 | 0.541 | 0.539 | 0.509 | 0.485 |
| D | 0.014 | 0.013 | 0.004 | 0.000 | 0.000 |
| A | 0.414 | 0.415 | 0.417 | 0.423 | 0.429 |
| s.e | 0.580 | 0.580 | 0.580 | 0.576 | 0.576 |
| s.h | 0.551 | 0.555 | 0.555 | 0.545 | 0.536 |
| s.d | 0.448 | 0.446 | 0.473 | 0.485 | 0.502 |
| s.a | 0.498 | 0.507 | 0.501 | 0.503 | 0.505 |
| e.h | 0.582 | 0.581 | 0.582 | 0.571 | 0.560 |
| e.d | 0.560 | 0.562 | 0.568 | 0.545 | 0.534 |
| e.a | 0.561 | 0.558 | 0.552 | 0.550 | 0.547 |
| h.d | 0.437 | 0.438 | 0.438 | 0.411 | 0.406 |
| h.a | 0.466 | 0.465 | 0.465 | 0.463 | 0.467 |
| d.a | 0.409 | 0.412 | 0.417 | 0.423 | 0.427 |
| s.e.h | 0.585 | 0.581 | 0.584 | 0.579 | 0.570 |
| s.e.d | 0.578 | 0.578 | 0.586 | 0.584 | 0.575 |
| s.e.a | 0.563 | 0.564 | 0.562 | 0.561 | 0.558 |
| s.h.d | 0.514 | 0.527 | 0.533 | 0.528 | 0.516 |
| s.h.a | 0.511 | 0.515 | 0.511 | 0.510 | 0.511 |
| s.d.a | 0.499 | 0.507 | 0.514 | 0.530 | 0.509 |
| e.h.d | 0.563 | 0.565 | 0.573 | 0.560 | 0.552 |
| e.h.a | 0.560 | 0.557 | 0.551 | 0.546 | 0.540 |
| e.d.a | 0.526 | 0.528 | 0.533 | 0.531 | 0.520 |
| h.d.a | 0.444 | 0.446 | 0.445 | 0.443 | 0.449 |
| s.e.h.d | 0.572 | 0.574 | 0.579 | 0.571 | 0.562 |
| s.e.h.a | 0.569 | 0.569 | 0.565 | 0.561 | 0.555 |
| s.e.d.a | 0.548 | 0.551 | 0.555 | 0.558 | 0.544 |
| s.h.d.a | 0.493 | 0.507 | 0.515 | 0.518 | 0.501 |
| e.h.d.a | 0.528 | 0.529 | 0.530 | 0.523 | 0.513 |
| s.e.h.d.a | 0.545 | 0.547 | 0.543 | 0.540 | 0.537 |
| CoMFA | 0.549 | 0.549 | 0.546 | 0.547 | 0.551 |
s: steric, e: electrostatic, h: hydrophobic; d: H-bond donor; a: H- bond acceptor.