| Literature DB >> 28911371 |
Hayriye Yilmaz1, Natalia Sizochenko2, Bakhtiyor Rasulev3, Andrey Toropov4, Yahya Guzel5, Viktor Kuz'min6, Danuta Leszczynska7, Jerzy Leszczynski8.
Abstract
A quantitative structure-activity relationship (QSAR) study was performed on a set of amino-substituted nitrogen heterocyclic urea derivatives. Two novel approaches were applied: (1) the simplified molecular input-line entry systems (SMILES) based optimal descriptors approach; and (2) the fragment-based simplex representation of molecular structure (SiRMS) approach. Comparison with the classic scheme of building up the model and balance of correlation (BC) for optimal descriptors approach shows that the BC scheme provides more robust predictions than the classic scheme for the considered pIC50 of the heterocyclic urea derivatives. Comparison of the SMILES-based optimal descriptors and SiRMS approaches has confirmed good performance of both techniques in prediction of kinase insert domain containing receptor (KDR) inhibitory activity, expressed as a logarithm of inhibitory concentration (pIC50) of studied compounds.Entities:
Keywords: KDR inhibitors; QSAR; SMILES; SiRMS; amino-substituted nitrogen heterocyclic ureas; descriptors
Year: 2015 PMID: 28911371 PMCID: PMC9351780 DOI: 10.1016/j.jfda.2015.03.001
Source DB: PubMed Journal: J Food Drug Anal Impact factor: 6.157
Statistical quality of models developed by the CORAL approach.
| Trshd | Nact | Probe | Training set | Calibration set | Test set | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| |||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Split 1 Balance of correlations | |||||||||||||||
| 0 | 93 | 1 | 39 | 0.8506 | 0.313 | 211 | 13 | 0.9815 | 0.531 | 585 | 11 | 0.7537 | 0.497 | 28 | 0.7369 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 0 | 93 | 3 | 39 | 0.8496 | 0.314 | 209 | 13 | 0.9873 | 0.513 | 852 | 11 | 0.7502 | 0.500 | 27 | 0.7468 |
| 0 | 0.8508 | 0.313 | 211 | 0.9846 | 0.518 | 722 | 0.7637 | 0.487 | 29 | 0.7557 | |||||
| 1 | 90 | 1 | 39 | 0.8482 | 0.316 | 207 | 13 | 0.9887 | 0.506 | 962 | 11 | 0.7413 | 0.513 | 26 | 0.7320 |
| 1 | 90 | 2 | 39 | 0.8460 | 0.318 | 203 | 13 | 0.9850 | 0.517 | 723 | 11 | 0.7526 | 0.505 | 27 | 0.7408 |
| 1 | 90 | 3 | 39 | 0.8505 | 0.313 | 210 | 13 | 0.9871 | 0.516 | 840 | 11 | 0.7047 | 0.548 | 21 | 0.6737 |
| 1 | 0.8482 | 0.316 | 207 | 0.9869 | 0.513 | 842 | 0.7329 | 0.522 | 25 | 0.7155 | |||||
| 2 | 73 | 1 | 39 | 0.8329 | 0.331 | 184 | 13 | 0.9816 | 0.580 | 587 | 11 | 0.7218 | 0.527 | 23 | 0.7212 |
| 2 | 73 | 2 | 39 | 0.8403 | 0.324 | 195 | 13 | 0.9762 | 0.554 | 450 | 11 | 0.7309 | 0.519 | 24 | 0.7065 |
| 2 | 73 | 3 | 39 | 0.8294 | 0.335 | 180 | 13 | 0.9839 | 0.560 | 674 | 11 | 0.7245 | 0.528 | 24 | 0.6873 |
| 2 | 0.8342 | 0.330 | 186 | 0.9806 | 0.565 | 570 | 0.7257 | 0.524 | 24 | 0.7050 | |||||
| Split 1 Classic scheme | |||||||||||||||
| 0 | 93 | 1 | 52 | 0.8648 | 0.295 | 320 | 11 | 0.7990 | 0.485 | 36 | 0.6607 | ||||
| 0 | 93 | 2 | 52 | 0.8647 | 0.295 | 319 | 11 | 0.7137 | 0.591 | 22 | 0.5350 | ||||
| 0 | 93 | 3 | 52 | 0.8630 | 0.297 | 315 | 11 | 0.7789 | 0.515 | 32 | 0.6551 | ||||
| 0 | 0.8642 | 0.295 | 318 | 0.7639 | 0.530 | 30 | 0.6169 | ||||||||
| 1 | 90 | 1 | 52 | 0.8654 | 0.294 | 321 | 11 | 0.7616 | 0.535 | 29 | 0.6273 | ||||
| 1 | 90 | 2 | 52 | 0.8652 | 0.294 | 321 | 11 | 0.7855 | 0.510 | 33 | 0.6356 | ||||
| 1 | 90 | 3 | 52 | 0.8668 | 0.293 | 325 | 11 | 0.8001 | 0.531 | 36 | 0.6684 | ||||
| 1 | 0.8658 | 0.294 | 323 | 0.7824 | 0.525 | 33 | 0.6438 | ||||||||
| 2 | 73 | 1 | 52 | 0.8588 | 0.301 | 304 | 11 | 0.7484 | 0.525 | 27 | 0.6310 | ||||
| 2 | 73 | 2 | 52 | 0.8564 | 0.304 | 298 | 11 | 0.7655 | 0.497 | 29 | 0.6812 | ||||
| 2 | 73 | 3 | 52 | 0.8544 | 0.306 | 293 | 11 | 0.7564 | 0.504 | 28 | 0.6847 | ||||
| 2 | 0.8565 | 0.304 | 299 | 0.7568 | 0.508 | 28 | 0.6656 | ||||||||
| Split 2 Balance of correlations | |||||||||||||||
| 0 | 29 | 1 | 42 | 0.8023 | 0.395 | 162 | 11 | 0.8870 | 0.439 | 71 | 10 | 0.7752 | 0.400 | 28 | 0.5824 |
| 0 | 29 | 2 | 42 | 0.8036 | 0.394 | 164 | 11 | 0.8848 | 0.425 | 69 | 10 | 0.7629 | 0.417 | 26 | 0.5875 |
| 0 | 29 | 3 | 42 | 0.8024 | 0.395 | 162 | 11 | 0.8879 | 0.438 | 71 | 10 | 0.5886 | 0.560 | 11 | 0.4300 |
| 0 | 0.8028 | 0.395 | 163 | 0.8866 | 0.434 | 70 | 0.7089 | 0.459 | 22 | 0.5333 | |||||
| 1 | 29 | 1 | 42 | 0.8027 | 0.395 | 163 | 11 | 0.8857 | 0.417 | 70 | 10 | 0.6891 | 0.479 | 18 | 0.5334 |
| 1 | 29 | 2 | 42 | 0.8002 | 0.397 | 160 | 11 | 0.8868 | 0.439 | 71 | 10 | 0.6689 | 0.496 | 16 | 0.5318 |
| 1 | 29 | 3 | 42 | 0.8025 | 0.395 | 163 | 11 | 0.8872 | 0.446 | 71 | 10 | 0.7280 | 0.438 | 21 | 0.5773 |
| 1 | 0.8018 | 0.396 | 162 | 0.8866 | 0.434 | 70 | 0.6953 | 0.471 | 18 | 0.5475 | |||||
| 2 | 28 | 1 | 42 | 0.6852 | 0.499 | 87 | 11 | 0.8873 | 0.389 | 71 | 10 | 0.7887 | 0.397 | 30 | 0.5450 |
| 2 | 28 | 2 | 42 | 0.6867 | 0.497 | 88 | 11 | 0.8870 | 0.389 | 71 | 10 | 0.6295 | 0.532 | 14 | 0.3921 |
| 2 | 28 | 3 | 42 | 0.6838 | 0.500 | 87 | 11 | 0.8898 | 0.404 | 73 | 10 | 0.7627 | 0.409 | 26 | 0.5067 |
| 2 | 0.6852 | 0.499 | 87 | 0.8881 | 0.394 | 71 | 0.7270 | 0.446 | 23 | 0.4813 | |||||
| Split 2 Classic scheme | |||||||||||||||
| 0 | 29 | 1 | 53 | 0.8078 | 0.383 | 214 | 10 | 0.7227 | 0.500 | 21 | 0.5357 | ||||
| 0 | 29 | 2 | 53 | 0.8057 | 0.385 | 211 | 10 | 0.7701 | 0.454 | 27 | 0.5577 | ||||
| 0 | 29 | 3 | 53 | 0.8067 | 0.384 | 213 | 10 | 0.5627 | 0.633 | 10 | 0.3850 | ||||
| 0 | 0.8067 | 0.384 | 213 | 0.6852 | 0.529 | 19 | 0.4928 | ||||||||
| 1 | 29 | 1 | 53 | 0.8081 | 0.382 | 215 | 10 | 0.7612 | 0.458 | 25 | 0.5240 | ||||
| 1 | 29 | 2 | 53 | 0.8075 | 0.383 | 214 | 10 | 0.7213 | 0.498 | 21 | 0.5341 | ||||
| 1 | 29 | 3 | 53 | 0.8078 | 0.383 | 214 | 10 | 0.7077 | 0.511 | 19 | 0.5139 | ||||
| 1 | 0.8078 | 0.383 | 214 | 0.7301 | 0.489 | 22 | 0.5240 | ||||||||
| 2 | 28 | 1 | 53 | 0.7094 | 0.470 | 125 | 10 | 0.7628 | 0.474 | 26 | 0.5223 | ||||
| 2 | 28 | 2 | 53 | 0.7123 | 0.468 | 126 | 10 | 0.7498 | 0.474 | 24 | 0.4727 | ||||
| 2 | 28 | 3 | 53 | 0.7124 | 0.468 | 126 | 10 | 0.6805 | 0.540 | 17 | 0.4303 | ||||
| 2 | 0.7114 | 0.469 | 126 | 0.7311 | 0.496 | 22 | 0.4751 | ||||||||
| Split 3 Balance of correlations | |||||||||||||||
| 0 | 31 | 1 | 40 | 0.7755 | 0.372 | 131 | 13 | 0.9646 | 0.348 | 300 | 10 | 0.5978 | 0.844 | 12 | 0.5730 |
| 0 | 31 | 2 | 40 | 0.7762 | 0.371 | 132 | 13 | 0.9628 | 0.355 | 285 | 10 | 0.5654 | 0.896 | 10 | 0.5378 |
| 0 | 31 | 3 | 40 | 0.7734 | 0.373 | 130 | 13 | 0.9662 | 0.357 | 314 | 10 | 0.5860 | 0.857 | 11 | 0.5695 |
| 0 | 0.7750 | 0.372 | 131 | 0.9645 | 0.354 | 300 | 0.5831 | 0.866 | 11 | 0.5601 | |||||
| 1 | 31 | 1 | 40 | 0.7775 | 0.370 | 133 | 13 | 0.9641 | 0.361 | 295 | 10 | 0.5876 | 0.862 | 11 | 0.5638 |
| 1 | 31 | 2 | 40 | 0.7737 | 0.373 | 130 | 13 | 0.9659 | 0.355 | 312 | 10 | 0.5792 | 0.857 | 11 | 0.5701 |
| 1 | 31 | 3 | 40 | 0.7758 | 0.371 | 132 | 13 | 0.9603 | 0.333 | 266 | 10 | 0.5612 | 0.918 | 10 | 0.5226 |
| 1 | 0.7757 | 0.371 | 131 | 0.9634 | 0.350 | 291 | 0.5760 | 0.879 | 11 | 0.5522 | |||||
| 2 | 28 | 1 | 40 | 0.6032 | 0.494 | 58 | 13 | 0.9515 | 0.273 | 216 | 10 | 0.6680 | 0.910 | 16 | 0.4494 |
| 2 | 28 | 2 | 40 | 0.6039 | 0.494 | 58 | 13 | 0.9529 | 0.269 | 223 | 10 | 0.6675 | 0.910 | 16 | 0.4503 |
| 2 | 28 | 3 | 40 | 0.6069 | 0.492 | 59 | 13 | 0.9511 | 0.264 | 214 | 10 | 0.6727 | 0.896 | 16 | 0.4597 |
| 2 | 0.6047 | 0.493 | 58 | 0.9518 | 0.269 | 217 | 0.6694 | 0.905 | 16 | 0.4532 | |||||
| Split 3 Classic scheme | |||||||||||||||
| 0 | 29 | 1 | 53 | 0.8125 | 0.338 | 221 | 10 | 0.6292 | 0.777 | 14 | 0.6130 | ||||
| 0 | 29 | 2 | 53 | 0.8131 | 0.338 | 222 | 10 | 0.6174 | 0.794 | 13 | 0.5942 | ||||
| 0 | 29 | 3 | 53 | 0.8119 | 0.339 | 220 | 10 | 0.6407 | 0.756 | 14 | 0.6104 | ||||
| 0 | 0.8125 | 0.338 | 221 | 0.6291 | 0.776 | 14 | 0.6058 | ||||||||
| 1 | 29 | 1 | 53 | 0.8125 | 0.338 | 221 | 10 | 0.6233 | 0.789 | 13 | 0.6047 | ||||
| 1 | 29 | 2 | 53 | 0.8130 | 0.338 | 222 | 10 | 0.6139 | 0.795 | 13 | 0.5857 | ||||
| 1 | 29 | 3 | 53 | 0.8120 | 0.339 | 220 | 10 | 0.6307 | 0.769 | 14 | 0.6062 | ||||
| 1 | 0.8125 | 0.338 | 221 | 0.6226 | 0.785 | 13 | 0.5989 | ||||||||
| 2 | 28 | 1 | 53 | 0.6817 | 0.441 | 109 | 10 | 0.7703 | 0.735 | 27 | 0.5530 | ||||
| 2 | 28 | 2 | 53 | 0.6835 | 0.440 | 110 | 10 | 0.7601 | 0.756 | 25 | 0.5410 | ||||
| 2 | 28 | 3 | 53 | 0.6846 | 0.439 | 111 | 10 | 0.7428 | 0.788 | 23 | 0.5216 | ||||
| 2 | 0.6833 | 0.440 | 110 | 0.7577 | 0.759 | 25 | 0.5385 | ||||||||
The values in bold are values for the best selected model.
c = calibration set; F = Fischer ratio; n = number of compounds in the set; Nact = number of SMILES attributes involved in building up a model; probe = number of runs of the Monte Carlo method calculation; r = correlation coefficient; s = root-mean-standard error; t = training set; Thrsd = threshold; v = test (validation) set.
Fig. 1Graphical representation of the model calculated with Equation 4 (CORAL).
Experimental and calculated values of the activity pIC50 for 63 amino-substituted nitrogen heterocyclic ureas (CORAL).
| Set | SMILES | DCW | Exp | Calc | Exp–Calc | ID |
|---|---|---|---|---|---|---|
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1ccccc1 | 69.700 | 7.190 | 7.331 | −0.141 | 1 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)F | 72.280 | 7.640 | 7.624 | 0.016 | 4 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1ccc(cc1)C | 76.495 | 7.920 | 8.103 | −0.183 | 7 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)CC | 77.901 | 8.220 | 8.263 | −0.043 | 8 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)Cl | 77.552 | 8.100 | 8.223 | −0.123 | 9 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)Br | 69.940 | 7.440 | 7.358 | 0.082 | 10 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)C(F)(F)F | 77.679 | 8.000 | 8.238 | −0.238 | 11 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)O | 68.989 | 7.260 | 7.250 | 0.010 | 12 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cc(ccc1F)C | 72.783 | 8.400 | 7.681 | 0.719 | 13 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1ccc(c(c1)C)F | 75.566 | 8.400 | 7.998 | 0.402 | 14 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1ccc(c(c1)F)C | 74.699 | 7.440 | 7.899 | −0.459 | 15 |
| + | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cc(ccc1F)C(F)(F)F | 73.656 | 7.050 | 7.781 | −0.731 | 16 |
| + | n1n(c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)C)C | 68.176 | 7.960 | 7.158 | 0.802 | 17 |
| + | n1n(c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cc(ccc1F)C)CCOC | 60.506 | 5.510 | 6.287 | −0.777 | 19 |
| + | n1[nH]c2c(c1N)c(ccc2C)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C | 80.405 | 8.520 | 8.547 | −0.027 | 20 |
| + | n1[nH]c2c(c1N)c(ccc2OC)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 76.195 | 7.590 | 8.069 | −0.479 | 21 |
| + | n1[nH]c2c(c1N)c(ccc2F)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 78.237 | 8.300 | 8.301 | −0.001 | 22 |
| + | n1[nH]c2c(c1N)c(ccc2OCCN(CC)CC)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 71.920 | 7.460 | 7.583 | −0.123 | 25 |
| + | n1[nH]c2c(c1N)c(ccc2OCCN1CCCC1=O)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C | 72.061 | 7.600 | 7.599 | 0.001 | 27 |
| + | n1[nH]c2c(c1N)c(ccc2OCCOC)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 72.237 | 7.680 | 7.619 | 0.061 | 28 |
| + | n1[nH]c2c(c1N)c(ccc2CNN1CCOCC1)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C | 61.583 | 6.410 | 6.409 | 0.001 | 29 |
| + | n1[nH]c2c(c1N)c(ccc2OCCOC)c1ccc(cc1)NC(=O)Nc1cc(ccc1)Cl | 73.293 | 7.890 | 7.739 | 0.151 | 31 |
| + | n1[nH]c2c(c1N)c(ccc2OCCOC)c1ccc(cc1)NC(=O)Nc1c(ccc(c1)C)F | 71.308 | 7.680 | 7.514 | 0.166 | 32 |
| + | n1[nH]c2c(c1N)c(ccc2OCCN1CCOCC1)c1ccc(cc1)NC(=O)Nc1c(ccc(c1)C)F | 65.316 | 7.210 | 6.833 | 0.377 | 34 |
| + | n1n(c2c(c1NC(=O)C)c(ccc2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)C | 48.445 | 4.920 | 4.917 | 0.003 | 35 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)Br | 68.721 | 7.140 | 7.220 | −0.080 | 37 |
| + | c1cc2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N | 72.939 | 7.330 | 7.699 | −0.369 | 38 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)c1ccccc1 | 72.105 | 7.660 | 7.604 | 0.056 | 39 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)c1ccsc1 | 75.226 | 7.960 | 7.959 | 0.001 | 42 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)c1ccc2c(c1)OCO2 | 73.421 | 7.770 | 7.754 | 0.016 | 44 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)c1nn(cc1)C | 79.131 | 8.400 | 8.403 | −0.003 | 46 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)C#N | 58.251 | 6.030 | 6.031 | −0.001 | 47 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)C(=O)NC | 65.586 | 7.000 | 6.864 | 0.136 | 49 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)C(=O)N1CCN(CC1)C | 59.955 | 6.100 | 6.224 | −0.124 | 50 |
| + | c1cc2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C(F)(F)F)N | 73.255 | 8.050 | 7.735 | 0.315 | 54 |
| + | c1cc2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)Cl)N | 70.138 | 7.360 | 7.381 | −0.021 | 55 |
| + | c1cc2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1c(ccc(c1)C(F)(F)F)F)N | 71.148 | 7.850 | 7.496 | 0.354 | 57 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C(F)(F)F)N)c1cn(nc1)C | 78.680 | 8.520 | 8.351 | 0.169 | 58 |
| + | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(c(cc1)F)C(F)(F)F)N)c1cn(nc1)C~ | 76.573 | 8.220 | 8.112 | 0.108 | 61 |
| − | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 76.495 | 8.520 | 8.103 | 0.417 | 2 |
| − | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1ccccc1F | 70.364 | 7.090 | 7.407 | −0.317 | 3 |
| − | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1ccc(cc1)F | 72.280 | 7.170 | 7.624 | −0.454 | 5 |
| − | n1[nH]c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1ccccc1C | 70.813 | 7.060 | 7.458 | −0.398 | 6 |
| − | n1[nH]c2c(c1N)c(ccc2OCCN1CCCC1)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 73.014 | 7.510 | 7.708 | −0.198 | 26 |
| − | n1[nH]c2c(c1N)c(ccc2CN1CCN(CC1)C)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C | 65.923 | 5.920 | 6.902 | −0.982 | 30 |
| − | n1[nH]c2c(c1N)c(ccc2OCCN(CC)CC)c1ccc(cc1)NC(=O)Nc1c(ccc(c1)C)F | 70.991 | 7.130 | 7.478 | −0.348 | 33 |
| − | c1cc2n(n1)c(c(c(n2)C)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N | 72.643 | 7.410 | 7.666 | −0.256 | 40 |
| − | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)C1CC1 | 74.085 | 7.800 | 7.829 | −0.029 | 43 |
| − | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)C(=O)OCC | 73.238 | 7.600 | 7.733 | −0.133 | 48 |
| − | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1ccccc1)N)Br | 65.435 | 5.800 | 6.847 | −1.047 | 53 |
| − | c1cc2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1c(ccc(c1)C)F)N | 71.142 | 7.130 | 7.495 | −0.365 | 56 |
| − | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1c(ccc(c1)C)F)N)c1cn(nc1)C | 76.567 | 8.300 | 8.111 | 0.189 | 60 |
| # | n1n(c2c(c1N)c(ccc2)c1ccc(cc1)NC(=O)Nc1cccc(c1)C)CCO | 57.426 | 6.220 | 5.937 | 0.283 | 18 |
| # | n1[nH]c2c(c1N)c(ccc2Br)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 69.665 | 8.000 | 7.327 | 0.673 | 23 |
| # | n1[nH]c2c(c1N)c(ccc2OCCN(C)C)c1ccc(cc1)NC(=O)Nc1cccc(c1)C | 69.109 | 7.420 | 7.264 | 0.156 | 24 |
| # | n1n(c2c(c1N(C)C)c(ccc2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)C | 51.535 | 5.490 | 5.268 | 0.222 | 36 |
| # | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)c1cccs1 | 73.770 | 7.820 | 7.794 | 0.026 | 41 |
| # | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)c1cnccc1 | 75.269 | 8.050 | 7.964 | 0.086 | 45 |
| # | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)C)N)C(=O)NCCN(CC)CC | 61.312 | 6.300 | 6.378 | −0.078 | 51 |
| # | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1ccc(cc1)C)N)Br | 68.721 | 6.090 | 7.220 | −1.130 | 52 |
| # | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)Cl)N)c1cn(nc1)C | 75.563 | 8.400 | 7.997 | 0.403 | 59 |
| # | c1c(c2n(n1)c(c(cn2)c1ccc(cc1)NC(=O)Nc1cc(ccc1)F)N)c1cn(nc1)C | 73.281 | 8.000 | 7.738 | 0.262 | 62 |
| # | c12c(c(c(cc1)c1ccc(cc1)NC(=O)Nc1cccc(c1)C)N)nccn2 | 66.390 | 6.770 | 6.955 | −0.185 | 63 |
CORAL = CORrelations And Logic; SMILES = simplified molecular input-line entry systems; DCW = descriptor of correlation weights.
Exp and Calc are experimental and calculated pIC50; “+”, “−”, and “#” are indicators for the training, calibration, and test sets, respectively.
Correlation weights for calculation of DCW (SMILES) used in Eq. (1).
| SAk | CW (SAk) | Ntrain | Ncalib | Ntest |
|---|---|---|---|---|
| #. | −2.44150 | 1 | 0 | 0 |
| (...(.. | 7.79887 | 6 | 0 | 0 |
| (. | −0.95894 | 39 | 13 | 11 |
| ++++B2–B3== | −1.52444 | 1 | 0 | 0 |
| ++++F—B2== | 0.26281 | 14 | 5 | 1 |
| ++++F—N=== | −0.16006 | 14 | 5 | 1 |
| ++++F—O=== | 0.20794 | 14 | 5 | 1 |
| ++++CL–N=== | 0.33213 | 3 | 0 | 1 |
| ++++CL–O=== | 1.13663 | 3 | 0 | 1 |
| ++++Br–B2== | −0.13863 | 2 | 1 | 2 |
| ++++Br–N=== | −2.68450 | 2 | 1 | 2 |
| ++++Br–O=== | 0.03606 | 2 | 1 | 2 |
| ++++Cl–B2== | 2.87200 | 3 | 0 | 1 |
| ++++N—B2== | 8.05269 | 39 | 13 | 11 |
| ++++N—B3== | −3.19050 | 1 | 0 | 0 |
| ++++N—O=== | 9.31931 | 39 | 13 | 11 |
| ++++N—S=== | −0.19131 | 1 | 0 | 1 |
| ++++O—B2== | 8.84956 | 39 | 13 | 11 |
| ++++O—B3== | −4.31250 | 1 | 0 | 0 |
| ++++O—S=== | 0.99619 | 1 | 0 | 1 |
| ++++S—B2== | −2.30469 | 1 | 0 | 1 |
| 1...(.. | 3.79988 | 39 | 13 | 11 |
| 1. | −0.79406 | 39 | 13 | 11 |
| 2...(.. | 1.41125 | 29 | 10 | 8 |
| 2. | −1.97956 | 39 | 13 | 11 |
| 2...1.. | 0.99500 | 0 | 0 | 1 |
| =...(.. | 1.90125 | 39 | 13 | 11 |
| =. | 0.57331 | 39 | 13 | 11 |
| =...1.. | 5.67387 | 1 | 0 | 0 |
| C...#.. | −3.43750 | 1 | 0 | 0 |
| C...(.. | 0.29106 | 39 | 13 | 11 |
| C. | −0.42569 | 39 | 13 | 11 |
| C...1.. | 1.26863 | 4 | 4 | 0 |
| C...2.. | 5.18650 | 2 | 1 | 0 |
| C...C.. | 1.87500 | 10 | 5 | 3 |
| BOND10000000 | 9.25481 | 38 | 13 | 11 |
| BOND11000000 | −2.22175 | 1 | 0 | 0 |
| F...(.. | −0.57613 | 14 | 4 | 1 |
| F. | −0.28706 | 14 | 5 | 1 |
| F...1.. | 4.16887 | 3 | 1 | 0 |
| F...2.. | 7.23338 | 1 | 0 | 0 |
| EC0-C...1... | 2.28525 | 29 | 10 | 11 |
| EC0-C...2... | −0.04388 | 39 | 13 | 11 |
| EC0-C...3... | −2.05969 | 39 | 13 | 11 |
| EC0-C...4... | 7.18350 | 6 | 0 | 0 |
| EC0-F...1... | −1.51262 | 14 | 5 | 1 |
| EC0-Br..1... | −0.00681 | 2 | 1 | 2 |
| EC0-Cl..1... | 1.99719 | 3 | 0 | 1 |
| EC0-N...1... | 15.74219 | 38 | 13 | 10 |
| EC0-N...2... | −1.69731 | 39 | 13 | 11 |
| EC0-N...3... | −3.86419 | 21 | 9 | 9 |
| EC0-O...1... | −5.31931 | 39 | 13 | 11 |
| EC0-O...2... | 1.61137 | 10 | 3 | 1 |
| EC0-s...2... | 1.09856 | 1 | 0 | 1 |
| H. | 1.07331 | 22 | 7 | 2 |
| Br..(.. | −0.09175 | 2 | 1 | 2 |
| Br | −1.51944 | 2 | 1 | 2 |
| Br..2.. | 1.00200 | 0 | 0 | 1 |
| Cl..(.. | −3.56550 | 3 | 0 | 1 |
| Cl | 0.43450 | 3 | 0 | 1 |
| N...#.. | −2.11037 | 1 | 0 | 0 |
| N...(.. | −0.65725 | 39 | 13 | 11 |
| N. | −1.94531 | 39 | 13 | 11 |
| N...1.. | −2.28325 | 26 | 7 | 4 |
| N...C.. | −2.36919 | 39 | 13 | 11 |
| N...N.. | −3.82512 | 1 | 0 | 0 |
| O...(.. | 2.47275 | 39 | 13 | 11 |
| O. | −2.50881 | 39 | 13 | 11 |
| O...2.. | 3.87100 | 8 | 2 | 1 |
| O...=.. | 3.97175 | 39 | 13 | 11 |
| O...C.. | −1.99800 | 10 | 3 | 2 |
| NOSP11000000 | 9.06450 | 39 | 13 | 11 |
| [. | 1.98438 | 22 | 7 | 2 |
| [...1.. | 3.74019 | 22 | 7 | 2 |
| [...H.. | 2.99800 | 22 | 7 | 2 |
| c...(.. | 1.69150 | 39 | 13 | 11 |
| c. | 0.98838 | 39 | 13 | 11 |
| c...1.. | 0.26863 | 39 | 13 | 11 |
| c...2.. | 0.58394 | 39 | 13 | 11 |
| c...N.. | 9.94150 | 39 | 13 | 11 |
| c...[.. | 4.05669 | 22 | 7 | 2 |
| c...c.. | −1.39544 | 39 | 13 | 11 |
| n...(.. | 1.18850 | 17 | 6 | 9 |
| n. | −0.35256 | 39 | 13 | 11 |
| n...1.. | 6.86037 | 39 | 13 | 10 |
| n...2.. | 3.49619 | 14 | 6 | 7 |
| n...H.. | 0.54488 | 22 | 7 | 2 |
| n...[.. | 1.73238 | 22 | 7 | 2 |
| n...c.. | 1.68369 | 14 | 5 | 7 |
| n...n.. | −1.56450 | 1 | 0 | 0 |
| s. | −0.36238 | 1 | 0 | 1 |
| s...1.. | 1.00200 | 0 | 0 | 1 |
| s...c.. | 0.79387 | 1 | 0 | 1 |
CW = correlation weight; DCW = descriptor of correlation weights; SMILES = simplified molecular input-line entry systems.
The Ntrain, Ncalib, and Ntest are the frequencies of SAk in the training, calibration, and test sets, respectively.
Summarized statistical evaluation of each model developed by the SiRMS approach.
| Model (split) |
| straining | q2 | scross-validation |
| stest |
|---|---|---|---|---|---|---|
| 1S | 0.86 | 0.31 | 0.81 | 0.37 | 0.75 | 0.47 |
| 2S | 0.84 | 0.33 | 0.79 | 0.39 | 0.70 | 0.50 |
| 3S | 0.82 | 0.35 | 0.76 | 0.42 | 0.72 | 0.49 |
q = LOO cross-validation coefficient; R2 = correlation coefficient; s = standard error; SiRMS = simplex representation of molecular structure.
Fig. 2Diagram of relative influence (%) of various groups of SiRMS descriptors.
Fig. 3Plot of experimental (observed) versus predicted log values, SiRMS approach.