| Literature DB >> 25574393 |
Mukesh C Sharma1, Smita Sharma2.
Abstract
A QSAR study on thiophenyl derivatives as SGLT2 inhibitors as potential antidiabetic agents was performed with thirty-three compounds. Comparison of the obtained results indicated the superiority of the genetic algorithm over the simulated annealing and stepwise forward-backward variable method for feature selection. The best 2D QSAR model showed satisfactory statistical parameters for the data set (r (2) = 0.8499, q (2) = 0.8267, and pred_r (2) = 0.7729) with four descriptors describing the nature of substituent groups and the environment of the substitution site. Evaluation of the model implied that electron-rich substitution position improves the inhibitory activity. The good predictive 3D-QSAR models by k-nearest neighbor (kNN) method for molecular field analysis (MFA) have cross-validated coefficient q (2) value of 0.7663 and predicted r (2) value of 0.7386. The results have showed that thiophenyl groups are necessary for activity and halogen, bulky, and less bulky groups in thiophenyl nucleus enhanced the biological activity. These studies are promising for the development of novel SGLT2 inhibitor, which may have potent antidiabetic activity.Entities:
Year: 2014 PMID: 25574393 PMCID: PMC4276301 DOI: 10.1155/2014/739646
Source DB: PubMed Journal: Int J Med Chem ISSN: 2090-2077
Structure and biological activity of thiophenyl derivatives hSGLT2 inhibitors.
|
| ||||
|---|---|---|---|---|
| S. number | X | R | IC50 | pIC50 |
| 1 | Cl |
| 86.5 | 7.0629 |
| 2 | Cl |
| 34.6 | 7.4609 |
| 3* | Cl |
| 140 | 6.8538 |
| 4 | Cl |
| 65.0 | 7.1870 |
| 5 | Cl |
| 54.6 | 7.2628 |
| 6* | Cl |
| 111 | 6.9546 |
| 7 | Cl |
| 94 | 7.0268 |
| 8 | Cl |
| 115 | 6.9393 |
| 9 | Cl |
| 70.7 | 7.1505 |
| 10* | Cl |
| 12.8 | 7.8927 |
| 11 | Cl |
| 48.4 | 7.3151 |
| 12 | Cl |
| 11.9 | 7.9244 |
| 13 | Cl |
| 57.2 | 7.2426 |
| 14* | Cl |
| 162 | 6.7904 |
| 15 | Cl |
| 60.2 | 7.2204 |
| 16 | Cl |
| 4.47 | 8.3496 |
| 17 | Cl |
| 10.3 | 7.9871 |
| 18 | Cl |
| 91.3 | 7.0395 |
| 19* | H |
| 11.5 | 7.9393 |
| 20 | H |
| 8.73 | 8.0589 |
| 21 | H |
| 50.2 | 7.2992 |
| 22* | H |
| 27.5 | 7.5606 |
| 23 | H |
| 21.1 | 7.6757 |
| 24 | H |
| 71.4 | 7.1463 |
| 25* | H |
| 68.9 | 7.1617 |
| 26 | H |
| 451 | 6.3458 |
| 27 | H |
| 88.3 | 7.0540 |
| 28 | H |
| 69.6 | 7.1573 |
| 29* | H |
| 24.8 | 7.6055 |
| 30 | H |
| 59.8 | 7.2232 |
| 31 | Br |
| 12.4 | 7.9065 |
| 32 | OMe |
| 49.5 | 7.3053 |
| 33* | Me |
| 29.3 | 7.5331 |
*Structures with (*) are included in test set validation.
Unicolumn statistics of training and test set for 2D QSAR studies.
| Data set | Average | Max | Min | Std. dev. | Sum |
|---|---|---|---|---|---|
| 2D QSAR | |||||
| Training | 1.6127 | 2.6542 | 0.6503 | 0.4426 | 41.9299 |
| Test | 1.7765 | 2.2095 | 1.1072 | 0.3889 | 12.4355 |
|
| |||||
| 3D QSAR | |||||
| Training | 1.5401 | 2.2095 | 0.6503 | 0.4163 | 36.9636 |
| Test | 1.9335 | 2.6542 | 1.3945 | 0.3452 | 17.4018 |
Selected descriptors values for QSAR models.
| SsCH3count | T_C_Cl_1 | S_1044 | E_184 | E_1139 | H_1153 |
|---|---|---|---|---|---|
| 13.35725 | 1 | −0.00234 | −0.00103 | 0.070427 | 0.068996 |
| 14.89175 | 1 | −0.00252 | −0.03374 | 0.039663 | 0.110293 |
| 13.05024 | 1 | −0.00332 | −0.0034 | 0.045418 | 0.156499 |
| 12.66982 | 1 | −0.00271 | −0.07924 | 0.015991 | 0.205079 |
| 13.19176 | 1 | −0.00718 | −0.05341 | 0.01499 | 0.156726 |
| 13.4038 | 1 | −0.00276 | −0.02181 | 0.126288 | 0.213233 |
| 13.05024 | 1 | −0.00272 | −0.11568 | 0.049628 | 0.170716 |
| 14.11489 | 1 | −0.00229 | −0.04659 | 0.03041 | 0.190254 |
| 13.19176 | 1 | −0.0037 | −0.05161 | 0.007637 | 0.195659 |
| 13.85076 | 1 | −0.00269 | −0.09992 | 0.148325 | 0.170358 |
| 13.95209 | 1 | −0.002 | −0.05398 | 0.046206 | 0.178527 |
| 13.49164 | 1 | −0.00307 | −0.04381 | 0.022364 | 0.196138 |
| 13.8452 | 1 | −0.00213 | −0.03588 | 0.01794 | 0.135694 |
| 13.11122 | 1 | −0.00219 | −0.05675 | 0.025435 | 0.117383 |
| 11.57052 | 1 | −0.0077 | −0.03111 | 0.040108 | 0.13964 |
| 13.05024 | 1 | −0.00148 | −0.06435 | 0.079842 | 0.242054 |
| 13.05024 | 1 | −0.00131 | −0.03439 | 0.040536 | 0.219392 |
| 12.37104 | 1 | −0.00205 | −0.00201 | 0.013029 | 0.200942 |
| 13.85076 | 0 | −0.00288 | −0.02182 | 0.060327 | 0.072212 |
| 13.49164 | 0 | −0.00392 | −0.07865 | 0.052315 | 0.116761 |
| 11.65703 | 0 | −0.00194 | −0.0367 | 0.068249 | 0.151754 |
| 12.45755 | 1 | −0.00638 | −0.03708 | 0.025638 | 0.202708 |
| 13.35725 | 0 | −0.00325 | −0.0005 | 0.016382 | 0.181212 |
| 12.3107 | 0 | −0.00272 | −0.06519 | 0.139852 | 0.220012 |
| 13.11122 | 0 | −0.00186 | −0.04364 | 0.073036 | 0.168351 |
| 12.30098 | 0 | −0.00406 | −0.01576 | 0.034676 | 0.18917 |
| 13.1015 | 1 | −0.00226 | −0.03521 | 0.000739 | 0.211024 |
| 11.56143 | 1 | −0.00212 | −0.07403 | 0.159932 | 0.171132 |
| 12.11965 | 1 | −0.00331 | −0.02547 | 0.047064 | 0.188637 |
| 12.3107 | 1 | −0.00178 | −0.03979 | 0.004465 | 0.186037 |
| 12.92017 | 1 | −0.0024 | −0.03588 | 0.030881 | 0.137912 |
| 13.11122 | 1 | −0.01145 | −0.05675 | 0.032062 | 0.126376 |
| 13.35725 | 1 | −0.00109 | −0.00103 | 0.070427 | 0.137455 |
| 14.89175 | 1 | −0.0011 | −0.03374 | 0.039663 | 0.232216 |
| 13.05024 | 1 | −0.00082 | −0.0034 | 0.045418 | 0.257901 |
| 12.66982 | 1 | −0.00144 | −0.07924 | 0.015991 | 0.22212 |
| 13.19176 | 1 | −0.00178 | −0.05341 | 0.01499 | 0.144703 |
| 13.4038 | 1 | −0.00071 | −0.02181 | 0.126288 | 0.132103 |
| 13.05024 | 1 | −0.00234 | −0.11568 | 0.049628 | 0.068996 |
| 14.11489 | 1 | −0.00252 | −0.04659 | 0.03041 | 0.110293 |
| 13.19176 | 1 | −0.00332 | −0.05161 | 0.007637 | 0.156499 |
Figure 1(a) Thiophenyl ring (template structure). (b) Alignment of thiophenyl derivatives. (c) Contribution charts of the descriptors for the 2D QSAR model-1. (d) Plot of observed versus predicted activity by 2D QSAR model-1. (e) Contribution plot for steric and electrostatic interactions GA-PLS model. (f) Plot of observed versus predicted activity by 3D QSAR GA-PLS model. (g) Contribution plot for steric and electrostatic interactions SA-PLS model. (h) Plot of observed versus predicted activity by 3D QSAR SA-PLS model. (i) Contribution plot for steric, hydrophobic, and electrostatic interactions SW-PLS model. (j) Plot of observed versus predicted activity by 3D QSAR SW-PLS model.
Comparative observed and predicted activities (LOO) of thiophenyl SGLT2 inhibitors.
| Com | pIC50 | 2D model-1 | 3D model-GA-PLS | 3D model-SA-PLS | 3D model-SW-PLS | ||||
|---|---|---|---|---|---|---|---|---|---|
| Pred. | Res. | Pred. | Res. | Pred. | Res. | Pred. | Res. | ||
| 1 | 7.0629 | 7.1845 | −0.1216 | 7.1847 | −0.1218 | 7.1718 | −0.1089 | 7.0553 | 0.0076 |
| 2 | 7.4609 | 7.4978 | −0.0369 | 7.3679 | 0.093 | 7.1283 | 0.3326 | 7.2289 | 0.232 |
| 3 | 6.8538 | 6.8737 | −0.0199 | 6.7565 | 0.0973 | 6.9154 | −0.0616 | 6.7295 | 0.1243 |
| 4 | 7.187 | 7.2319 | −0.0449 | 7.0789 | 0.1081 | 7.1122 | 0.0748 | 7.1047 | 0.0823 |
| 5 | 7.2628 | 7.2564 | 0.0064 | 7.4543 | −0.1915 | 7.2428 | 0.02 | 7.3525 | −0.0897 |
| 6 | 6.9546 | 6.9448 | 0.0098 | 6.9121 | 0.0425 | 6.9393 | 0.0153 | 6.9875 | −0.0329 |
| 7 | 7.0268 | 7.1464 | −0.1196 | 7.0723 | −0.0455 | 7.1174 | −0.0906 | 7.1653 | −0.1385 |
| 8 | 6.9393 | 6.9815 | −0.0422 | 6.9644 | −0.0251 | 6.9163 | 0.023 | 6.9128 | 0.0265 |
| 9 | 7.1505 | 7.1854 | −0.0349 | 7.0746 | 0.0759 | 7.1649 | −0.0144 | 7.1761 | −0.0256 |
| 10 | 7.8927 | 7.8289 | 0.0638 | 7.8694 | 0.0233 | 7.8265 | 0.0662 | 7.8656 | 0.0271 |
| 11 | 7.3151 | 7.3283 | −0.0132 | 7.3729 | −0.0578 | 7.3247 | −0.0096 | 7.1826 | 0.1325 |
| 12 | 7.9244 | 7.9135 | 0.0109 | 7.9918 | −0.0674 | 7.9418 | −0.0174 | 7.9356 | −0.0112 |
| 13 | 7.2426 | 7.2857 | −0.0431 | 7.2498 | −0.0072 | 7.3821 | −0.1395 | 7.2148 | 0.0278 |
| 14 | 6.7904 | 6.7845 | 0.0059 | 6.7954 | −0.005 | 6.7991 | −0.0087 | 6.7894 | 0.001 |
| 15 | 7.2204 | 7.2138 | 0.0066 | 7.2525 | −0.0321 | 7.2862 | −0.0658 | 7.1966 | 0.0238 |
| 16 | 8.3496 | 8.3791 | −0.0295 | 8.3875 | −0.0379 | 8.3424 | 0.0072 | 8.3446 | 0.005 |
| 17 | 7.9871 | 7.9845 | 0.0026 | 7.9949 | −0.0078 | 7.9326 | 0.0545 | 7.9674 | 0.0197 |
| 18 | 7.0395 | 7.1278 | −0.0883 | 7.1392 | −0.0997 | 7.0734 | −0.0339 | 7.0673 | −0.0278 |
| 19 | 7.9393 | 7.9243 | 0.015 | 7.9418 | −0.0025 | 7.9388 | 0.0005 | 7.9132 | 0.0261 |
| 20 | 8.0589 | 7.9275 | 0.1314 | 8.0692 | −0.0103 | 8.0872 | −0.0283 | 8.1471 | −0.0882 |
| 21 | 7.2992 | 7.2961 | 0.0031 | 7.1491 | 0.1501 | 7.1449 | 0.1543 | 7.2307 | 0.0685 |
| 22 | 7.5606 | 7.5884 | −0.0278 | 7.4281 | 0.1325 | 7.5161 | 0.0445 | 7.5319 | 0.0287 |
| 23 | 7.6757 | 7.6978 | −0.0221 | 7.5946 | 0.0811 | 7.5988 | 0.0769 | 7.5975 | 0.0782 |
| 24 | 7.1463 | 7.1326 | 0.0137 | 7.1384 | 0.0079 | 7.2864 | −0.1401 | 7.1125 | 0.0338 |
| 25 | 7.1617 | 7.0968 | 0.0649 | 7.1948 | −0.0331 | 7.1289 | 0.0328 | 7.1799 | −0.0182 |
| 26 | 6.3458 | 6.3291 | 0.0167 | 6.3259 | 0.0199 | 6.1643 | 0.1815 | 7.4332 | −0.0874 |
| 27 | 7.054 | 7.1853 | −0.1313 | 6.9342 | 0.1198 | 6.9768 | 0.0772 | 6.9087 | 0.1453 |
| 28 | 7.1573 | 7.2218 | −0.0645 | 7.0202 | 0.1371 | 7.2738 | −0.1165 | 7.2684 | −0.1111 |
| 29 | 7.6055 | 7.6283 | −0.0228 | 7.6163 | −0.0108 | 7.6023 | 0.0032 | 7.6132 | −0.0077 |
| 30 | 7.2232 | 7.2346 | −0.0114 | 7.2458 | −0.0226 | 7.2282 | −0.005 | 7.2547 | −0.0315 |
| 31 | 7.9065 | 7.8579 | 0.0486 | 7.8881 | 0.0184 | 7.8194 | 0.0871 | 7.8658 | 0.0407 |
| 32 | 7.3053 | 7.2982 | 0.0071 | 7.0424 | 0.2629 | 7.1921 | 0.1132 | 7.8928 | −0.5875 |
| 33 | 7.5331 | 7.4728 | 0.0603 | 7.3466 | 0.1865 | 7.3308 | 0.2023 | 7.2733 | 0.2598 |
Correlation matrix between descriptors present in the best QSAR model-1.
| Parameter | pIC50 | SsCH3count | T_C_Cl_1 | LUMO energy | SaaSE-index |
|---|---|---|---|---|---|
| pIC50 | 1.0000 | ||||
| SsCH3count | 0.5431 | 1.0000 | |||
| T_C_Cl_1 | 0.2165 | 0.4312 | 1.0000 | ||
| LUMO energy | 0.4964 | 0.5731 | 0.6823 | 1.0000 | |
| SaaSE-index | 0.3731 | 0.4633 | 0.6591 | 0.8504 | 1.0000 |