| Literature DB >> 34760008 |
Fariba Masoomi Sefiddashti1, Saeid Asadpour1, Hedayat Haddadi1, Shima Ghanavati Nasab1.
Abstract
BACKGROUND ANDEntities:
Keywords: Artificial neural network; Cancer; Multiple linear regression; Pyrimidine derivatives; QSAR
Year: 2021 PMID: 34760008 PMCID: PMC8562410 DOI: 10.4103/1735-5362.327506
Source DB: PubMed Journal: Res Pharm Sci ISSN: 1735-5362
Structural formulae of compounds and their percent inhibition values
| Order | Structure | % of inhibition |
|---|---|---|
| 1 |
| 8 |
| 2 |
| 15 |
| 3 |
| 5 |
| 4 |
| 8 |
| 5 |
| 14 |
| 6 |
| 14 |
| 7 |
| 32 |
| 8 |
| 19 |
| 9 |
| 97 |
| 10 |
| 62 |
| 11 |
| 61 |
| 12 |
| 72 |
| 13 |
| 98 |
| 14 |
| 14 |
| 15 |
| 45 |
| 16 |
| 23 |
| 17 |
| 27 |
| 18 |
| 77 |
| 19 |
| 100 |
| 20 |
| 40 |
| 21 |
| 73 |
| 22 |
| 39 |
| 23 |
| 47 |
| 24 |
| 67 |
| 25 |
| 19 |
| 26 |
| 10 |
| 27 |
| 83 |
| 28 |
| 87 |
| 29 |
| 86 |
| 30 |
| 94 |
| 31 |
| 46 |
| 32 |
| 96 |
| 33 |
| 100 |
R2, RMSE, Q2, adjusted R2 values for models with the different number of descriptors.
| Order | Adjusted R2 | Q2 | RMSE | R2 |
|---|---|---|---|---|
| 1 | 0.527003 | 0.027606 | 22.33229 | 0.541784 |
| 2 | 0.704868 | 0.572353 | 17.35135 | 0.723314 |
| 3 | 0.778927 | 0.738329 | 14.74748 | 0.799653 |
| 4 | 0.827726 | 0.821183 | 12.83163 | 0.84926 |
| 5 | 0.869454 | 0.871045 | 10.97305 | 0.889852 |
| 6 | 0.884936 | 0.88934 | 10.18958 | 0.906511 |
| 7 | 0.909911 | 0.890616 | 10.14926 | 0.929618 |
| 8 | 0.926377 | 0.890344 | 10.2435 | 0.944782 |
R2, Regression coefficient; RMSE, root-mean-square error.
Descriptors used in the 2D-QSAR study.
| Descriptor types | Descriptor blocks type | Descriptor description |
|---|---|---|
| RDF035u | RDF descriptors | Radial distribiution Function -5.3 / unweighted |
| Mor24v | 3DMoRSE | 3D-MORSE-signal 24 / weighted by atomic van der volumes |
| EEig11r | 3DMoRSE | Eigenvalue 11 from edge adj. matrix weighted by resonance integral |
| G2s | WHIM descriptors | 2nd component symmetry directional WHIM index / weighted by atomic electropological states |
| ATS3v | 2Dauto correlation | Broto-Moreau autocorrelation of a topological structure-lag 3/weighted by atomic van der Waals vol |
values of the obtained parameters of the studied derivatives of furopyrimidine and thienopyrimidine
| Number | RDF035u | Mor24v | EEig11r | ATS3v | G2s |
|---|---|---|---|---|---|
| 1 | 26.484 | -0.41 | 2.044 | 3.739 | 0.167 |
| 2 | 27.26 | -0.443 | 2.009 | 3.739 | 0.167 |
| 3 | 22.56 | -0.276 | 2.14 | 3.739 | 0.167 |
| 4 | 23.356 | -0.168 | 2.005 | 3.763 | 0.174 |
| 5 | 24.407 | -0.304 | 2.005 | 3.739 | 0.167 |
| 6 | 24.234 | -0.288 | 2.271 | 3.861 | 0.162 |
| 7 | 23.311 | -0.237 | 2.011 | 3.746 | 0.165 |
| 8 | 26.9 | -0.409 | 2.01 | 3.687 | 0.167 |
| 9 | 28.934 | -0.395 | 2.179 | 3.736 | 0.182 |
| 10 | 27.84 | -0.318 | 2.193 | 3.76 | 0.164 |
| 11 | 26.824 | -0.369 | 2.164 | 3.736 | 0.165 |
| 12 | 26.841 | -0.351 | 2.01 | 3.736 | 0.165 |
| 13 | 28.809 | -0.377 | 2.329 | 3.805 | 0.18 |
| 14 | 29.849 | -0.374 | 2.322 | 3.806 | 0.163 |
| 15 | 26.845 | -0.416 | 2.162 | 3.754 | 0.165 |
| 16 | 28.53 | -0.345 | 2.01 | 3.783 | 0.164 |
| 17 | 25.623 | -0.279 | 2.007 | 3.664 | 0.167 |
| 18 | 28.411 | -0.27 | 2.179 | 3.714 | 0.165 |
| 19 | 27.922 | -0.246 | 2.193 | 3.739 | 0.164 |
| 20 | 25.499 | -0.303 | 2.007 | 3.714 | 0.165 |
| 21 | 28.285 | -0.21 | 2.328 | 3.785 | 0.164 |
| 22 | 26.108 | -0.41 | 2.07 | 3.718 | 0.169 |
| 23 | 27.997 | -0.402 | 2.179 | 3.766 | 0.168 |
| 24 | 27.069 | -0.353 | 2.193 | 3.789 | 0.184 |
| 25 | 26.056 | -0.364 | 2.164 | 3.766 | 0.168 |
| 26 | 26.003 | -0.387 | 2.07 | 3.766 | 0.168 |
| 27 | 27.842 | -0.396 | 2.333 | 3.833 | 0.167 |
| 28 | 28.994 | -0.398 | 2.324 | 3.833 | 0.174 |
| 29 | 26.499 | -0.143 | 2.068 | 3.696 | 0.169 |
| 30 | 28.306 | -0.263 | 2.179 | 3.744 | 0.168 |
| 31 | 27.494 | -0.042 | 2.193 | 3.768 | 0.167 |
| 32 | 26.049 | -0.191 | 2.068 | 3.744 | 0.168 |
| 33 | 28.785 | -0.19 | 2.333 | 3.813 | 0.176 |
Correlation matrix between different obtained descriptors.
| Descriptor types | RDF035u | Mor24v | EEig11r | ATS3v | G2s |
|---|---|---|---|---|---|
| RDF035u | 1 | ||||
| Mor24v | -0.2354 | 1 | |||
| EEig11r | 0.562653 | -0.02447 | 1 | ||
| ATS3v | 0.255481 | -0.08419 | 0.723526 | 1 | |
| G2s | 0.190981 | -0.08694 | 0.209459 | 0.158042 | 1 |
Observed and calculated values of inhibition percent according to multiple linear regression method for the calibration and test sets.
| Calibration set | Inhibition% (observed) | Inhibition% (predicted) | Residual | Relative error% |
|---|---|---|---|---|
| 1 | 8 | 21.30 | -13.30 | -166.31 |
| 3 | 5 | 14.02 | -9.02 | -180.54 |
| 4 | 8 | 14.97 | -6.97 | -87.07 |
| 5 | 14 | 6.71 | 7.29 | 52.06 |
| 6 | 14 | 15.37 | -1.37 | -9.78 |
| 8 | 19 | 32.61 | -13.61 | -71.65 |
| 10 | 62 | 61.90 | 0.10 | 0.17 |
| 11 | 61 | 47.89 | 13.17 | 21.50 |
| 13 | 98 | 95.33 | 2.67 | 2.73 |
| 14 | 14 | 24.61 | 10.61 | -75.75 |
| 15 | 45 | 37.76 | 7.24 | 16.09 |
| 16 | 23 | 29.55 | -6.55 | -28.48 |
| 17 | 27 | 40.48 | -13.48 | -49.92 |
| 18 | 77 | 83.43 | -6.43 | -8.35 |
| 20 | 40 | 21.79 | 18.21 | 45.54 |
| 22 | 39 | 29.38 | 9.62 | 24.66 |
| 23 | 47 | 54.44 | -7.44 | -15.83 |
| 24 | 67 | 66.88 | 0.12 | 0.18 |
| 25 | 19 | 37.08 | -18.08 | -95.14 |
| 26 | 10 | 0.65 | -9.35 | 93.51 |
| 27 | 83 | 95.59 | 12.59 | -15.17 |
| 28 | 87 | 79.83 | 7.17 | 8.24 |
| 29 | 86 | 70.21 | 15.78 | 18.37 |
| 30 | 94 | 79.49 | 14.51 | 15.43 |
| 32 | 96 | 93.04 | -2.96 | 3.09 |
| 33 | 100 | 111.34 | -11.34 | -11.34 |
|
| ||||
|
| ||||
| 2 | 15 | 19.31 | 4.31 | -28.70 |
| 7 | 32 | 18.00 | -14.00 | 93.51 |
| 9 | 97 | 88.72 | 8.28 | 8.54 |
| 12 | 72 | 84.33 | 12.33 | -17.13 |
| 19 | 100 | 76.44 | 23.56 | 23.56 |
| 21 | 73 | 61.67 | -11.33 | 15.52 |
| 31 | 46 | 47.07 | -0.02 | -2.32 |
Fig. 1Predicted inhibition percent activities by multiple linear regression in comparison with experimental for (A) model and (B) test set.
R2 and Q2 values after several Y- randomization tests.
| Iteration | R2 | Q2 |
|---|---|---|
| 1 | 0.16 | 0.01 |
| 2 | 0.15 | 0.00 |
| 3 | 0.20 | 0.03 |
| 4 | 0.33 | 0.11 |
| 5 | 0.18 | 0.02 |
| 6 | 0.14 | 0.00 |
| 7 | 0.24 | 0.05 |
Observed values and calculated values of inhibition percent according to the artificial neural network method.
| Training set | Inhibition% (observed) | Inhibition% (observed) | Residual | Relative error% |
|---|---|---|---|---|
| 1 | 8 | 8.02 | -0.02 | 0.25 |
| 2 | 15 | 15.08 | -0.08 | 0.53 |
| 4 | 8 | 8.38 | -0.38 | 4.75 |
| 5 | 14 | 14.03 | -0.03 | 0.21 |
| 7 | 32 | 35.24 | -3.24 | 10.13 |
| 8 | 19 | 19.10 | -0.10 | 0.53 |
| 9 | 97 | 97.00 | 0.00 | 0.00 |
| 10 | 62 | 62.25 | -0.25 | 0.40 |
| 11 | 61 | 61.2 | -0.2 | 0.33 |
| 14 | 14 | 14.07 | -0.07 | 0.50 |
| 16 | 23 | 23.21 | -0.21 | 0.91 |
| 17 | 27 | 27.06 | -0.06 | 0.22 |
| 18 | 77 | 77.13 | -0.13 | 0.17 |
| 19 | 100 | 100.00 | 0.00 | 0.00 |
| 20 | 40 | 40.10 | -0.10 | 0.25 |
| 22 | 39 | 39.01 | -0.01 | 0.03 |
| 23 | 47 | 40.59 | 6.41 | -13.64 |
| 24 | 67 | 64.58 | 2.42 | -3.61 |
| 25 | 19 | 21.47 | -2.47 | 13.00 |
| 26 | 10 | 10.01 | -0.01 | 0.10 |
| 27 | 83 | 83.10 | -0.10 | 0.12 |
| 30 | 94 | 95.62 | -1.62 | 1.72 |
| 33 | 100 | 100.00 | 0.00 | 0.00 |
|
| ||||
|
| ||||
| 6 | 14 | 13.73 | 0.27 | -1.93 |
| 12 | 72 | 66.86 | 5.14 | -7.14 |
| 21 | 73 | 73.13 | -0.13 | 0.18 |
| 29 | 86 | 86.01 | -0.01 | 0.01 |
| 32 | 96 | 99.47 | -3.47 | 3.61 |
|
| ||||
|
| ||||
| 3 | 5 | 5.00 | 0.00 | 0.00 |
| 13 | 98 | 98.00 | 0.00 | 0.00 |
| 15 | 45 | 45.21 | -0.21 | 0.47 |
| 28 | 87 | 87.02 | -0.02 | 0.02 |
| 31 | 46 | 46.08 | -0.08 | 0.17 |
N = 33; Rtrain = 0.998; Rtest = 0.999; Rvalidation = 0.999; Rall = 0.998; R2CV= 0.99998; root-mean-square error = 1.78
Fig. 2Predicted inhibition percent activities by artificial neural network in comparison with experimental.
Fig. 3Residual plot of furopyrimidine and thienopyrimidine derivatives by artificial neural network model.
Performance comparison between models obtained by MLR and ANN.
| Models | Calibration | Prediction | ||||
|---|---|---|---|---|---|---|
|
| ||||||
| RMSE | R2 | Q2 | RMSE | R2 | Q2 | |
| MLR | 10.97 | 0.889 | 0.87 | 14.54 | 0.684 | 0.75 |
| ANN | 1.78 | 0.998 | 0.99 | 0.17 | 0.999 | 0.99 |
MLR, multiple linear regression; ANN, artificial neural network; RMSE, root-mean-square error; R2, regression coefficient.
Comparing values of inhibition percent experimental and predicted results using MLR and ANN methods.
| Number | Inhibition (observed) | MLR (predicted) | Residual | Relative error (%) | ANN (predicted) | Residual | Relative error (%) |
|---|---|---|---|---|---|---|---|
| 1 | 8 | 21.305 | -13.305 | -166.313 | 8.017 | -0.017 | -0.219 |
| 2 | 15 | 19.305 | -4.305 | -28.7 | 15.082 | -0.082 | -0.545 |
| 3 | 5 | 14.027 | -9.027 | -180.54 | 5.001 | -0.001 | -0.027 |
| 4 | 8 | 14.966 | -6.966 | -87.075 | 8.382 | -0.382 | -4.773 |
| 5 | 14 | 6.711 | 7.289 | 52.064 | 14.031 | -0.031 | -0.218 |
| 6 | 14 | 15.369 | -1.369 | -9.779 | 13.734 | 0.266 | 1.899 |
| 7 | 32 | 18.175 | 13.825 | 43.203 | 35.244 | -3.244 | -10.137 |
| 8 | 19 | 32.614 | -13.614 | -71.653 | 19.1 | -0.1 | -0.525 |
| 9 | 97 | 88.716 | 8.284 | 8.54 | 96.998 | 0.002 | 0.002 |
| 10 | 62 | 61.903 | 0.097 | 0.156 | 62.25 | -0.25 | -0.403 |
| 11 | 61 | 47.884 | 13.116 | 21.502 | 61.195 | -0.195 | -0.32 |
| 12 | 72 | 84.331 | -12.331 | -17.126 | 66.864 | 5.136 | 7.133 |
| 13 | 98 | 95.327 | 2.673 | 2.728 | 97.998 | 0.002 | 0.002 |
| 14 | 14 | 24.605 | -10.605 | -75.75 | 14.068 | -0.068 | -0.487 |
| 15 | 45 | 37.76 | 7.24 | 16.089 | 45.207 | -0.207 | -0.46 |
| 16 | 23 | 29.55 | -6.55 | -28.478 | 23.208 | -0.208 | -0.903 |
| 17 | 27 | 40.479 | -13.479 | -49.922 | 27.056 | -0.056 | -0.207 |
| 18 | 77 | 83.432 | -6.432 | -8.353 | 77.126 | -0.126 | -0.163 |
| 19 | 100 | 76.441 | 23.559 | 23.559 | 99.999 | 0.001 | 0.001 |
| 20 | 40 | 21.785 | 18.215 | 45.538 | 40.099 | -0.099 | -0.248 |
| 21 | 73 | 61.667 | 11.333 | 15.525 | 73.131 | -0.131 | -0.179 |
| 22 | 39 | 29.382 | 9.618 | 24.662 | 39.011 | -0.011 | -0.029 |
| 23 | 47 | 54.44 | -7.44 | -15.83 | 40.586 | 6.414 | 13.646 |
| 24 | 67 | 66.878 | 0.122 | 0.182 | 64.579 | 2.421 | 3.613 |
| 25 | 19 | 37.076 | -18.076 | -95.137 | 21.466 | -2.466 | -12.979 |
| 26 | 10 | 0.649 | 9.351 | 93.51 | 10.007 | -0.007 | -0.073 |
| 27 | 83 | 95.591 | -12.591 | -15.17 | 83.1 | -0.1 | -0.12 |
| 28 | 87 | 79.829 | 7.171 | 8.243 | 87.023 | -0.023 | -0.026 |
| 29 | 86 | 70.205 | 15.795 | 18.366 | 86.008 | -0.008 | -0.009 |
| 30 | 94 | 79.492 | 14.508 | 15.434 | 95.62 | -1.62 | -1.724 |
| 31 | 46 | 47.071 | -1.071 | -2.328 | 46.084 | -0.084 | -0.184 |
| 32 | 96 | 93.038 | 2.962 | 3.085 | 99.47 | -3.47 | -3.615 |
| 33 | 100 | 111.342 | -11.342 | -11.342 | 99.996 | 0.004 | 0.004 |
MLR, multiple linear regression; ANN, artificial neural network.