| Literature DB >> 28496470 |
Abstract
In this work the electrooxidation half-wave potentials of some Benzoxazines were predicted from their structural molecular descriptors by using quantitative structure-property relationship (QSAR) approaches. The dataset consist the half-wave potential of 40 benzoxazine derivatives which were obtained by DC-polarography. Descriptors which were selected by stepwise multiple selection procedure are: HOMO energy, partial positive surface area, maximum valency of carbon atom, relative number of hydrogen atoms and maximum electrophilic reaction index for nitrogen atom. These descriptors were used for development of multiple linear regression (MLR) and artificial neural network (ANN) models. The statistical parameters of MLR model are standard errors of 0.016 and 0.018 for training and test sets, respectively. Also, these values are 0.012 and 0.017 for training and test sets of ANN model, respectively. The predictive power of these models was further examined by leave-eight-out cross validation procedure. The obtained statistical parameters are Q2 = 0.920 and SPRESS = 0.020 for MLR model and Q2 = 0.949 and SPRESS = 0.015 for ANN model, which reveals the superiority of ANN over MLR model. Moreover, the results of sensitivity analysis on ANN model indicate that the order of importance of descriptors is: Relative number of H atom > HOMO energy > Maximum electrophyl reaction index for N atom > Partial positive surface area (order-3) > maximum valency of C atom.Entities:
Keywords: Artificial neural network; Benzoxazines; Half wave potential; Quantitative structure-property relationship; pharmaceutical property
Year: 2017 PMID: 28496470 PMCID: PMC5423242
Source DB: PubMed Journal: Iran J Pharm Res ISSN: 1726-6882 Impact factor: 1.696
Figure 1Plot of R2 for the obtained models versus the number of descriptors involved
Structures, experimental, MLR and ANN-predicted values of oxidation half-wave
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 40 | S | 6-Cl | 3-Cl | 1.520 | 1.498 | 1.503 |
| 1 | O | 7-OCH3 | 1.420 | 1.413 | 1.419 | |
| 2 | O | 7-OCH3 | 4-F | 1.430 | 1.434 | 1.424 |
| 3 | O | 7-OCH3 | 4-Br | 1.440 | 1.465 | 1.453 |
| 4 | O | 7-OCH3 | 3-F | 1.445 | 1.458 | 1.441 |
| 5 | O | 7-OCH3 | 3-Cl | 1.450 | 1.461 | 1.447 |
| 6 | O | 7-CH3 | 4-CH3 | 1.415 | 1.412 | 1.418 |
| 7 | O | 6-CH3 | 4-CH3 | 1.420 | 1.416 | 1.423 |
| 8T | O | 4-Br | 1.490 | 1.522 | 1.533 | |
| 9 | O | 6-OCH3 | 4-CH3 | 1.450 | 1.425 | 1.433 |
| 10 | O | 6-OCH3 | 4-F | 1.460 | 1.480 | 1.460 |
| 11 | O | 6-OCH3 | 4-Br | 1.465 | 1.487 | 1.474 |
| 12 | O | 6-OCH3 | 4-Cl | 1.470 | 1.487 | 1.473 |
| 13 | O | 6-OCH3 | 3-F | 1.480 | 1.481 | 1.462 |
| 14 | O | 6-OCH3 | 4-CN | 1.510 | 1.516 | 1.512 |
| 15 | O | 6-Cl | 1.530 | 1.525 | 1.535 | |
| 16 | O | 6-Cl | 3-Cl | 1.590 | 1.589 | 1.591 |
| 17 | S | 7-OCH3 | 4-CH3 | 1.280 | 1.312 | 1.309 |
| 18 | S | 7-OCH3 | 1.315 | 1.323 | 1.318 | |
| 19 | S | 7-OCH3 | 4-F | 1.350 | 1.348 | 1.366 |
| 20 | S | 7-OCH3 | 4-Br | 1.360 | 1.378 | 1.359 |
| 21 | S | 7-OCH3 | 4-Cl | 1.370 | 1.368 | 1.362 |
| 22 | S | 7-OCH3 | 3-F | 1.390 | 1.363 | 1.376 |
| 23T | S | 7-OCH3 | 3-Cl | 1.395 | 1.370 | 1.364 |
| 24 | S | 7-OCH3 | 4-CF3 | 1.405 | 1.433 | 1.430 |
| 25T | S | 7-OCH3 | 3,4-Cl2 | 1.420 | 1.417 | 1.408 |
| 26 | S | 7-CH3 | 4-CH3 | 1.305 | 1.323 | 1.308 |
| 27T | S | 6-CH3 | 4-CH3 | 1.320 | 1.328 | 1.308 |
| 28 | S | 4-Br | 1.420 | 1.449 | 1.428 | |
| 29 | S | 6-OCH3 | 4-CH3 | 1.330 | 1.336 | 1.326 |
| 30 | S | 6-OCH3 | 1.360 | 1.353 | 1.350 | |
| 31 | S | 6-OCH3 | 4-F | 1.380 | 1.395 | 1.403 |
| 32 | S | 6-OCH3 | 4-Br | 1.400 | 1.406 | 1.408 |
| 33 | S | 6-OCH3 | 4-Cl | 1.400 | 1.402 | 1.402 |
| 34 | S | 6-OCH3 | 3-F | 1.410 | 1.399 | 1.407 |
| 35 | S | 6-OCH3 | 3-Cl | 1.430 | 1.405 | 1.404 |
| 36 | S | 6-OCH3 | 4-CF3 | 1.440 | 1.455 | 1.451 |
| 37T | S | 6-OCH3 | 3,4-Cl2 | 1.445 | 1.451 | 1.444 |
| 38 | S | 6-OCH3 | 4-CN | 1.450 | 1.437 | 1.438 |
| 39 | S | 6-Cl | 1.420 | 1.443 | 1.420 |
T: denotes the test set.
Figure 2Scatter plot of samples for training and test sets according to the mean distances distribution
Specification of multiple linear regression model
|
|
|
|
|
|
|---|---|---|---|---|
| Relative number of H atom | X1 | -0.13 | ±0.027 | -0.796 |
| Partial positive surface area(order-3) | X2 | -0.1 | ±0.004 | -0.086 |
| Maximum electrophyl reaction index for N atom | X3 | 0.023 | ±0.002 | 0.075 |
| HOMO energy | X4 | -0.079 | ±0.051 | 1.010 |
| Maximum valency of C atom | X5 | 2.298 | ±1.012 | 8.880 |
| Constant | -7.903 | ±3.65 |
n =35, R =0.97, SE = 0.016, F =512
Figure 3Calculated. E1/2 versus Experimental E1/2 plot
Internal correlation matrix between molecular descriptors
|
|
|
|
|
| |
|---|---|---|---|---|---|
| X1 | 1.000 | 0.255 | 0.075 | 0.650 | 0.670 |
| X2 | 1.000 | -0.027 | -0.010 | -0.163 | |
| X3 | 1.000 | -0.352 | 0.009 | ||
| X4 | 1.000 | 0.253 | |||
| X5 | 1.000 |
The statistical results of ANN and MLR models
|
|
|
|
| |||
|---|---|---|---|---|---|---|
| MLR | 0.969 | 0.016 | 0.970 | 0.018 | 0.920 | 0.020 |
| R | SE | R | SE | Q2 | SPRESS | |
| ANN | 0.983 | 0.012 | 0.971 | 0.017 | 0.949 | 0.015 |
R, SE, Q2 and SPRESS are regression coefficient, standard error, correlation coefficient of cross validation and square of predictive error sum of squares respectively.
Architecture of ANN
|
|
|
|---|---|
| No. of Output Layer Nods | 1 |
| No. of Hidden Layer Nods | 2 |
| Weight Learning Rate | 0.2 |
| Bias Learning Rate | 0.6 |
| Momentum | 0.3 |
| No. of Input Layer Nods | 5 |
Figure 4Residual versus Experimental E1/2 plot
Figure 5Principal component analysis on the selected molecular descriptors for the consensus model
Figure 6Sensitivity analysis results