| Literature DB >> 35552494 |
Ang Lu1, Shi-Meng Yuan2, Huai Xiao2,3, Da-Song Yang2, Zhi-Qiong Ai1, Qi-Yan Li4,3, Yu Zhao2,3, Zhuang-Zhi Chen5, Xiu-Mei Wu6,7.
Abstract
Phenolic compounds (PCs) could be applied to reduce reactive oxygen species (ROS) levels, and are used to prevent and treat diseases related to oxidative stress. QSAR study was applied to elucidate the relationship between the molecular descriptors and physicochemical properties of polyphenol analogues and their DPPH radical scavenging capability, to guide the design and discovery of highly-potent antioxidant substances more efficiently. PubMed database was used to collect 99 PCs with antioxidant activity, whereas, 105 negative PCs were found in ChEMBL database; their molecular descriptors were generated with Python's Rdkit package. While the molecular descriptors significantly related to the antioxidant activity of PCs were filtered by t-test. The prediction QSAR model was then established by discriminant analysis, and the obtained model was verified by the back-substitution and Leave-One-Out cross-validation methods along with heat map. It was revealed that the anti-DPPH radical activity of PCs was correlated with the drug-likeness and molecular fingerprints, physicochemical, topological, constitutional and electronic property. The established QSAR model could explicitly predict the antioxidant activity of polyphenols, thus were applicable to evaluate the potential of candidates as antioxidants.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35552494 PMCID: PMC9098848 DOI: 10.1038/s41598-022-11925-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Scheme 1The red area is the process of data mining, yellow is the process of filtering molecular descriptor, and blue is the process of model fitting evaluation.
Results of filtering molecular descriptors by t-test.
| Descriptors | Typea | Descriptors | Typea | Descriptors | Typea | |||
|---|---|---|---|---|---|---|---|---|
| qed | − 10.570 | DF | PEOE_VSA14 | 4.316 | P | HeavyAtomCount | 4.415 | C |
| MinAbsPartialCharge | 8.807 | E | PEOE_VSA2 | 6.152 | P | NHOHCount | 6.640 | C |
| FpDensityMorgan2 | − 4.704 | DF | PEOE_VSA3 | 2.684 | P | NOCount | 6.429 | C |
| FpDensityMorgan3 | − 4.755 | DF | PEOE_VSA6 | − 6.030 | P | NumAromaticCarbocycles | 4.053 | C |
| FpDensityMorgan1 | − 4.376 | DF | PEOE_VSA8 | 2.058 | P | NumAromaticRings | 3.794 | C |
| HeavyAtomMolWt | 4.695 | P | PEOE_VSA9 | 2.629 | P | NumHAcceptors | 6.024 | C |
| MaxAbsEStateIndex | 6.917 | T | SMR_VSA1 | 7.549 | P | NumHDonors | 7.201 | C |
| MaxAbsPartialCharge | − 7.485 | E | SMR_VSA10 | 5.525 | P | NumHeteroatoms | 6.315 | C |
| MaxEStateIndex | 6.917 | T | SMR_VSA3 | − 4.382 | P | NumSaturatedHeterocycles | − 2.706 | C |
| MinPartialCharge | 7.485 | E | SMR_VSA4 | − 2.525 | P | NumSaturatedRings | − 2.230 | C |
| ExactMolWt | 4.331 | P | SMR_VSA5 | − 2.577 | P | MolMR | 2.875 | P |
| MolWt | 4.330 | P | SMR_VSA6 | − 5.399 | P | fr_Al_COO | 3.677 | C |
| NumValenceElectrons | 3.988 | E | SMR_VSA9 | 9.010 | P | fr_Ar_OH | 10.104 | C |
| MinEStateIndex | − 7.975 | T | SlogP_VSA11 | 8.878 | P | fr_COO | 3.849 | C |
| MinAbsEStateIndex | − 10.317 | T | SlogP_VSA2 | 2.459 | P | fr_COO2 | 3.849 | C |
| MaxPartialCharge | 8.807 | E | SlogP_VSA3 | 2.541 | P | fr_C_O | 8.317 | C |
| BertzCT | 6.238 | C | SlogP_VSA5 | − 2.360 | P | fr_C_O_noCOO | 7.579 | C |
| Chi0 | 4.749 | P | SlogP_VSA8 | 4.451 | P | fr_NH0 | − 5.262 | C |
| Chi0n | 2.881 | P | TPSA | 7.602 | P | fr_Ndealkylation2 | − 4.482 | C |
| Chi0v | 2.792 | P | EState_VSA1 | 3.270 | T | fr_aldehyde | 3.849 | C |
| Chi1 | 4.148 | P | EState_VSA10 | 6.841 | T | fr_allylic_oxid | 4.613 | C |
| HallKierAlpha | − 9.187 | P | EState_VSA2 | 12.192 | T | fr_aryl_methyl | − 2.603 | C |
| Kappa1 | 3.852 | P | EState_VSA3 | 3.701 | T | fr_benzene | 4.053 | C |
| Kappa2 | 2.233 | P | EState_VSA4 | − 5.536 | T | fr_bicyclic | 2.508 | C |
| Kappa3 | 2.431 | P | EState_VSA7 | − 2.701 | T | fr_ester | 2.899 | C |
| LabuteASA | 3.827 | P | EState_VSA8 | − 6.401 | T | fr_ether | 2.749 | C |
| PEOE_VSA1 | 5.858 | P | EState_VSA9 | 4.202 | T | fr_phenol | 10.104 | C |
| PEOE_VSA11 | 9.775 | P | VSA_EState9 | 6.962 | T | fr_phenol_noOrthoHbond | 9.761 | C |
| PEOE_VSA12 | 3.818 | P | FractionCSP3 | − 8.698 | C | fr_piperdine | − 3.833 | C |
All the P values were less than 0.05.
aThe types of molecular descriptors: “DF” denotes drug-likeness and molecular fingerprints “P” denotes physicochemical property, “T” denotes topological property, “C” denotes constitutional property, “E” denotes electronic property.
Results of discriminant analysis.
| Independent Variable | Molecular descriptors | Lambda | Coefficients of positive samples | Coefficients of negative samples | |
|---|---|---|---|---|---|
| X1 | qed | 0.408 | < 0.001 | 47.691 | 62.455 |
| X2 | MinAbsPartialCharge | 0.337 | < 0.001 | 19.218 | − 52.885 |
| X3 | FpDensityMorgan2 | 0.177 | < 0.001 | − 67.637 | − 112.116 |
| X4 | FpDensityMorgan3 | 0.172 | < 0.001 | 14.345 | 30.362 |
| X5 | FpDensityMorgan1 | 0.239 | < 0.001 | 151.179 | 200.918 |
| X6 | MinAbsEStateIndex | 0.337 | < 0.001 | 23.987 | 43.389 |
| X7 | Kappa2 | 0.303 | < 0.001 | 0.353 | 2.401 |
| X8 | PEOE_VSA6 | 0.229 | < 0.001 | 0.416 | 0.820 |
| X9 | SMR_VSA4 | 0.258 | < 0.001 | − 0.458 | − 0.078 |
| X10 | SlogP_VSA5 | 0.182 | < 0.001 | − 0.481 | − 0.614 |
| X11 | VSA_EState9 | 0.196 | < 0.001 | 2.353 | 1.871 |
| X12 | NOCount | 0.211 | < 0.001 | − 9.660 | − 5.499 |
| X13 | fr_C_O_noCOO | 0.248 | < 0.001 | − 10.962 | − 14.661 |
| X14 | fr_allylic_oxid | 0.380 | < 0.001 | − 2.070 | − 3.966 |
| X15 | fr_aryl_methyl | 0.167 | < 0.001 | 2.719 | 3.980 |
| X16 | fr_ester | 0.275 | < 0.001 | − 7.376 | -0.809 |
Figure 1Relationship between antioxidant activity and molecular descriptors of PCs.
Classification results of back-substitution method.
| Predicted group membership | Actual classification | Total | |
|---|---|---|---|
| Positive samples (%) | Negative sample (%) | ||
| Positive samples | 96 (96.97%) | 0 (0%) | 96 |
| Negative sample | 3 (3.03%) | 105 (100.00%) | 108 |
| Total | 99 | 105 | 204 |
.McNemar’s test: P = 0.250.
Classification results of jackknife cross-validation method.
| Predicted group membership | Actual classification | Total | |
|---|---|---|---|
| Positive samples (%) | Negative sample (%) | ||
| Positive samples | 95 (95.96%) | 1 (0.95%) | 96 |
| Negative sample | 4 (4.04%) | 104 (99.05%) | 108 |
| Total | 99 | 105 | 204 |
McNemar’s test: P = 0.375.
Figure 2Heat map of the molecular descriptors matrix.