| Literature DB >> 31973244 |
Cátia Teixeira1, Cristina Ventura2, José R B Gomes3, Paula Gomes1, Filomena Martins4.
Abstract
Tuberculosis, caused by Mycobacterium tuberculosis (Mtb), remains one of the top ten causes of death worldwide and the main cause of mortality from a single infectious agent. The upsurge of multi- and extensively-drug resistant tuberculosis cases calls for an urgent need to develop new and more effective antitubercular drugs. As the cinnamoyl scaffold is a privileged and important pharmacophore in medicinal chemistry, some studies were conducted to find novel cinnamic acid derivatives (CAD) potentially active against tuberculosis. In this context, we have engaged in the setting up of a quantitative structure-activity relationships (QSAR) strategy to: (i) derive through multiple linear regression analysis a statistically significant model to describe the antitubercular activity of CAD towards wild-type Mtb; and (ii) identify the most relevant properties with an impact on the antitubercular behavior of those derivatives. The best-found model involved only geometrical and electronic CAD related properties and was successfully challenged through strict internal and external validation procedures. The physicochemical information encoded by the identified descriptors can be used to propose specific structural modifications to design better CAD antitubercular compounds.Entities:
Keywords: Mycobacterium tuberculosis; QSAR model.; antitubercular agents; cinnamic acids; multi-linear regression analysis
Mesh:
Substances:
Year: 2020 PMID: 31973244 PMCID: PMC7037561 DOI: 10.3390/molecules25030456
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Structure of the cinnamic acid derivatives (CAD) used to derive the multiple linear regression (MLR)-based model. Compounds marked with an asterisk belong to the test set.
|
| |||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| [ | H | farnesyl | H | 1.28 |
| [ | H | isopentenyl | H | 168.23 |
|
| [ | H | isopentenyl | H | 95.97 |
| [ | OCH3 | H | H | 23.78 |
|
| [ | H | methyl | H | 225.52 |
| [ | H | methyl | H | 950.00 |
|
| [ | OCH3 | H | OCH3 | 384.16 |
| [ | H | isopentenyl | H | 2.30 |
|
| [ | OCH3 | H | H | 423.21 |
| [ | H | CF3 | H | 1.10 |
|
| [ | H | H | H | 237.44 |
| [ | H | CF3CH2 | H | 2.20 |
|
| [ | OCH3 | H | H | 27.94 |
| [ | H | geranyl | H | 1.90 |
|
| [ | H | H | H | 31.21 |
| [ | H | ethyl | H | 1.30 |
|
| [ | H | geranyl | H | 0.26 |
| [ | H | isopentenyl | H | 21.00 |
|
| [ | H | isopentenyl | H | 199.11 |
| [ | H | CF3CH2 | H | 20.00 |
|
| [ | H | isopentenyl | H | 51.88 |
| [ | H | ethyl | H | 12.00 |
|
| [ | H | methyl | H | 247.75 |
| [ | H | CF3 | H | 21.00 |
|
| [ | OCH3 | methyl | H | 439.65 |
| [ | H | geranyl | H | 72.00 |
|
| [ | H | methyl | H | 1560.59 |
| [ | H | methyl | H | 50.00 |
|
| [ | H | geranyl | H | 72.30 | ||||||
1 Compound identified as outlier. 2 MIC values were retrieved from references [11,16]. * Test set compounds.
Best model found by MLR analysis: pMIC = a0 + a1a3 + a2a1 + a3PSA + a4HansPol. Molecular descriptors are in bold, and values in parenthesis correspond to the significance level of each adjusted parameter. All descriptors were normalized.
|
| ||||
|---|---|---|---|---|
| a0 ± s(a0) | a1a31 ± s(a1) | a2a11 ± s(a2) | a3PSA2 ± s(a3) | a4HansPol3 ± s(a4) |
| −5.061 ± 0.442 | 2.899 ± 0.332 | 2.082 ± 0.268 | 1.673 ± 0.311 | −1.800 ± 0.354 |
1a1 and a3 correspond to the angles represented in picture of Table 2. 2 PSA: polar surface area. 3 HansPol: Hansen polarity.
Summary of statistical results for the best-found quantitative structure–activity relationships (QSAR) model.
| Set |
|
|
|
|
|
|
|
|
|
| Δ |
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training | 19 | 0.357 | 0.909 | 35 | - | - | - | - | 0.930 | - | - | - |
| Test | 7 | 0.297 | 0.920 | 58 | 0.913 | 0.100 | 0.260 | 0.294 | 0.933 | 0.879 | 0.070 | 0.953 |
1 Number of compounds. 2 Standard deviation of fit. 3 Determination coefficient. 4 The F statistics.5 Determination coefficient of regression through the origin. 6 Average error. 7 Absolute average error. 8 Root-mean square error. 9 Cross-validation correlation coefficient. 10 Average value between observed vs. predicted and predicted vs. observed Roy’s parameter, r2m, for the test set. 11 Absolute difference between observed vs. predicted and predicted vs. observed Roy’s parameter, r2m, for the test set. 12 Concordance correlation coefficient.
Figure 1Log(1/MIC)pred vs. log(1/MIC)exp according to the best built QSAR model.
Figure 2Williams plot for the built model representing the leverage values for the training and test set compounds.