| Literature DB >> 34912782 |
Davood Gheidari1, Morteza Mehrdad1, Mahboubeh Ghahremani2.
Abstract
Candida albicans is a pathogenic opportunistic yeast found in the human gut flora. It may also live outside of the human body, causing diseases ranging from minor to deadly. Candida albicans begins as a budding yeast that can become hyphae in response to a variety of environmental or biological triggers. The hyphae form is responsible for the development of multidrug resistant biofilms, despite the fact that both forms have been associated to virulence Here, we have proposed a linear and SPA-linear quantitative structure activity relationship (QSAR) modeling and prediction of Candida albicans inhibitors. A data set that consisted of 60 derivatives of benzoxazoles, benzimidazoles, oxazolo (4, 5-b) pyridines have been used. In this study, that after applying the leverage analysis method to detect outliers' molecules, the total number of these compounds reached 55. SPA-MLR model shows superiority over the multiple linear regressions (MLR) by accounting 90% of the Q 2 of anti-fungus derivatives 'activity. This paper focuses on investigating the role of SPA-MLR in developing model. The accuracy of SPA-MLR model was illustrated using leave-one-out (LOO). The mean effect of descriptors and sensitivity analysis show that RDF090u is the most important parameter affecting the as behavior of the inhibitors of Candida albicans.Entities:
Keywords: Candida albicans; MLR; QSAR; benzimidazoles; biological activity
Year: 2021 PMID: 34912782 PMCID: PMC8667819 DOI: 10.3389/fchem.2021.774416
Source DB: PubMed Journal: Front Chem ISSN: 2296-2646 Impact factor: 5.221
FIGURE 1Example of SPA with n = 4 and k (0) = 3. Result of the first iteration: k (Gheidari et al., 2020) = 1.
FIGURE 2Structures of the benzoxazole, benzimidazoles and pyridine derivatives.
Chemical structure, experimental activity and predicted activities of the Candida albicans inhibitors.
| NO | Compound name | X | Y | Z | R | R1 | R2 | pIC50 exp | pIC50 pre |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2-phenylbenzo[d]oxazole | CH | O | — | H | H | H | −1.2511 | −0.9429 |
| 2 | 2−(4−tert−butylphenyl)benzo[d]oxazole | CH | O | — | C(CH3)3 | H | H | −0.7824 | −0.7398 |
| 3 | 4−(benzo[d]oxazol−2−yl)benzenamine | CH | O | — | NH2 | H | H | −1.1135 | −1.2878 |
| 4 | 4−(benzo[d]oxazol−2−yl)−N−methylbenzenamine | CH | O | — | NHCH3 | H | H | −0.9931 | −0.7232 |
| 5 | 5−chloro−2−(4−ethylphenyl)benzo[d]oxazole | CH | O | — | C2H5 | Cl | H | −0.7308 | −0.2049 |
| 6 | N−(4−(5−chlorobenzo[d]oxazol−2−yl)phenyl)acetamide | CH | O | — | NHCOCH3 | Cl | H | −0.5330 | −0.2210 |
| 7 | 4−(5−chlorobenzo[d]oxazol−2−yl)−N−methylbenzenamine | CH | O | — | NHCH3 | Cl | H | −0.7027 | outlier |
| 8 | 5−chloro−2−(4−chlorophenyl)benzo[d]oxazole | CH | O | — | Cl | Cl | H | −0.6630 | outlier |
| 9 | 5−chloro−2−(4−nitrophenyl)benzo[d]oxazole | CH | O | — | NO2 | Cl | H | −0.5926 | outlier |
| 10 | 5−nitro−2−phenylbenzo[d]oxazole | CH | O | — | H | NO2 | H | 0.4257 | 0.5189 |
| 11 | 5−nitro−2−p−phenylbenzo[d]oxazole | CH | O | — | CH3 | NO2 | H | 0.5375 | 0.6246 |
| 12 | 2−(4−tert−butylphenyl)−5−nitrobenzo[d]oxazole | CH | O | — | C(CH3)3 | NO2 | H | 0.8256 | 0.2413 |
| 13 | 4−(5−nitrobenzo[d]oxazol−2−yl)benzenamine | CH | O | — | NH2 | NO2 | H | 0.5461 | −0.0056 |
| 14 | 2−(4−chlorophenyl)−5−nitrobenzo[d]oxazole | CH | O | — | Cl | NO2 | H | 0.6837 | 0.7435 |
| 15 | 2−(4−bromophenyl)−5−nitrobenzo[d]oxazole | CH | O | — | Br | NO2 | H | 0.9589 | 1.1683 |
| 16 | 2−(4−ethylphenyl)benzo[d]oxazol−5−amine | CH | O | — | C2H5 | NH2 | H | −0.8770 | −1.4654 |
| 17 | 2−(4−fluorophenyl)benzo[d]oxazol−5−amine | CH | O | — | F | NH2 | H | −0.9587 | −1.1124 |
| 18 | 5−methyl−2−p−tolylbenzo[d]oxazole | CH | O | — | CH3 | CH3 | H | −1.001 | −0.9365 |
| 19 | 2−(4−ethylphenyl)−5−methylbenzo[d]oxazole | CH | O | — | C2H5 | CH3 | H | −0.8856 | −1.2869 |
| 20 | 2−(4−methoxyphenyl)−5−methylbenzo[d]oxazole | CH | O | — | OCH3 | CH3 | H | −0.8727 | −0.5214 |
| 21 | 4−(5−nitrobenzo[d]oxazol−2−yl)benzenamine | CH | O | — | F | CH3 | H | −0.9673 | −0.3499 |
| 22 | N−(4−(5−methylbenzo[d]oxazol−2−yl)phenyl)acetamide | CH | O | — | NHCOCH3 | CH3 | H | −0.6706 | −0.8745 |
| 23 | N−methyl−4−(5−methylbenzo[d]oxazol−2−yl)benzenamine | CH | O | — | NHCH3 | CH3 | H | −0.8770 | −0.6418 |
| 24 | N,N−dimethyl−4−(5−methylbenzo[d]oxazol−2−yl)benzenamine | CH | O | — | N(CH3)2 | CH3 | H | −0.7695 | −0.4968 |
| 25 | 2−p−tolyloxazolo[4,5−b]pyridine | N | O | — | CH3 | H | H | 0.1806 | −0.0856 |
| 26 | 2−(4−ethylphenyl)oxazolo[4,5−b]pyridine | N | O | — | C2H5 | H | H | 0.3010 | 1.1657 |
| 27 | 2−(4−methoxyphenyl)oxazolo[4,5−b]pyridine | N | O | — | OCH3 | H | H | 0.3182 | 0.2267 |
| 28 | 2−(4−ethoxyphenyl)oxazolo[4,5−b]pyridin | N | O | — | OC2H5 | H | H | 0.4300 | 1.1248 |
| 29 | 4−(oxazolo[4,5−b]pyridin−2−yl)benzenamine | N | O | — | NH2 | H | H | 0.1892 | 0.4186 |
| 30 | 2−(4−nitrophenyl)oxazolo[4,5−b]pyridine | N | O | — | NO2 | H | H | 0.4386 | 0.0819 |
| 31 | 2−(4−chlorophenyl)−5−nitrobenzo[d]oxazole | CH | O | CH2 | H | H | H | 0.1720 | −0.3748 |
| 32 | 2−(4−methoxybenzyl)benzo[d]oxazole | CH | O | CH2 | OCH3 | H | H | 0.4257 | 0.3954 |
| 33 | 2−(4−chlorobenzyl)benzo[d]oxazole | CH | O | CH2 | Cl | H | H | 0.4601 | 0.1951 |
| 34 | 2−(4−nitrobenzyl)benzo[d]oxazole | CH | O | CH2 | NO2 | H | H | 0.5375 | −0.0171 |
| 35 | 2−benzyl−5−chlorobenzo[d]oxazole | CH | O | CH2 | H | Cl | H | 0.4601 | 0.1356 |
| 36 | 2−(4−methoxybenzyl)−5−chlorobenzo[d]oxazole | CH | O | CH2 | OCH3 | Cl | H | 0.6751 | 0.8628 |
| 37 | 2−(4−bromobenzyl)−5−chlorobenzo[d]oxazole | CH | O | CH2 | Br | Cl | H | 0.9761 | 1.2167 |
| 38 | 2−(4−nitrobenzyl)−5−chlorobenzo[d]oxazole | CH | O | CH2 | NO2 | Cl | H | 0.7740 | 0.6188 |
| 39 | 2−benzyl−5−nitrobenzo[d]oxazole | CH | O | CH2 | H | NO2 | H | 1.8317 | 1.5786 |
| 40 | 2−(4−methoxybenzyl)−5−nitrobenzo[d]oxazole | CH | O | CH2 | OCH3 | NO2 | H | 2.0381 | 1.5331 |
| 41 | 2−(4−bromobenzyl)−5−nitrobenzo[d]oxazole | CH | O | CH2 | Br | NO2 | H | 2.3305 | 2.1263 |
| 42 | 2−(4−chlorobenzyl)−5−nitrobenzo[d]oxazole | CH | O | CH2 | Cl | NO2 | H | 2.0682 | 1.8156 |
| 43 | 2−(4−nitrobenzyl)−5−nitrobenzo[d]oxazole | CH | O | CH2 | NO2 | NO2 | H | 2.1370 | 1.8988 |
| 44 | 5−methyl−2−(phenoxymethyl)benzo[d]oxazole | CH | O | CH2O | H | CH3 | H | −0.8727 | −0.9144 |
| 45 | 6−nitro−2−(phenoxymethyl)benzo[d]oxazole | CH | O | CH2O | H | CH3 | H | −1.9391 | −0.2366 |
| 46 | 5−chloro−6−nitro−2−(phenoxymethyl)benzo[d]oxazole | CH | O | CH2O | H | CH3 | H | −1.7112 | −1.3976 |
| 47 | 2−((4−chlorophenoxy)methyl)−5−chloro−6−nitrobenzo[d]oxazole | CH | O | CH2O | Cl | Cl | NO2 | −1.5134 | −1.9055 |
| 48 | 5−nitro−2−((phenylthio)methyl)benzo[d]oxazole | CH | O | CH2S | H | NO2 | H | 0.7568 | 0.5983 |
| 49 | 5−methyl−2−((phenylthio)methyl)benzo[d]oxazole | CH | O | CH2S | H | CH3 | H | −0.7480 | −0.1654 |
| 50 | 2−(phenoxymethyl)oxazolo[4,5−b]pyridine | N | O | CH2O | H | H | H | 0.3311 | 0.0363 |
| 51 | 2−((p−tolyloxy)methyl)oxazolo[4,5−b]pyridine | N | O | CH2O | H | H | H | 0.5848 | 0.3516 |
| 52 | 2−((4−chlorophenoxy)methyl)−5−methyl−1H−benzo[d]imidazole | CH | NH | CH2O | Cl | CH3 | H | −0.6276 | −1.2245 |
| 53 | 5−nitro−2−((phenylthio)methyl)−1H−benzo[d]imidazole | CH | NH | CH2S | H | NO2 | H | 0.7525 | 1.4148 |
| 54 | 5−methyl−2−((phenylthio)methyl)−1H−benzo[d]imidazole | CH | NH | CH2S | H | CH3 | H | −0.7480 | −0.9096 |
| 55 | methyl 2−(phenoxymethyl)benzo[d]oxazole−5−carboxylate | CH | O | CH2O | H | COOCH3 | H | −0.5545 | −0.4934 |
| 56 | methyl 2−((4−chlorophenoxy)methyl)benzo[d]oxazole−5−carboxylate | CH | O | CH2S | Cl | COOCH3 | H | −0.3396 | −0.6923 |
| 57 | methyl 2−((4−chlorophenoxy)methyl)−1H−benzo[d]imidazole−5−carboxylate | CH | NH | CH2O | Cl | COOCH3 | H | −0.3482 | −0.2209 |
| 58 | methyl 2−((phenylthio)methyl)−1H−benzo[d]imidazole−5−carboxylate | CH | NH | CH2S | H | COOCH3 | H | −0.4600 | −0.2282 |
| 59 | 5−nitro−2−phenethylbenzo[d]oxazole | CH | O | C2H4 | H | NO2 | H | 0.6885 | outlier |
| 60 | 2−phenethyloxazolo[4,5−b]pyridine | N | O | C2H4 | H | H | H | 0.3671 | outlier |
List of molecular descriptors that created by Dragon program.
| Group name | Dimensionality | No. of descriptors | No. of descriptors in model |
|---|---|---|---|
| Constitutional descriptors | 0 | 47 | 31 |
| Functional groups | 1 | 121 | 16 |
| Atom−centered fragments | 1 | 120 | 14 |
| Empirical descriptors | 1 | 3 | 2 |
| Properties | 1 | 3 | 2 |
| Topological descriptors | 2 | 266 | 42 |
| Molecular walk counts | 2 | 21 | 4 |
| BCUT descriptors | 2 | 64 | 7 |
| Galvez topological charge indices | 2 | 21 | 3 |
| 2D autocorrelation descriptors | 2 | 96 | 18 |
| Charge descriptors | 3 | 14 | 9 |
| Aromaticity indices | 3 | 4 | 2 |
| Randic molecular profiles | 3 | 41 | 11 |
| Geometrical descriptors | 3 | 70 | 10 |
| RDF descriptors | 3 | 150 | 21 |
| 3D−MoRSE descriptors | 3 | 160 | 23 |
| WHIM descriptors | 3 | 99 | 17 |
| GETAWAY descriptors | 3 | 197 | 25 |
| Sum | 1,497 | 257 |
Selected descriptors of multiple linear regression.
| Descriptor | Type of descriptor | Notation | Coefficient |
|---|---|---|---|
| Information Content index (neighborhood symmetry of 2−order) | Information | IC2 | 0 |
| highest eigenvalue n. 8 of Burden matrix/weighted by atomic masses | BCUT | BEHm8 | −0.295 |
| quadrupole x−component value | Geometrical | Qxxe | 0.27 |
| Radial Distribution Function − 105 | RDF | RDF105m | 0.22 |
| Radial Distribution Function − 050/weighted by van der Waals volume | RDF | RDF050v | 0.285 |
| signal 16/unweighted | 3D−MoRSE | Mor16u | −0.673 |
| signal 22/unweighted | 3D−MoRSE | Mor22u | 0.193 |
| signal 32/unweighted | 3D−MoRSE | Mor32u | 0.41 |
| signal 16/unweighted | 3D−MoRSE | Mor16m | 0.32 |
| signal 13/unweighted | 3D−MoRSE | Mor31m | 0.416 |
| 2nd component accessibility directional WHIM index/weighted by van der Waals volume | WHIM | E2V | 0.149 |
| signal 30/weighted by van der Waals volume | 3D−MoRSE | Mor30V | 0.281 |
R Cal = 0.96, R Pre = 0.60, SE Cal = 0.17, SE Pre = 0.73, F = 6.675, REP % = 2.2.
FIGURE 3The Leverage graph based on the number of samples.
FIGURE 4The effect of the number of descriptors on the value of R2 in the Stepwise-MRL model.
Correlation matrix for the twelve chosen descriptors by means of Stepwise−MRL.
| IC2 | BEHm8 | QXXe | RDF105m | RDF050v | Mor16u | Mor22u | Mor32u | Mor16m | Mor31m | Mor30v | E2v | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IC2 | 1 | — | — | — | — | — | — | — | — | — | — | — |
| BEHm8 | 0.71289 | 1 | — | — | — | — | — | — | — | — | — | — |
| QXXe | 0.291867 | 0.449767 | 1 | — | — | — | — | — | — | — | — | — |
| RDF105m | 0.324553 | 0.366036 | −0.21287 | 1 | — | — | — | — | — | — | — | — |
| RDF050v | 0.046017 | 0.325575 | 0.887022 | −0.18233 | 1 | — | — | — | — | — | — | — |
| Mor16u | −0.52103 | −0.47151 | −0.14793 | −0.5522 | 0.151878 | 1 | — | — | — | — | — | — |
| Mor22u | −0.04642 | −0.10047 | 0.569542 | −0.05905 | 0.558575 | 0.115754 | 1 | — | — | — | — | — |
| Mor32u | 0.339555 | 0.123565 | 0.448986 | −0.50284 | 0.085389 | −0.25631 | 0.324334 | 1 | — | — | — | — |
| Mor16m | −0.26427 | −0.3979 | −0.51787 | −0.17431 | −0.35125 | 0.599117 | −0.27855 | −0.36023 | 1 | — | — | — |
| Mor31m | −0.24366 | −0.16379 | 0.339254 | −0.49366 | 0.411267 | 0.179403 | 0.229171 | 0.227343 | −0.35411 | 1 | — | — |
| Mor30v | 0.056618 | −0.09594 | −0.6632 | 0.325611 | −0.47839 | 0.35268 | −0.27186 | −0.54116 | 0.522641 | −0.52574 | 1 | — |
| E2v | 0.23577 | 0.498444 | 0.836901 | −0.32191 | 0.658324 | −0.18101 | 0.36587 | 0.600501 | −0.58302 | 0.238531 | −0.65492 | 1 |
Statistical parameters and the name of descriptors in SPA−MLR model.
| Descriptor | Type of descriptor | Notation | Coefficient |
|---|---|---|---|
| Moran autocorrelation of lag 8 weighted by van der Waals volume | 2D autocorrelations | MATS8v | 0.166586 |
| Geary autocorrelation of lag 5 weighted by Sanderson electronegativity | 2D autocorrelations | GATS5e | −0.23697 |
| Geary autocorrelation of lag 7 weighted by Sanderson electronegativity | 2D autocorrelations | GATS7e | −0.17958 |
| Harmonic Oscillator Model of Aromaticity index | Geometrical | HOMA | −0.172 |
| Radial Distribution Function − 090/unweighted | RDF | RDF090u | 0.331935 |
| Radial Distribution Function − 030/weighted by mass | RDF | RDF030m | −0.61428 |
| signal 13/unweighted | 3D−MoRSE | Mor13u | 0.42232 |
| signal 14/unweighted | 3D−MoRSE | Mor14u | 0.16831 |
| signal 32/unweighted | 3D−MoRSE | Mor32u | 0.497132 |
| signal 07/weighted by van der Waals volume | 3D−MoRSE | Mor07v | −0.29036 |
| signal 25/weighted by van der Waals volume | 3D−MoRSE | Mor25v | −0.36172 |
| leverage−weighted autocorrelation of lag 1/unweighted | GETAWAY | HATS1u | 0.131395 |
| H autocorrelation of lag 7/weighted by mass | GETAWAY | H7m | 0.085106 |
SEc = 0.30, R2 = 0.89.
The correlation matrix for the thirteen selected descriptors using SPA−MLR method.
| MATS8v | GATS5e | GATS7e | HOMA | RDF090u | RDF030m | Mor13u | Mor14u | Mor32u | Mor07v | Mor25v | HATS1u | H7m | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MATS8v | 1 | — | — | — | — | — | — | — | — | — | — | — | — |
| GATS5e | −0.30741 | 1 | — | — | — | — | — | — | — | — | — | — | — |
| GATS7e | −0.19173 | −0.362 | 1 | — | — | — | — | — | — | — | — | — | — |
| HOMA | 0.230387 | 0.434716 | 0.019335 | 1 | — | — | — | — | — | — | — | — | — |
| RDF090u | 0.114087 | −0.18651 | 0.010619 | −0.73187 | 1 | — | — | — | — | — | — | — | — |
| RDF030m | 0.65665 | −0.09176 | 0.366094 | 0.283897 | 0.21654 | 1 | — | — | — | — | — | — | — |
| Mor13u | 0.178512 | −0.02735 | 0.518447 | 0.26828 | −0.13945 | 0.606682 | 1 | — | — | — | — | — | — |
| Mor14u | −0.04171 | 0.252716 | −0.53748 | −0.15829 | −0.01353 | −0.19766 | −0.19999 | 1 | — | — | — | — | — |
| Mor32u | 0.003791 | −0.03079 | −0.44505 | −0.30275 | 0.273409 | −0.46963 | −0.43707 | 0.091052 | 1 | — | — | — | — |
| Mor07v | −0.03777 | 0.086135 | 0.019646 | −0.3145 | 0.405478 | 0.105646 | 0.354031 | 0.31194 | 0.123109 | 1 | — | — | — |
| Mor25v | −0.09809 | 0.477063 | 0.082629 | 0.173332 | 0.11963 | 0.349208 | 0.255182 | 0.201301 | −0.07042 | 0.305127 | 1 | — | — |
| HATS1u | −0.26587 | −0.38164 | −0.1431 | −0.66778 | 0.094499 | −0.51172 | −0.08684 | 0.406374 | 0.202037 | 0.314072 | −0.38987 | 1 | — |
| H7m | 0.419392 | −0.25319 | 0.343601 | 0.35923 | −0.03676 | 0.623623 | 0.286597 | −0.11998 | −0.13062 | −0.23068 | 0.300966 | −0.46849 | 1 |
FIGURE 5Experimental pIC50 versus calculated pIC50 plot in internal validation.
FIGURE 6Experimental pIC50 versus residual plot in internal validation.
Statistical results of SPA−MLR model compared to Stepwise−MRL model in external validation method.
| Q2 | SEP | REP % | |
|---|---|---|---|
| Stepwise−MRL | 0.60 | 0.73 | 2.2 |
| SPA−MLR | 0.90 | 0.36 | 1.1 |
FIGURE 7Experimental pIC50 versus residual plot in external validation.
FIGURE 8Relative mean effects for SPA-MLR model.