Literature DB >> 23407575

Forward Modeling of the Coumarin Antifungals; SPR/SAR Based Perspective.

Saeed Soltani1, Shima Dianat, Soroush Sardari.   

Abstract

Although, coumarins are a group of compounds which are naturally found in some plants, they can be synthetically produced as well. Because of their diverse derivatives, origin and properties most of them can be used for medicinal purposes. For example, they can be used against fungal diseases or in studying structure and biological properties of antifungal agents to discover new compounds with the similar activity. A Structure Property/Activity Relationship (SAR) can be utilized in prediction of biological activity of desired molecules.In order to represent a relationship between the physicochemical properties of coumarin compounds and their biological activities, 68 coumarins and coumarin derivatives with already reported antifungal activities were selected and eleven attributes were generated. The descriptors were used to perform artificial neural network (ANN) and to build a model for predicting effectiveness of the new ones. The correlation coefficient between the experimental and the predicted MIC values pertaining to all the coumarins was 0.984. This study paves the way for further researches about antifungal activity of coumarins, and offers a powerful tool in modeling and prediction of their bioactivities.

Entities:  

Keywords:  Antifungal activity; Coumarin; Modeling; Neural network

Year:  2009        PMID: 23407575      PMCID: PMC3558124     

Source DB:  PubMed          Journal:  Avicenna J Med Biotechnol        ISSN: 2008-2835


Introduction

During the last two decades, human fungal infections have increased among immune compromised individuals (1). Candida albicans (C. albicans) is the major agent of candidosis in humans (2) which is the commonest invasive fungal infection in patients with malignant haematological disease and in bone marrow transplant recipients (3). One common cause of mortality among hospitalized patients is nosocomial infection due to opportunistic fungal pathogens (4). The development of azole-based antifungal drugs has revolutionized the treatment of many fungal infections, but therapy may still necessitate application of the highly toxic drug amphotericin B or a combination of drugs. Due to rapid emergence of resistance in fungal pathogens to the conventional drugs, discovery of new potent antifungal compounds is necessary. Plant extracts containing coumarin derivatives demonstrate antifungal activity (5) and some synthetic coumarin derivatives are also active against the yeast C. albicans (6). Coumarin is a benzopyrone and a naturally occurring constituent of many plants and essential oils, including tonka beans, sweet clover, woodru, oil of cassia and lavender (7). The presence of phenolic, hydroxy and carboxylic acid groups on the coumarin nucleus has been considered necessary for antimicrobial activity (8). The coumarins are extremely variable in structure and due to the various types of substitutions in the basic structural form their biological activity is influenced (9). As a result, a lot of biological parameters should be evaluated to increase our understanding of the mechanisms by which these coumarins act and a careful structure-property/activity-relationship study of coumarins should be conducted. The so called "Cheminformatics" was introduced to the common use. It is often described as part of the analytical chemistry that by making use of mathematics, probability theory, mathematical statistics, as well as the decision-making theory and computer techniques, has been applied to a diverse range of problems in the field of chemistry (10). By combining together the elements of informatics and chemical analysis, cheminformatics appeared to be particularly useful in the professional work of pharmacists. It is concerned with the search for new chemical compounds as potential drugs, clinical analysis of these compounds, optimization of drug formulation, evaluation of its quality as well as leading to recognition of complicated processes in which the drug substances are involved in a human organism (11). Among the multivariate analyses used in the cheminformatics, the principal component analysis (PCA), cluster analysis (CA) and artificial neural networks (ANNs) have been the most widely used methods (12). Their valuable features are that they can present the correct interpretation of the measured data and obtain the maximum useful information from them (13). A feed-forward Multi-layer Perceptron (MLP) neural network is the most commonly used paradigm in medicinal chemistry. They usually consist of an input layer, one output layer and one or two hidden or middle layer(s). All units in one layer are connected to all the units in the next layers (14). The signals flow from the first input layer forward through hidden nodes, where a weighed sum of inputs is computed and passed through activation function and the result is finally presented to the output layer. This process is called “feed-forward” (15). A proper weight setting is not known beforehand and hence, initially, the weights are given a random value. The process of updating the weights to a correct set of values is called “Training or Learning”, which is mostly achieved by means of Backpropagation (BP) algorithm (16). The BP is a generalization of the least mean squared algorithm that modifies network weight to minimize the mean squared error between the desired and actual outputs of the network. The BP uses supervised learning in which the network is trained using data for which inputs as well as desired outputs are known (17). The application of ANN's in solving different problems in pharmacy is receiving growing attention when it comes to data analysis problems (18). It is mainly because they are applicable in every situation in which a relationship between predictor variables (inputs) and predicted variables (output) exists, even when that relationship is very complex and not easy to express in the usual terms of correlation or differences between groups. Therefore, anywhere that there are problems of prediction, classification or control, neural networks prove to be helpful (19). Accordingly, in this study, neural computing is used for building an efficient model in order to evaluate the relationship between physicochemical properties and bioactivity of antifungal coumarins.

Materials and Methods

Data set

The data set was composed of 68 coumarins and coumarin derivatives selected on the basis of antifungal activity. Antifungal activity of compounds from Table 1 that were screened by the well dilution method has been taken from the literature (20–27).
Table 1

Structure and bioactivity of studied coumarins

NumberCompoundMIC(µg/ml) observedMIC(µg/ml) predicted*Ref
1 62.529120
2 25029020
3 25029020
4 25029020
5 100026420
6 100028220
7 100030220
8 200034120
9 100028220
10 25027420
11 25013720
12 25012520
13 100028520
14 100029320
15 100026220
16 100015920
17 100027220
18 100023020
19 50016620
20 100022520
21 25027920
22 100026920
23 100028120
24 25030920
25 50028020
26 50028620
27 50030120
28 50031520
29 25029020
30 50028420
31 50027920
32 62.529020
33 6420521
34 7023721
35 8027921
36 2525221
37 93.7523221
38 51226722
39 6432222
40 78.7518123
41 22.623023
42 42.6528423
43 31.429023
44 16.6526423
45 518924
46 2521524
47 50027025
48 15.613725
49 15.613825
50 31.313125
51 15.613625
52 15.614325
53 7.814125
54 12512925
55 7.813625
56 25028226
57 25020526
58 25028726
59 3752355727
60 3321333227
61 4310377427
62 1979168227
63 3478304127
64 2705254727
65 2150234327
66 2035249027
67 3256248627
68 1870171427

The observed MICs and structures of coumarin compounds are derived from mentioned references in the table, but predicted MICs have been calculated by our ANN model.

Structure and bioactivity of studied coumarins The observed MICs and structures of coumarin compounds are derived from mentioned references in the table, but predicted MICs have been calculated by our ANN model. Authors encountered problems related to reporting of antifungal activity according to the two different forms of minimal inhibitory concentration (MIC) and 50% inhibitory concentration (IC50) which disabled the analysis of data set with adequate care. To make the dataset uniform, we multiplied the IC50 values by two to obtain a close equivalent of MIC level. Thus, the number generated is approximately equal to MIC for complete inhibition. Preliminary results have shown that coumarins possess considerable antifungal activity (5). Therefore, antifungal screening results of isolates of C. albicans were used for the modeling of activity against this microorganism.

Descriptors generation

Eleven attributes have been generated for the description of selected coumarin deriveatives that included eight quantum chemical descriptors; molar refractivity (cm 3), molar volume (cm 3), parachor (cm 3), index of refraction, surface tension (dyne/cm), density (g/cm), polarizability (10-24cm 3), molecular mass (Da) and three regular calculated descriptors (% carbon, % hydrogen, % oxygen). Calculation of quantum chemical descriptors was preceded by molecular geometry optimization based on the PM3 semiempirical approach. Both semiempirical and regular calculations were carried out by ACDLAB 11.02 release 21, May 2008 for in vacuo systems. Besides, quantum chemical descriptors, the regular calculated descriptors, % carbon, % hydrogen, and % oxygen) were included in the pool that make better understanding of structure–function activity of coumarin antifungal.

Learning tools

In this study the artificial neural network application of Easy-NNplus 8.0 release 2007, was utilized for SAR model development. Since this technique has been thoroughly described in the reference (28), a detailed description of the method has been omitted. However, a specific implementation of the method for this study is given below. A standard feed-forward network, with back propagation rule and with one, two or three hidden layer architecture was chosen. The physico-chemical descriptors were used as the inputs, while MIC was the output of the network architecture. In order to avert an over-fitting problem, which is usually produced by more weights due to higher numbers of neurons in input and hidden layers (29), the number of neurons was kept to minimum. However, to produce the optimum architectture, powerful enough to model the functions and keep the errors below 0.05%, number of nodes in the hidden layer(s) were varied.

Model validation

Model validation process provides a reasonable mean for understanding and approach to molecular design and action mechanism analysis. Applied primary validation methods involved the use of random number generators as a part of the learning process. In order to analyze the influence of inherent randomness on the prediction stability, ten repetitions of the complete validation process with different random seeds were made in all cases (Y-scrambling test). Accuracy has been selected for evaluation of predictive performance of a single validation process, while a correlation coefficient (CO) of accuracies obtained across ten repetitions was established as a measure of learning stability. Also cross-validation was applied by leave-n-out method.

Results

The results of this paper are based on investigation and analysis of collected or calculated data of several coumarin structural descriptors. The artificial neural network system was performed to build a powerful model for prediction of lead and template antifungal coumarins. Table 2 shows results of the various architectures of the neural network system. The numbers of hidden layer nodes were varied according to different node numbers and layers. One of the best architectures, considering the correlation behavior and output cycles of calculation was 11-8-4-1. The importance of an input descriptor is determined by the sum of the absolute values of the weights of all the outgoing architecture connections from the input node to the next layer. Some factors, such as surface tension, percent of oxygen, index of refraction, and percentage H have appeared among the most important factors. The least important descriptor was determined as the density. A range of predicted activity varied from 125.6796 to 3774.3753. The correlation coefficients between the experimental and the predicted MIC value pertaining to all the coumarins was 0.984 (Figure 1).
Table 2

Various architecture of neural network and their criteria used in this study

ArchitectureLayer numberNumber of training cyclesAverage error for training setAverage error for validation set
11-4-1 13630.0099870.008889
11-7-1 12580.0099410.009839
11-14-1 13200.0099980.009787
11-16-1 13270.0099810.008973
11-5-4-1 26150.0099870.009876
11-8-4-1 23330.0099240.00459
11-8-7-1 24350.0099320.009567
11-8-12-1 23500.009960.00657
11-4-5-4-1 312590.099990.07789
11-8-4-4-1 315540.099990.09054
11-12-5-4-1 311980.088120.08639
11-12-7-3-1 39470.068120.07687
Figure 1

Plot of predicted activity versus the observed one

Plot of predicted activity versus the observed one Various architecture of neural network and their criteria used in this study Compounds 67, 15, and 5 corresponded to the highest error that was generated during the training cycles. Y-Randomization result showed that the classification accuracy for randomized data sets was significantly lower than for the original data sets (data not shown) and hence we concluded that there is no evidence of over-fitting in our models. Cross validation is done by leave-some-out (some= 4) validating method. Validation showed that average of absolute errors was 0.379.

Discussion

The artificial neural networks (ANNs) have become an important modeling technique in numerous areas of chemistry and pharmacy (30). The mathematical adaptability of ANN commends them as a powerful tool for pattern classification and building predictive models. A particular advantage of ANNs is their inherent ability to incorporate nonlinear dependencies between the dependent and independent variables without using an explicit mathematical function. This study presents an approach to correlate the antifungal activity score data for a data set of drug-like molecules with the structural descriptors. In this study a nonlinear modeling technique of artificial neural network (ANN) with back propagation learning algorithm and sigmoid activation function was used. In this work, a MLP network (29) was developed and used to obtain a nonlinear SAR model. Topologically, it consisted of input, hidden, and output layers of neurons or units connected by weights. Each input layer node corresponded to a single independent variable (physicochemical descriptor) with the exception of the bias node. Similarly, each output layer node corresponded to a different dependent variable (property under investigation). In this study, all descriptors were derived solely from molecular structures which did not require experimental data or expensive theoretical calculations (to be obtained). The ANN model was trained only on the training set since the validation set was used to monitor the external prediction error and thus to avoid overtraining. Among the 11 architectures constructed, the best ANN architecture we found was 11–8–4–1. That is, in the first layer eleven inputs comprised of eleven input descriptors, hidden layer comprised of seven neurons, and the last output layer comprised of one neuron for the property modeled. The statistical criteria obtained for the ANN model are shown in Table 2. As it can be seen from this table the error for the training set is quite low. In addition, the errors for the validation set are also low showing the good prediction ability. The range of observed and predicted data criterion is very close to each other, that is, the overall prediction is close to experimental. Also, from these result we can conclude that the ANN model satisfactorily predicts the classification nature of the experimental data. Here, we should take into account that a large number of molecular descriptors are usually used in SAR methods. The specific biological action of drugs is frequently described by hydrophobic, electronic, steric and physicochemical properties. Physicochemical properties characterize the pharmacodynamic properties in the ligand– receptor interaction. They define the ability of the drug to join to the receptor. The results of this ANN-based study indicate that surface tension is one of the most important factors in coumarin bioactivity. Surface tension of the molecule causes it to creep around the membrane, leading to formation of a layer of loaded molecules at the cell membrane quickly (31). This finding could describe how the LogP is the main sensitivity descriptor of the trained network. Sensitivity analysis is a measure of how the outputs change when the inputs are changed. Result of this paper could help to predict bioactivity of new coumarins.
  21 in total

Review 1.  Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research.

Authors:  S Agatonovic-Kustrin; R Beresford
Journal:  J Pharm Biomed Anal       Date:  2000-06       Impact factor: 3.935

2.  Antimicrobial activity of two novel coumarin derivatives: 3-cyanonaphtho[1,2-(e)] pyran-2-one and 3-cyanocoumarin.

Authors:  A A Zaha; A Hazem
Journal:  New Microbiol       Date:  2002-04       Impact factor: 2.479

3.  Antifungal, antioxidant and larvicidal activities of compounds isolated from the heartwood of Mansonia gagei.

Authors:  P Tiew; J R Ioset; U Kokpol; W Chavasiri; K Hostettmann
Journal:  Phytother Res       Date:  2003-02       Impact factor: 5.878

4.  Random forest: a classification and regression tool for compound classification and QSAR modeling.

Authors:  Vladimir Svetnik; Andy Liaw; Christopher Tong; J Christopher Culberson; Robert P Sheridan; Bradley P Feuston
Journal:  J Chem Inf Comput Sci       Date:  2003 Nov-Dec

Review 5.  Application of artificial neural networks in the design of controlled release drug delivery systems.

Authors:  Yichun Sun; Yingxu Peng; Yixin Chen; Atul J Shukla
Journal:  Adv Drug Deliv Rev       Date:  2003-09-12       Impact factor: 15.470

6.  Thermoanalytical, chemical and principal component analysis of plant drugs.

Authors:  Marek Wesołowski; Pawel Konieczyński
Journal:  Int J Pharm       Date:  2003-08-27       Impact factor: 5.875

Review 7.  Cheminformatics in anti-infective agents discovery.

Authors:  S Sardari; M Dezfulian
Journal:  Mini Rev Med Chem       Date:  2007-02       Impact factor: 3.862

8.  Application quantum and physico chemical molecular descriptors utilizing principal components to study mode of anticoagulant activity of pyridyl chromen-2-one derivatives.

Authors:  M S Bhatia; K B Ingale; P B Choudhari; N M Bhatia; R L Sawant
Journal:  Bioorg Med Chem       Date:  2008-12-29       Impact factor: 3.641

9.  Antimicrobial activity of trifluoromethyl ketones and their synergism with promethazine.

Authors:  M Kawase; N Motohashi; H Sakagami; T Kanamoto; H Nakashima; L Ferenczy; K Wolfard; C Miskolci; J Molnár
Journal:  Int J Antimicrob Agents       Date:  2001-08       Impact factor: 5.283

Review 10.  Clinical application of artificial neural network (ANN) modeling to predict pharmacokinetic parameters of severely ill patients.

Authors:  Shigeo Yamamura
Journal:  Adv Drug Deliv Rev       Date:  2003-09-12       Impact factor: 15.470

View more
  3 in total

1.  Modeling of thermodynamic and physico-chemical properties of coumarins bioactivity against Candida albicans using a Levenberg-Marquardt neural network.

Authors:  Seyyedeh Soghra Mousavi; Hanieh Bokharaie; Shadi Rahimi; Sima Azadi Soror; Mehrdad Hamidi
Journal:  Adv Appl Bioinform Chem       Date:  2010-08-13

2.  Synthesis and biological evaluation of propargyl acetate derivatives as anti-mycobacterial agents.

Authors:  Parisa Azerang; Ali Hossein Rezayan; Soroush Sardari; Farzad Kobarfard; Mitra Bayat; Kimia Tabib
Journal:  Daru       Date:  2012-12-11       Impact factor: 3.117

3.  Synthesis of Novel Fluorene Bisamide Derivatives via Ugi Reaction and Evaluation their Biological Activity against Mycobacterium species.

Authors:  Ali Hossein Rezayan; Safoura Hariri; Parisa Azerang; Ghazaleh Ghavami; Isabel Portugal; Soroush Sardari
Journal:  Iran J Pharm Res       Date:  2017       Impact factor: 1.696

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.