Literature DB >> 27964704

Common SAR Derived from Linear and Non-linear QSAR Studies on AChE Inhibitors used in the Treatment of Alzheimer's Disease.

Babitha Pallikkara Pulikkal1,2, Sahila Mohammed Marunnan3, Srinivas Bandaru4, Mukesh Yadav4, Anuraj Nayarisseri4,5, Sivanpillai Sureshkumar1,6.   

Abstract

BACKGROUND: Deficits in cholinergic neurotransmission due to the degeneration of cholinergic neurons in the brain are believed to be one of the major causes of the memory impairments associated with AD. Targeting acetyl cholinesterase (AChE) surfaced as a potential therapeutic target in the treatment of Alzheimer's disease. The present study is pursued to develop quantitative structure activity relationship (QSAR) models to determine chemical descriptors responsible for AChE activity.
METHODS: Two different sets of AChE inhibitors, dataset-I (30 compounds) and dataset-II (20 compounds) were investigated through MLR aided linear and SVM aided non-linear QSAR models.
RESULTS: The obtained QSAR models were found statistically fit, stable and predictive on validation scales. These QSAR models were further investigated for their common structure-activity relationship in terms of overlapping molecular descriptors selection. Atomic mass weighted 3D Morse descriptors (MATS5m) and Radial Distribution Function (RDF045m) descriptors were found in common SAR for both the datasets. Electronegativity weighted (MATS5e, HATSe, and Mor17e) descriptors have also been identified in regulative roles towards endpoint values of dataset-I and dataset-II.
CONCLUSION: The common SAR identified in these linear and non-linear QSAR models could be utilized to design novel inhibitors of AChE with improved biological activity. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

Entities:  

Keywords:  AChE inhibitors; Alzheimer's disease; SAR.; descriptors sensitivity; linear and non-linear QSAR models

Mesh:

Substances:

Year:  2017        PMID: 27964704      PMCID: PMC5725541          DOI: 10.2174/1570159X14666161213142841

Source DB:  PubMed          Journal:  Curr Neuropharmacol        ISSN: 1570-159X            Impact factor:   7.363


INTRODUCTION

Alzheimer’s disease (AD) is one of the most dreaded forms of progressive neurodegenerative disorder of the modern era. It is associated with cognitive, functional and behavioral impairments which affect the brain regions 
that control thought, memory and language leading to a devastating status affecting predominantly elderly people. Alzheimer’s disease (AD) and other forms of dementia are growing public health problems in developing countries, whose aging population is increasing rapidly [1]. It is estimated that by the year 2020, approximately 70% of the world’s population aged 60 and above will suffer with Alzheimer’s in developing countries [2-5]. However, the precise cause of AD at the molecular grounds remains enigmatic, deficits in cholinergic neuro-
transmission due to the degeneration of cholinergic neurons in the brain are believed to be one of the major causes of the memory impairments associated with AD [6-8]. During the course of the normal aging process, concentrations of ACh tend to decrease, resulting in the sporadic lapses of short-term memory that elderly individuals tend to experience from time to time [9]. Since cholinergic transmission is prominently involved in human memory system, AChE is a potential pharmacological target for treatment of AD. Although there is no complete cure for Alzheimer's, there are two drugs presently marketed which inhibit acetyl cholinesterase - the enzyme which inactivates AChE at the synapse. Tetrohydroaminoacridine (THA), marketed in 1993, under the name Tacrine or Cognex, was the first Alzheimer's drug to be approved by the USFDA and has been shown to improve memory and language deficits in early stages of the disease [10]. A second drug, Donepezil was approved by the FDA in 1996, marketed under the name Aricept that targets only ACh in the brain and allows for the same improvements in cognitive function as THA [11]. Although these drugs helps to ease some of the symptoms of the disease, nevertheless have narrow therapeutic window and suffer side effects like lack of the drug's substrate specificity [12-14]. In the view of the given problem, the scientific team around the globe put forth active compounds targeting AchE anticipating for better management of Alzheimer’s disease. The present study focuses on building linear MLR and non-linear Gaussian kernel aided SVM QSAR models from two datasets which structurally belong to different scaffold and are derived from two different schemes of synthesis. The synthesis of derivatives was aimed to explore new candidates which can inhibit AChE more efficiently towards AD treatment. Few molecules among newly synthesized derivatives appeared promising when evaluated in vitro and effectively inhibited AChE. We aim to derive common structure-activity relationship from two different datasets through QSAR models which could assist to understand the overlapping structural features towards minimum chemical structure requirement to inhibit AChE in AD treatment.

Materials and Methods

Selection of Compound Dataset

The first dataset includes 1,2,3,4-tetrahydroisoquinoline derivative (30 molecules) synthesized by N. Toda and colleagues [15]. The second dataset involved Galantamine, Tacrine and 18 coumarintacrine hybrids synthesized and evaluated by Qi Sun et al. [16]. These two sets of compounds were subjected to MLR (Linear) and SVM (Non-linear) QSAR studies to derive individual QSAR models for each set and finally extract common chemical structure features responsible for structure-activity relationship (SAR) with reference to their action on AchE as a therapeutic target.

Calculation of Molecular Descriptor & Preparation of QSAR Model

The structures of compounds from each series were drawn and optimized in Chem Axon - Marvin Sketch version 5.6.0.2 [17]. The descriptors were calculated using an online web server E-Dragon (version 5.4) [18-20]. More than 2000 descriptors belonging to various classes were further analyzed by data analysis package of Graphpad Prism 6 for MLR analysis and for non-linear SVM analysis, GIST server was employed [21]. All the generated descriptors were further screened to establish valid models. In the process, constant and missing set of descriptors which were considered insignificant in statistical analysis were pruned (pruning parameters; standard deviation ≤0, and missing values greater than equal to 1) [22]. Pearson’s correlation coefficient was used to establish correlation between molecular descriptors and biological responses (endpoints). Odds of redundancy in regression models were thoroughly inspected and removed using correlation matrix [23]. Forward selection wrappers were used to generate MLR [24] and non-linear QSAR [22] models. The SAR models thus obtained were validated using internal validation statistics like cross validated regression analysis, mean and maximum absolute error calculations.

Results and Discussion

Statistical Fitness and Stability of QSAR Models

The present QSAR studies are an attempt to extract common SAR from two different sets of synthesized compounds tested against human AChE targeted in Alzheimer’s disease. Two statistical methods, Multiple Linear Regression (MLR) and Support Vector Machine (SVM), have been used to achieve QSAR models. Gaussian Kernel function was used to obtain non-linear QSAR models in SVM. Table provided below represents selected descriptors in multivariable models respectively for dataset-I and dataset-II. QSAR models obtained from forward selection method show uni-variable to tetra-variable models separately for dataset-I and dataset-II. The variable selection has been achieved under thumb-rule criteria which states that maximum descriptor variables cannot exceed 1/5 of total samples (compounds) in dataset. Dataset-I includes 30 molecules allows to exceed till hexa-variable model, though we decided to limit our models to tetra-variable model as it has gained necessary statistical fitness thereby. Wherein dataset-II includes 20 molecules and allows selecting tetra-variable models. For a comparative study of QSAR models in either of datasets, we limited variables to tetra-variable models. Statistical fitness of QSAR models obtained can be clearly understood from numerical magnitudes of R2, R2A, S.E. and R2CV (N-Fold) produced in Table . Dataset-I with tetra-variable models, (Linear (MLR) R2=0.8961 and Non-linear (SVM) R2=0.9307)) approves acceptable statistical fitness and stability (R2CV=0.8462, 0.734) of QSAR models. Similarly dataset-II with tetra-variable (Linear, R2=0.9057 and Non-linear, R2=0.9307) is found fit and stable (R2 CV =0.8401, 0.9061) in terms of statistical fitness. Maximum and mean absolute errors are diagnosed under the limits. Descriptors selected in forward selection method fall in similar group of indices which also points out similar structural properties underlying the structure-activity relationship.

Descriptor Sensitivity of Linear QSAR Models

The aim of including two datasets of different chemical origin but targeting the same therapeutic target (AChE) is to study and extract common structure-activity information underlying in QSAR models. In order to achieve comparative SAR, we decided to compare the descriptors sets derived from linear (MLR aided) QSAR models. The contribution of a descriptor (X) in regression equation can be understood when it is multiplied by its coefficient (M). Keeping intercept as constant, we obtained products of descriptors magnitudes multiplied with their coefficients e.g. M1X1, M2X2, M3X3 and M4X4. We have plotted a comparative graph taking Y (Activity) and M1X1, M2X2, M3X3 and M4X4. Graph (Fig. ) is comparative contribution of descriptors in regression equation. The magnitude scales on the graph shows activity magnitude scale with positive and negative contribution of molecular descriptors selected thereby in regression equation. For dataset-I, Fig. illustrates PW3 molecular descriptor as major contributor in regulation of pIC50 values and so as in linear QSAR models. Activity scales (pIC50) lies intermediate to PW3 magnitudes and magnitudes of other three descriptors (MATS8e, Mor17e, RDF045m). It can also be understood from the graph below that the increase in magnitudes of PW3 descriptor via change in chemical structures of compounds in dataset-I would bring favorable changes in pIC50 values. This fact can be utilized to design new molecules in the same series. Fig. illustrates comparative contribution of molecular descriptors in regulating activity (pIC50) for dataset-II. As a peculiar observation, the three molecular descriptors (HATS1, Mor04m and G1v) have contributed with their positive magnitudes and therefore increase in their magnitudes would bring an increase in pIC50 values. GAT54e contributed with negative magnitudes and therefore must possess an inverse impact of activity regulation. Descriptors sensitivity could be an important tool in at least linear QSAR models to evaluate and validate the corresponding contribution of molecular descriptors.

Predictability of QSAR Models

Linear and non-linear QSAR models achieved for dataset-I and dataset-II were found statistically fit and stable. To evaluate the predictive powers of linear (MLR) and non-linear (SVM), pIC50 values have been predicted and correlated with their corresponding experimental activities. Regression equations of tetra-variable models for dataset-I and dataset-II have been provided below. 89% confidence has been observed in dataset-I with small standard error values (0.15) wherein 90% confidence was observed in dataset-II with even smaller standard error (0.06). Moving onto F-stat values of dataset-I (F=53.89) and dataset-II (F=36.07) confirm the significance of statistical models from their application point of view. Graphical correlation of predictive powers of QSAR models has been provided below. A straight indication received from graph confirms that SVM aided non-linear QSAR models are statistically fit and more predictive in case of dataset-I and dataset-II. Fig. ( represents graphical correlation of experimental pIC50 and their predicted values from linear QSAR models for dataset-I derived from tetra-variable models. Fig. ( represents graphical respective values of pIC50 from SVM aided non-linear QSAR models. The similar aftermath in predictive powers can be observed for dataset-II from tetra-variable models. Fig. ( and ( presents graphical correlation of experimental and predicted pIC50 values of dataset-II.
Fig. (3)

(A) correlation of experimental and predicted pIC50 calculated from linear (MLR) aided tetra-variable model for dataset -I (B) correlation of experimental and predicted pIC50 calculated from non-linear (SVM) aided tetra-variable model for dataset-I.

Fig. (4)

(A) correlation of experimental and predicted pIC50 calculated from linear (MLR) aided tetra-variable model for dataset -II (B) correlation of experimental and predicted pIC50 calculated from non-linear (SVM) aided tetra-variable model for dataset-II.

Conclusion

Dataset-I Tetra-variable QSAR model pIC50 = - 9.795 + 54.420[PW3] + 2.066[MATS8e] + 0.625[Mor17e] - 0.121[RDF045m] N = 30 R2 = 0.89 S.E. = 0.15 F=53.89 Dataset-II Tetra-variable QSAR model pIC50 = - 4.4792 - 0.9200[GATS4e] - 0.2596 [Mor04m] + 28.2186 [HATS1v] + 9.1796[G1v] N = 20 R2 = 0.9057 S.E. = 0.0649 F=36.037 Comparative study of MLR aided linear and SVM aided non-linear QSAR models reveals the overlapping SAR required to inhibit AChE targeted in the treatment of Alzheimer’s disease. Structure-activity relationship derived from the two different datasets is found to be predominantly regulated by 3D Morse descriptors, which can be clearly observed in all the linear and non-linear QSAR models achieved in present study. Most of the selected 3D Morse descriptors are weighted by their atomic masses which confirms the fact that the distribution of atomic masses in three dimensional spaces is more crucial and not just as an additive structural property. Radial Distribution Function (RDFs) weighted by electronegativity of molecules has also been identified to regulate biological activity of AChE inhibitors. QSAR models have been found to be statistically fit and predictive. The descriptor sensitivity of linear QSAR models could be used to identify the dominant and recessive roles of molecular descriptors selected in multi-variable QSAR models. These overlapping structure-activity relationship can also be utilized to design and synthesize new set of inhibitors for AChE.

ETHICS APPROVAL AND CONSENT TO PARTICIPATE

Not applicable.

HUMAN AND ANIMAL RIGHTS

No Animals/Humans were used for studies that are base of this research.

CONSENT FOR PUBLICATION

Not applicable.
Table 1

Statistical fitness of QSAR models obtained for two datasets.

Dataset QSAR Model Descriptors Variables R2 Max. Abs. Error Mean Abs. Error. R2CV (N-FOLD)
Dataset-ILinear (MLR)MATS5m10.36680.67190.28250.3208
MATS5m, RDF045m20.65880.61990.21370.5802
MATS5m, RDF045m, PW330.80910.48440.1470.7476
MATS5m, RDF045m, PW3, Mor17e40.89610.27740.12210.8462
Non-linear(SVM)MATS5m10.53511.00650.19660.4388
MATS5m, MATS5e20.81230.52960.11640.651
MATS5m, MATS5e, HATSe30.85750.46380.10060.728
MATS5m, MATS5e, HATSe, SdCH2.2.Count40.93040.46400.07960.7354
Dataset-IILinear (MLR)HATS1v10.4060.25790.12520.261
HATS1v, Mor04m20.79940.16460.07010.6933
HATS1v, Mor04m, GATS4e30.86240.15340.05510.7998
HATS1v, Mor04m, GATS4e, G1v40.90570.12000.04380.8401
Non-linear(SVM)p2p2-1C10.43210.37030.09360.5197
p2p2-1C, Mor04m20.94050.09990.02920.882
p2p2-1C, Mor04m, BELv630.94660.09380.02830.9018
p2p2-1C, Mor04m, BELv6, RDF130m40.93700.11880.02450.9061
Table 2

Molecules of dataset-I and dataset-II with their respective experimental pIC50 values, predicted pIC50 values using tetra-variable models achieved in linear (MLR) and non-linear (SVM) QSAR models.

Dataset 1 (30 Compounds) Dataset 2 (20 Compounds)
I II III IV V VI VII VIII
18.0977.9047.9051a-1.537-1.501-1.635
27.7707.5487.3351b-1.646-1.584-1.645
36.9967.1397.0051c-1.596-1.583-1.596
46.6606.5136.5951d-1.554-1.630-1.689
57.2527.2717.0341e-1.845-1.927-1.850
67.9597.8027.9521f-1.881-1.833-1.880
77.4817.3387.4941g-1.223-1.255-1.225
87.7967.7777.8041h-1.490-1.496-1.488
97.9597.7997.9721i-1.386-1.506-1.422
107.3107.4137.3021j-1.479-1.466-1.433
117.4697.6927.4641k-1.749-1.648-1.671
127.6997.8007.6941l-1.775-1.785-1.777
136.7937.0707.0181m-1.625-1.577-1.627
146.5776.3976.6421n-1.742-1.760-1.775
157.0367.0087.1841o-1.705-1.759-1.717
166.8156.8716.8281p-1.820-1.716-1.757
177.1807.3227.1771q-1.960-1.954-1.958
186.9877.1006.9131r-1.893-1.912-1.895
196.8576.8436.876Galath-mine-1.553-1.574-1.553
206.8706.9796.883Tacrine-1.792-1.783-1.794
216.5456.7396.553----
226.8366.6916.849----
237.2607.2377.205----
246.6686.8126.818----
257.2157.2737.206----
266.9366.8256.927----
277.5697.5447.562----
287.2227.3707.217----
297.3677.2567.360----
306.8246.6706.821----

The legends used in table headings have been produced at the bottom of the 2.

I: Molecules of Dataset I (30 Molecules)

II: Experimental Activity (pIC50 values)

III: Predicted Activity (pIC50 values) using Tetra-Variable model derived from MLR aided Linear QSAR models.

IV Predicted Activity (pIC50 values) using Tetra-Variable model derived from SVM (Gaussian function) aided Non-Linear QSAR models.

V: Molecules of Dataset II (20 Molecules)

VI: Experimental Activity (pIC50 values)

VII: Predicted Activity (pIC50 values) using Tetra-Variable model derived from MLR aided Linear QSAR models.

VIII: Predicted Activity (pIC50 values) using Tetra-Variable model derived from SVM (Gaussian function) aided Non-Linear QSAR models.

  19 in total

Review 1.  Acetylcholine, aging, and Alzheimer's disease.

Authors:  J L Muir
Journal:  Pharmacol Biochem Behav       Date:  1997-04       Impact factor: 3.533

2.  Alzheimer's failure raises questions about disease-modifying strategies.

Authors:  Andy Extance
Journal:  Nat Rev Drug Discov       Date:  2010-10       Impact factor: 84.694

3.  Virtual computational chemistry laboratory--design and description.

Authors:  Igor V Tetko; Johann Gasteiger; Roberto Todeschini; Andrea Mauri; David Livingstone; Peter Ertl; Vladimir A Palyulin; Eugene V Radchenko; Nikolay S Zefirov; Alexander S Makarenko; Vsevolod Yu Tanchuk; Volodymyr V Prokopenko
Journal:  J Comput Aided Mol Des       Date:  2005-06       Impact factor: 3.686

4.  Tacrine in Alzheimer's disease.

Authors:  S A Eagger; R Levy; B J Sahakian
Journal:  Lancet       Date:  1991-04-27       Impact factor: 79.321

Review 5.  Education and the prevalence of dementia and Alzheimer's disease.

Authors:  R Katzman
Journal:  Neurology       Date:  1993-01       Impact factor: 9.910

6.  Incidence of Alzheimer's disease in a rural community in India: the Indo-US study.

Authors:  V Chandra; R Pandav; H H Dodge; J M Johnston; S H Belle; S T DeKosky; M Ganguli
Journal:  Neurology       Date:  2001-09-25       Impact factor: 9.910

7.  A 24-week, double-blind, placebo-controlled trial of donepezil in patients with Alzheimer's disease. Donepezil Study Group.

Authors:  S L Rogers; M R Farlow; R S Doody; R Mohs; L T Friedhoff
Journal:  Neurology       Date:  1998-01       Impact factor: 9.910

Review 8.  Current mathematical methods used in QSAR/QSPR studies.

Authors:  Peixun Liu; Wei Long
Journal:  Int J Mol Sci       Date:  2009-04-29       Impact factor: 6.208

9.  Memantine treatment in patients with moderate to severe Alzheimer disease already receiving donepezil: a randomized controlled trial.

Authors:  Pierre N Tariot; Martin R Farlow; George T Grossberg; Stephen M Graham; Scott McDonald; Ivan Gergel
Journal:  JAMA       Date:  2004-01-21       Impact factor: 56.272

10.  Hepatotoxic effects of tacrine administration in patients with Alzheimer's disease.

Authors:  P B Watkins; H J Zimmerman; M J Knapp; S I Gracon; K W Lewis
Journal:  JAMA       Date:  1994-04-06       Impact factor: 56.272

View more
  2 in total

1.  Identification of Potent VEGF Inhibitors for the Clinical Treatment of Glioblastoma, A Virtual Screening Approach.

Authors:  Mohini Yadav; Ravina Khandelwal; Urvy Mudgal; Sivaraj Srinitha; Natasha Khandekar; Anuraj Nayarisseri; Sugunakar Vuree; Sanjeev Kumar Singh
Journal:  Asian Pac J Cancer Prev       Date:  2019-09-01

2.  Editorial: Improving Neuropharmacology using Big Data, Machine Learning and Computational Algorithms.

Authors:  Khader Shameer; Anuraj Nayarisseri; Francisco Xavier Romero Duran; Humberto Gonzalez-Diaz
Journal:  Curr Neuropharmacol       Date:  2017-11-14       Impact factor: 7.363

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.