| Literature DB >> 26600858 |
Mehdi Ahmadi1, Mohsen Shahlaei2.
Abstract
P2X7 antagonist activity for a set of 49 molecules of the P2X7 receptor antagonists, derivatives of purine, was modeled with the aid of chemometric and artificial intelligence techniques. The activity of these compounds was estimated by means of combination of principal component analysis (PCA), as a well-known data reduction method, genetic algorithm (GA), as a variable selection technique, and artificial neural network (ANN), as a non-linear modeling method. First, a linear regression, combined with PCA, (principal component regression) was operated to model the structure-activity relationships, and afterwards a combination of PCA and ANN algorithm was employed to accurately predict the biological activity of the P2X7 antagonist. PCA preserves as much of the information as possible contained in the original data set. Seven most important PC's to the studied activity were selected as the inputs of ANN box by an efficient variable selection method, GA. The best computational neural network model was a fully-connected, feed-forward model with 7-7-1 architecture. The developed ANN model was fully evaluated by different validation techniques, including internal and external validation, and chemical applicability domain. All validations showed that the constructed quantitative structure-activity relationship model suggested is robust and satisfactory.Entities:
Keywords: Artificial neural network (ANN); Genetic algorithm (GA); P2X7 receptor antagonists; Principal component analysis (PCA); QSAR
Year: 2015 PMID: 26600858 PMCID: PMC4623620
Source DB: PubMed Journal: Res Pharm Sci ISSN: 1735-5362
Main structure and details of the compounds used in this study.
Fig. 1The basic design of the algorithm combined genetic algorithms and artificial neural network used in this study.
The result of principal component analysis on the total descriptors.
The experimental pIC50 and the predicted values of the training set and test set.
Fig. 2pIC50 estimated by modeling versus experimental values for training and test sets A; PCR, B; GA-PC-ANN.
Statistical parameters obtained for the QSAR models.
Fig. 3Plot of MSE for training sets versus the number of nodes in hidden layer.
Fig. 4Applicability domains of developed A; PCR and B; GA-PC-ANN.