Literature DB >> 15729848

Genetic algorithms and self-organizing maps: a powerful combination for modeling complex QSAR and QSPR problems.

Ersin Bayram1, Peter Santago, Rebecca Harris, Yun-De Xiao, Aaron J Clauset, Jeffrey D Schmitt.   

Abstract

Modeling non-linear descriptor-target activity/property relationships with many dependent descriptors has been a long-standing challenge in the design of biologically active molecules. In an effort to address this problem, we couple the supervised self-organizing map with the genetic algorithm. Although self-organizing maps are non-linear and topology-preserving techniques that hold great potential for modeling and decoding relationships, the large number of descriptors in typical quantitative structure-activity relationship or quantitative structure-property relationship analysis may lead to spurious correlation(s) and/or difficulty in the interpretation of resulting models. To reduce the number of descriptors to a manageable size, we chose the genetic algorithm for descriptor selection because of its flexibility and efficiency in solving complex problems. Feasibility studies were conducted using six different datasets, of moderate-to-large size and moderate-to-great diversity; each with a different biological endpoint. Since favorable training set statistics do not necessarily indicate a highly predictive model, the quality of all models was confirmed by withholding a portion of each dataset for external validation. We also address the variability introduced onto modeling through dataset partitioning and through the stochastic nature of the combined genetic algorithm supervised self-organizing map method using the z-score and other tests. Experiments show that the combined method provides comparable accuracy to the supervised self-organizing map alone, but using significantly fewer descriptors in the models generated. We observed consistently better results than partial least squares models. We conclude that the combination of genetic algorithms with the supervised self-organizing map shows great potential as a quantitative structure-activity/property relationship modeling tool.

Entities:  

Mesh:

Year:  2004        PMID: 15729848     DOI: 10.1007/s10822-004-5321-2

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  9 in total

1.  Nonlinear mapping networks.

Authors:  D K Agrafiotis; V S Lobanov
Journal:  J Chem Inf Comput Sci       Date:  2000 Nov-Dec

2.  Application of BCUT metrics and genetic algorithm in binary QSAR analysis.

Authors:  H Gao
Journal:  J Chem Inf Comput Sci       Date:  2001 Mar-Apr

Review 3.  Exploring the nature of molecular recognition in nicotinic acetylcholine receptors.

Authors:  J D Schmitt
Journal:  Curr Med Chem       Date:  2000-08       Impact factor: 4.530

4.  Database mining applied to central nervous system (CNS) activity.

Authors:  M Pintore; O Taboureau; F Ros; J R Chrétien
Journal:  Eur J Med Chem       Date:  2001-04       Impact factor: 6.514

5.  Self-organizing neural network for modeling 3D QSAR of colchicinoids.

Authors:  J Polański
Journal:  Acta Biochim Pol       Date:  2000       Impact factor: 2.149

6.  Formulation of de novo substituent constants in correlation analysis: inhibition of dihydrofolate reductase by 2,4-diamino-5-(3,4-dichlorophenyl)-6-substituted pyrimidines.

Authors:  C Hansch; C Silipo; E E Steller
Journal:  J Pharm Sci       Date:  1975-07       Impact factor: 3.534

7.  Rational selection of training and test sets for the development of validated QSAR models.

Authors:  Alexander Golbraikh; Min Shen; Zhiyan Xiao; Yun-De Xiao; Kuo-Hsiung Lee; Alexander Tropsha
Journal:  J Comput Aided Mol Des       Date:  2003 Feb-Apr       Impact factor: 3.686

8.  Applications of neural networks in quantitative structure-activity relationships of dihydrofolate reductase inhibitors.

Authors:  T A Andrea; H Kalayeh
Journal:  J Med Chem       Date:  1991-09       Impact factor: 7.446

9.  Evolutionary optimization in quantitative structure-activity relationship: an application of genetic neural networks.

Authors:  S S So; M Karplus
Journal:  J Med Chem       Date:  1996-03-29       Impact factor: 7.446

  9 in total
  2 in total

1.  Molecular modeling of mono- and bis-quaternary ammonium salts as ligands at the alpha4beta2 nicotinic acetylcholine receptor subtype using nonlinear techniques.

Authors:  Joshua T Ayers; Aaron Clauset; Jeffrey D Schmitt; Linda P Dwoskin; Peter A Crooks
Journal:  AAPS J       Date:  2005-10-25       Impact factor: 4.009

2.  Random forests for feature selection in QSPR Models - an application for predicting standard enthalpy of formation of hydrocarbons.

Authors:  Ana L Teixeira; João P Leal; Andre O Falcao
Journal:  J Cheminform       Date:  2013-02-11       Impact factor: 5.514

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.