Literature DB >> 12132894

Genetic Algorithm guided Selection: variable selection and subset selection.

Sung Jin Cho1, Mark A Hermsmeier.   

Abstract

A novel Genetic Algorithm guided Selection method, GAS, has been described. The method utilizes a simple encoding scheme which can represent both compounds and variables used to construct a QSAR/QSPR model. A genetic algorithm is then utilized to simultaneously optimize the encoded variables that include both descriptors and compound subsets. The GAS method generates multiple models each applying to a subset of the compounds. Typically the subsets represent clusters with different chemotypes. Also a procedure based on molecular similarity is presented to determine which model should be applied to a given test set compound. The variable selection method implemented in GAS has been tested and compared using the Selwood data set (n = 31 compounds; v = 53 descriptors). The results showed that the method is comparable to other published methods. The subset selection method implemented in GAS has been first tested using an artificial data set (n = 100 points; v = 1 descriptor) to examine its ability to subset data points and second applied to analyze the XLOGP data set (n = 1831 compounds; v = 126 descriptors). The method is able to correctly identify artificial data points belonging to various subsets. The analysis of the XLOGP data set shows that the subset selection method can be useful in improving a QSAR/QSPR model when the variable selection method fails.

Mesh:

Year:  2002        PMID: 12132894     DOI: 10.1021/ci010247v

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  20 in total

Review 1.  Genetic algorithm optimization in drug design QSAR: Bayesian-regularized genetic neural networks (BRGNN) and genetic algorithm-optimized support vectors machines (GA-SVM).

Authors:  Michael Fernandez; Julio Caballero; Leyden Fernandez; Akinori Sarai
Journal:  Mol Divers       Date:  2010-03-20       Impact factor: 2.943

2.  Comparison of commercially available genetic algorithms: gas as variable selection tool.

Authors:  Sabine Schefzick; Mary Bradley
Journal:  J Comput Aided Mol Des       Date:  2004 Jul-Sep       Impact factor: 3.686

3.  Utilizing high throughput screening data for predictive toxicology models: protocols and application to MLSCN assays.

Authors:  Rajarshi Guha; Stephan C Schürer
Journal:  J Comput Aided Mol Des       Date:  2008-02-19       Impact factor: 3.686

4.  Biomacromolecular quantitative structure-activity relationship (BioQSAR): a proof-of-concept study on the modeling, prediction and interpretation of protein-protein binding affinity.

Authors:  Peng Zhou; Congcong Wang; Feifei Tian; Yanrong Ren; Chao Yang; Jian Huang
Journal:  J Comput Aided Mol Des       Date:  2013-01-10       Impact factor: 3.686

5.  Determination of biochemical oxygen demand and dissolved oxygen for semi-arid river environment: application of soft computing models.

Authors:  Hai Tao; Aiman M Bobaker; Majeed Mattar Ramal; Zaher Mundher Yaseen; Md Shabbir Hossain; Shamsuddin Shahid
Journal:  Environ Sci Pollut Res Int       Date:  2018-11-12       Impact factor: 4.223

6.  Application of a genetic algorithm and an artificial neural network for global prediction of the toxicity of phenols to Tetrahymena pyriformis.

Authors:  Aziz Habibi-Yangjeh; Mohammad Danandeh-Jenagharad
Journal:  Monatsh Chem       Date:  2009-10-13       Impact factor: 1.451

7.  Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling.

Authors:  Xia Li; Shaoqi Rao; Yadong Wang; Binsheng Gong
Journal:  Nucleic Acids Res       Date:  2004-05-17       Impact factor: 16.971

8.  Prediction of survival in surgical unresectable lung cancer by artificial neural networks including genetic polymorphisms and clinical parameters.

Authors:  Te-Chun Hsia; Hung-Chih Chiang; David Chiang; Liang-Wen Hang; Fuu-Jen Tsai; Wen-Chi Chen
Journal:  J Clin Lab Anal       Date:  2003       Impact factor: 2.352

9.  A quantitative structure-property relationship for predicting drug solubility in PEG 400/water cosolvent systems.

Authors:  Erik Rytting; Kimberley A Lentz; Xue-Qing Chen; Feng Qian; Srini Venkatesh
Journal:  Pharm Res       Date:  2004-02       Impact factor: 4.200

10.  A robust hybrid approach based on estimation of distribution algorithm and support vector machine for hunting candidate disease genes.

Authors:  Li Li; Hongmei Chen; Chang Liu; Fang Wang; Fangfang Zhang; Lihua Bai; Yihan Chen; Luying Peng
Journal:  ScientificWorldJournal       Date:  2013-02-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.