Literature DB >> 11045809

Unsupervised forward selection: a method for eliminating redundant variables.

D C Whitley1, M G Ford, D J Livingstone.   

Abstract

An unsupervised learning method is proposed for variable selection and its performance assessed using three typical QSAR data sets. The aims of this procedure are to generate a subset of descriptors from any given data set in which the resultant variables are relevant, redundancy is eliminated, and multicollinearity is reduced. Continuum regression, an algorithm encompassing ordinary least squares regression, regression on principal components, and partial least squares regression, was used to construct models from the selected variables. The variable selection routine is shown to produce simple, robust, and easily interpreted models for the chosen data sets.

Mesh:

Substances:

Year:  2000        PMID: 11045809     DOI: 10.1021/ci000384c

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  26 in total

1.  Virtual generation of agents against Mycobacterium tuberculosis. A QSAR study.

Authors:  Emili Besalú; Robert Ponec; Jesus Vicente de Julián-Ortiz
Journal:  Mol Divers       Date:  2003       Impact factor: 2.943

2.  Variable selection and specification of robust QSAR models from multicollinear data: arylpiperazinyl derivatives with affinity and selectivity for alpha2-adrenoceptors.

Authors:  D W Salt; L Maccari; M Botta; M G Ford
Journal:  J Comput Aided Mol Des       Date:  2004 Jul-Sep       Impact factor: 3.686

3.  An automated PLS search for biologically relevant QSAR descriptors.

Authors:  Marius Olah; Cristian Bologa; Tudor I Oprea
Journal:  J Comput Aided Mol Des       Date:  2004 Jul-Sep       Impact factor: 3.686

4.  Comparison of commercially available genetic algorithms: gas as variable selection tool.

Authors:  Sabine Schefzick; Mary Bradley
Journal:  J Comput Aided Mol Des       Date:  2004 Jul-Sep       Impact factor: 3.686

5.  Virtual computational chemistry laboratory--design and description.

Authors:  Igor V Tetko; Johann Gasteiger; Roberto Todeschini; Andrea Mauri; David Livingstone; Peter Ertl; Vladimir A Palyulin; Eugene V Radchenko; Nikolay S Zefirov; Alexander S Makarenko; Vsevolod Yu Tanchuk; Volodymyr V Prokopenko
Journal:  J Comput Aided Mol Des       Date:  2005-06       Impact factor: 3.686

Review 6.  Molecular similarity and diversity in chemoinformatics: from theory to applications.

Authors:  Ana G Maldonado; J P Doucet; Michel Petitjean; Bo-Tao Fan
Journal:  Mol Divers       Date:  2006-02       Impact factor: 2.943

7.  Automatic QSAR modeling of ADME properties: blood-brain barrier penetration and aqueous solubility.

Authors:  Olga Obrezanova; Joelle M R Gola; Edmund J Champness; Matthew D Segall
Journal:  J Comput Aided Mol Des       Date:  2008-02-14       Impact factor: 3.686

8.  Shape signatures: new descriptors for predicting cardiotoxicity in silico.

Authors:  Dmitriy S Chekmarev; Vladyslav Kholodovych; Konstantin V Balakin; Yan Ivanenkov; Sean Ekins; William J Welsh
Journal:  Chem Res Toxicol       Date:  2008-05-08       Impact factor: 3.739

9.  New predictive models for blood-brain barrier permeability of drug-like molecules.

Authors:  Sandhya Kortagere; Dmitriy Chekmarev; William J Welsh; Sean Ekins
Journal:  Pharm Res       Date:  2008-04-16       Impact factor: 4.200

10.  Predicting inhibitors of acetylcholinesterase by regression and classification machine learning approaches with combinations of molecular descriptors.

Authors:  Dmitriy Chekmarev; Vladyslav Kholodovych; Sandhya Kortagere; William J Welsh; Sean Ekins
Journal:  Pharm Res       Date:  2009-07-15       Impact factor: 4.200

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.