
Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method.

L Li, C R Weinberg, T A Darden, L G Pedersen.

Abstract

MOTIVATION: We recently introduced a multivariate approach that selects a subset of predictive genes jointly for sample classification based on expression data. We tested the algorithm on colon and leukemia data sets. As an extension to our earlier work, we systematically examine the sensitivity, reproducibility and stability of gene selection/sample classification to the choice of parameters of the algorithm.
METHODS: Our approach combines a Genetic Algorithm (GA) and the k-Nearest Neighbor (KNN) method to identify genes that can jointly discriminate between different classes of samples (e.g. normal versus tumor). The GA/KNN method is a stochastic supervised pattern recognition method. The genes identified are subsequently used to classify independent test set samples.
RESULTS: The GA/KNN method is capable of selecting a subset of predictive genes from a large noisy data set for sample classification. It is a multivariate approach that can capture the correlated structure in the data. We find that for a given data set gene selection is highly repeatable in independent runs using the GA/KNN method. In general, however, gene selection may be less robust than classification.
AVAILABILITY: The method is available at http://dir.niehs.nih.gov/microarray/datamining
CONTACT: LI3@niehs.nih.gov
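
To make the METHODS description concrete, the following is a minimal sketch of how a genetic algorithm can search for a small gene subset whose KNN classification accuracy is high. It is not the authors' implementation. The function names (`knn_loo_accuracy`, `ga_select`), the majority-vote rule, the leave-one-out fitness, the mutation-only reproduction with truncation selection, and all default parameter values are assumptions chosen for brevity. The abstract does not specify any of these details.

```python
import random
from collections import Counter


def knn_loo_accuracy(X, y, genes, k=3):
    """Leave-one-out KNN accuracy using only the columns listed in `genes`.

    Assumption: majority vote among the k nearest neighbours decides the class.
    """
    correct = 0
    for i, xi in enumerate(X):
        # Squared Euclidean distance to every other sample on the chosen genes.
        dists = sorted(
            (sum((xi[g] - xj[g]) ** 2 for g in genes), y[j])
            for j, xj in enumerate(X)
            if j != i
        )
        votes = Counter(label for _, label in dists[:k])
        if votes.most_common(1)[0][0] == y[i]:
            correct += 1
    return correct / len(X)


def ga_select(X, y, n_select=2, pop_size=20, generations=15, k=3, seed=0):
    """Search for `n_select` genes that jointly classify the samples well.

    Each chromosome is a list of distinct gene indices. Requires
    n_select < number of genes so that a mutation always has a gene to swap in.
    """
    rng = random.Random(seed)
    n_genes = len(X[0])

    def fitness(chrom):
        return knn_loo_accuracy(X, y, chrom, k)

    # Initial population: random gene subsets.
    pop = [rng.sample(range(n_genes), n_select) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        if fitness(pop[0]) == 1.0:
            break  # a perfectly discriminating subset was found
        # Truncation selection: the better half survives unchanged.
        survivors = pop[: pop_size // 2]
        children = []
        for parent in survivors:
            child = list(parent)
            # Mutation: replace one gene with a gene not already in the subset.
            pos = rng.randrange(n_select)
            child[pos] = rng.choice(
                [g for g in range(n_genes) if g not in child]
            )
            children.append(child)
        pop = survivors + children
    return max(pop, key=fitness)
```

Because the search is stochastic, different seeds can return different subsets. Repeating `ga_select` over many seeds and counting how often each gene is chosen is one simple way to examine the repeatability of gene selection that the abstract discusses.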

Year:  2001        PMID: 11751221     DOI: 10.1093/bioinformatics/17.12.1131

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


