Literature DB >> 15680584

Gene selection from microarray data for cancer classification--a machine learning approach.

Yu Wang1, Igor V Tetko, Mark A Hall, Eibe Frank, Axel Facius, Klaus F X Mayer, Hans W Mewes.   

Abstract

A DNA microarray can track the expression levels of thousands of genes simultaneously. Previous research has demonstrated that this technology can be useful in the classification of cancers. Cancer microarray data normally contains a small number of samples which have a large number of gene expression levels as features. To select relevant genes involved in different types of cancer remains a challenge. In order to extract useful gene information from cancer microarray data and reduce dimensionality, feature selection algorithms were systematically investigated in this study. Using a correlation-based feature selector combined with machine learning algorithms such as decision trees, naïve Bayes and support vector machines, we show that classification performance at least as good as published results can be obtained on acute leukemia and diffuse large B-cell lymphoma microarray data sets. We also demonstrate that a combined use of different classification and feature selection approaches makes it possible to select relevant genes with high confidence. This is also the first paper which discusses both computational and biological evidence for the involvement of zyxin in leukaemogenesis.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15680584     DOI: 10.1016/j.compbiolchem.2004.11.001

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  59 in total

1.  Network-based Prediction of Cancer under Genetic Storm.

Authors:  Ahmet Ay; Dihong Gong; Tamer Kahveci
Journal:  Cancer Inform       Date:  2014-10-15

2.  Identification of tissue-specific tumor biomarker using different optimization algorithms.

Authors:  Shib Sankar Bhowmick; Debotosh Bhattacharjee; Luis Rato
Journal:  Genes Genomics       Date:  2018-12-08       Impact factor: 1.839

3.  Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2008       Impact factor: 1.125

4.  High-dimensional pharmacogenetic prediction of a continuous trait using machine learning techniques with application to warfarin dose prediction in African Americans.

Authors:  Erdal Cosgun; Nita A Limdi; Christine W Duarte
Journal:  Bioinformatics       Date:  2011-03-30       Impact factor: 6.937

5.  Biomarker discovery using statistically significant gene sets.

Authors:  Hoon Kim; John Watkinson; Dimitris Anastassiou
Journal:  J Comput Biol       Date:  2011-04-01       Impact factor: 1.479

6.  SCoRS--A Method Based on Stability for Feature Selection and Mapping inNeuroimaging [corrected].

Authors:  Jane M Rondina; Tim Hahn; Leticia de Oliveira; Andre F Marquand; Thomas Dresler; Thomas Leitner; Andreas J Fallgatter; John Shawe-Taylor; Janaina Mourao-Miranda
Journal:  IEEE Trans Med Imaging       Date:  2013-09-11       Impact factor: 10.048

7.  Accurate molecular classification of cancer using simple rules.

Authors:  Xiaosheng Wang; Osamu Gotoh
Journal:  BMC Med Genomics       Date:  2009-10-30       Impact factor: 3.063

8.  Identification of protein functions using a machine-learning approach based on sequence-derived properties.

Authors:  Bum Ju Lee; Moon Sun Shin; Young Joon Oh; Hae Seok Oh; Keun Ho Ryu
Journal:  Proteome Sci       Date:  2009-08-09       Impact factor: 2.480

9.  Uncovering metabolic pathways relevant to phenotypic traits of microbial genomes.

Authors:  Gabi Kastenmüller; Maria Elisabeth Schenk; Johann Gasteiger; Hans-Werner Mewes
Journal:  Genome Biol       Date:  2009-03-10       Impact factor: 13.583

10.  An integrated method for cancer classification and rule extraction from microarray data.

Authors:  Liang-Tsung Huang
Journal:  J Biomed Sci       Date:  2009-02-24       Impact factor: 8.410

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.