Literature DB >> 17048468

Significance of gene ranking for classification of microarray samples.

Chaolin Zhang1, Xuesong Lu, Xuegong Zhang.   

Abstract

Many methods for classification and gene selection with microarray data have been developed. These methods usually give a ranking of genes. Evaluating the statistical significance of the gene ranking is important for understanding the results and for further biological investigations, but this question has not been well addressed for machine learning methods in existing works. Here, we address this problem by formulating it in the framework of hypothesis testing and propose a solution based on resampling. The proposed r-test methods convert gene ranking results into position p-values to evaluate the significance of genes. The methods are tested on three real microarray data sets and three simulation data sets with support vector machines as the method of classification and gene selection. The obtained position p-values help to determine the number of genes to be selected and enable scientists to analyze selection results by sophisticated multivariate methods under the same statistical inference paradigm as for simple hypothesis testing methods.

Mesh:

Year:  2006        PMID: 17048468     DOI: 10.1109/TCBB.2006.42

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  9 in total

1.  Biomarker discovery using statistically significant gene sets.

Authors:  Hoon Kim; John Watkinson; Dimitris Anastassiou
Journal:  J Comput Biol       Date:  2011-04-01       Impact factor: 1.479

2.  Enhancement of plant metabolite fingerprinting by machine learning.

Authors:  Ian M Scott; Cornelia P Vermeer; Maria Liakata; Delia I Corol; Jane L Ward; Wanchang Lin; Helen E Johnson; Lynne Whitehead; Baldeep Kular; John M Baker; Sean Walsh; Anuja Dave; Tony R Larson; Ian A Graham; Trevor L Wang; Ross D King; John Draper; Michael H Beale
Journal:  Plant Physiol       Date:  2010-06-21       Impact factor: 8.340

3.  Optimized ranking and selection methods for feature selection with application in microarray experiments.

Authors:  Xinping Cui; Haibing Zhao; Jason Wilson
Journal:  J Biopharm Stat       Date:  2010-03       Impact factor: 1.051

4.  Using rule-based machine learning for candidate disease gene prioritization and sample classification of cancer gene expression data.

Authors:  Enrico Glaab; Jaume Bacardit; Jonathan M Garibaldi; Natalio Krasnogor
Journal:  PLoS One       Date:  2012-07-11       Impact factor: 3.240

5.  Semantically linking and browsing PubMed abstracts with gene ontology.

Authors:  Bhanu C Vanteru; Jahangheer S Shaik; Mohammed Yeasin
Journal:  BMC Genomics       Date:  2008       Impact factor: 3.969

6.  Large-scale RNA-Seq Transcriptome Analysis of 4043 Cancers and 548 Normal Tissue Controls across 12 TCGA Cancer Types.

Authors:  Li Peng; Xiu Wu Bian; Di Kang Li; Chuan Xu; Guang Ming Wang; Qing You Xia; Qing Xiong
Journal:  Sci Rep       Date:  2015-08-21       Impact factor: 4.379

7.  Reducing the complexity of complex gene coexpression networks by coupling multiweighted labeling with topological analysis.

Authors:  Alfredo Benso; Paolo Cornale; Stefano Di Carlo; Gianfranco Politano; Alessandro Savino
Journal:  Biomed Res Int       Date:  2013-10-07       Impact factor: 3.411

8.  The identification of informative genes from multiple datasets with increasing complexity.

Authors:  S Yahya Anvar; Peter A C 't Hoen; Allan Tucker
Journal:  BMC Bioinformatics       Date:  2010-01-15       Impact factor: 3.169

9.  A unified framework for finding differentially expressed genes from microarray experiments.

Authors:  Jahangheer S Shaik; Mohammed Yeasin
Journal:  BMC Bioinformatics       Date:  2007-09-18       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.