Literature DB >> 11301299

Validating clustering for gene expression data.

K Y Yeung1, D R Haynor, W L Ruzzo.   

Abstract

MOTIVATION: Many clustering algorithms have been proposed for the analysis of gene expression data, but little guidance is available to help choose among them. We provide a systematic framework for assessing the results of clustering algorithms. Clustering algorithms attempt to partition the genes into groups exhibiting similar patterns of variation in expression level. Our methodology is to apply a clustering algorithm to the data from all but one experimental condition. The remaining condition is used to assess the predictive power of the resulting clusters-meaningful clusters should exhibit less variation in the remaining condition than clusters formed by chance.
RESULTS: We successfully applied our methodology to compare six clustering algorithms on four gene expression data sets. We found our quantitative measures of cluster quality to be positively correlated with external standards of cluster quality.

Mesh:

Year:  2001        PMID: 11301299     DOI: 10.1093/bioinformatics/17.4.309

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  133 in total

1.  GenePublisher: Automated analysis of DNA microarray data.

Authors:  Steen Knudsen; Christopher Workman; Thomas Sicheritz-Ponten; Carsten Friis
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Judging the quality of gene expression-based clustering methods using gene annotation.

Authors:  Francis D Gibbons; Frederick P Roth
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

3.  A gene recommender algorithm to identify coexpressed genes in C. elegans.

Authors:  Art B Owen; Josh Stuart; Kathy Mach; Anne M Villeneuve; Stuart Kim
Journal:  Genome Res       Date:  2003-08       Impact factor: 9.043

4.  EXCAVATOR: a computer program for efficiently mining gene expression data.

Authors:  Dong Xu; Victor Olman; Li Wang; Ying Xu
Journal:  Nucleic Acids Res       Date:  2003-10-01       Impact factor: 16.971

5.  Protein turnover quantification in a multilabeling approach: from data calculation to evaluation.

Authors:  Christian Trötschel; Stefan P Albaum; Daniel Wolff; Simon Schröder; Alexander Goesmann; Tim W Nattkemper; Ansgar Poetsch
Journal:  Mol Cell Proteomics       Date:  2012-04-06       Impact factor: 5.911

6.  Cell cycle progression in the pericycle is not sufficient for SOLITARY ROOT/IAA14-mediated lateral root initiation in Arabidopsis thaliana.

Authors:  Steffen Vanneste; Bert De Rybel; Gerrit T S Beemster; Karin Ljung; Ive De Smet; Gert Van Isterdael; Mirande Naudts; Ryusuke Iida; Wilhelm Gruissem; Masao Tasaka; Dirk Inzé; Hidehiro Fukaki; Tom Beeckman
Journal:  Plant Cell       Date:  2005-10-21       Impact factor: 11.277

Review 7.  Data clustering in life sciences.

Authors:  Ying Zhao; George Karypis
Journal:  Mol Biotechnol       Date:  2005-09       Impact factor: 2.695

8.  FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data.

Authors:  Limin Fu; Enzo Medico
Journal:  BMC Bioinformatics       Date:  2007-01-04       Impact factor: 3.169

9.  A kinetic analysis of the auxin transcriptome reveals cell wall remodeling proteins that modulate lateral root development in Arabidopsis.

Authors:  Daniel R Lewis; Amy L Olex; Stacey R Lundy; William H Turkett; Jacquelyn S Fetrow; Gloria K Muday
Journal:  Plant Cell       Date:  2013-09-17       Impact factor: 11.277

10.  Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes.

Authors:  Jeff W Chou; Tong Zhou; William K Kaufmann; Richard S Paules; Pierre R Bushel
Journal:  BMC Bioinformatics       Date:  2007-11-02       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.