Literature DB >> 12801871

Novel clustering algorithm for microarray expression data in a truncated SVD space.

David Horn1, Inon Axel.   

Abstract

MOTIVATION: This paper introduces the application of a novel clustering method to microarray expression data. Its first stage involves compression of dimensions that can be achieved by applying SVD to the gene-sample matrix in microarray problems. Thus the data (samples or genes) can be represented by vectors in a truncated space of low dimensionality, 4 and 5 in the examples studied here. We find it preferable to project all vectors onto the unit sphere before applying a clustering algorithm. The clustering algorithm used here is the quantum clustering method that has one free scale parameter. Although the method is not hierarchical, it can be modified to allow hierarchy in terms of this scale parameter.
RESULTS: We apply our method to three data sets. The results are very promising. On cancer cell data we obtain a dendrogram that reflects correct groupings of cells. In an AML/ALL data set we obtain very good clustering of samples into four classes of the data. Finally, in clustering of genes in yeast cell cycle data we obtain four groups in a problem that is estimated to contain five families. AVAILABILITY: Software is available as Matlab programs at http://neuron.tau.ac.il/~horn/QC.htm.

Entities:  

Mesh:

Year:  2003        PMID: 12801871     DOI: 10.1093/bioinformatics/btg053

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Disentangling information flow in the Ras-cAMP signaling network.

Authors:  Gregory W Carter; Steffen Rupp; Gerald R Fink; Timothy Galitski
Journal:  Genome Res       Date:  2006-03-13       Impact factor: 9.043

2.  Systematic gene function prediction from gene expression data by using a fuzzy nearest-cluster method.

Authors:  Xiao-Li Li; Yin-Chet Tan; See-Kiong Ng
Journal:  BMC Bioinformatics       Date:  2006-12-12       Impact factor: 3.169

3.  Conserved transcription factor binding sites of cancer markers derived from primary lung adenocarcinoma microarrays.

Authors:  Yee Leng Yap; David C L Lam; Girard Luc; Xue Wu Zhang; David Hernandez; Robin Gras; Elaine Wang; S W Chiu; Lap Ping Chung; W K Lam; David K Smith; John D Minna; Antoine Danchin; Maria P Wong
Journal:  Nucleic Acids Res       Date:  2005-01-14       Impact factor: 16.971

4.  Linear fuzzy gene network models obtained from microarray data by exhaustive search.

Authors:  Bahrad A Sokhansanj; J Patrick Fitch; Judy N Quong; Andrew A Quong
Journal:  BMC Bioinformatics       Date:  2004-08-10       Impact factor: 3.169

5.  Global considerations in hierarchical clustering reveal meaningful patterns in data.

Authors:  Roy Varshavsky; David Horn; Michal Linial
Journal:  PLoS One       Date:  2008-05-21       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.