Literature DB >> 12671006

Spectral biclustering of microarray data: coclustering genes and conditions.

Yuval Kluger1, Ronen Basri, Joseph T Chang, Mark Gerstein.   

Abstract

Global analyses of RNA expression levels are useful for classifying genes and overall phenotypes. Often these classification problems are linked, and one wants to find "marker genes" that are differentially expressed in particular sets of "conditions." We have developed a method that simultaneously clusters genes and conditions, finding distinctive "checkerboard" patterns in matrices of gene expression data, if they exist. In a cancer context, these checkerboards correspond to genes that are markedly up- or downregulated in patients with particular types of tumors. Our method, spectral biclustering, is based on the observation that checkerboard structures in matrices of expression data can be found in eigenvectors corresponding to characteristic expression patterns across genes or conditions. In addition, these eigenvectors can be readily identified by commonly used linear algebra approaches, in particular the singular value decomposition (SVD), coupled with closely integrated normalization steps. We present a number of variants of the approach, depending on whether the normalization over genes and conditions is done independently or in a coupled fashion. We then apply spectral biclustering to a selection of publicly available cancer expression data sets, and examine the degree to which the approach is able to identify checkerboard structures. Furthermore, we compare the performance of our biclustering methods against a number of reasonable benchmarks (e.g., direct application of SVD or normalized cuts to raw data).

Entities:  

Mesh:

Year:  2003        PMID: 12671006      PMCID: PMC430175          DOI: 10.1101/gr.648603

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  23 in total

1.  'Gene shaving' as a method for identifying distinct sets of genes with similar expression patterns.

Authors:  T Hastie; R Tibshirani; M B Eisen; A Alizadeh; R Levy; L Staudt; W C Chan; D Botstein; P Brown
Journal:  Genome Biol       Date:  2000-08-04       Impact factor: 13.583

2.  Coupled two-way clustering analysis of gene microarray data.

Authors:  G Getz; E Levine; E Domany
Journal:  Proc Natl Acad Sci U S A       Date:  2000-10-24       Impact factor: 11.205

3.  Using Bayesian networks to analyze expression data.

Authors:  N Friedman; M Linial; I Nachman; D Pe'er
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

4.  Fundamental patterns underlying gene expression profiles: simplicity from complexity.

Authors:  N S Holter; M Mitra; A Maritan; M Cieplak; J R Banavar; N V Fedoroff
Journal:  Proc Natl Acad Sci U S A       Date:  2000-07-18       Impact factor: 11.205

5.  Bootstrapping cluster analysis: assessing the reliability of conclusions from microarray experiments.

Authors:  M K Kerr; G A Churchill
Journal:  Proc Natl Acad Sci U S A       Date:  2001-07-24       Impact factor: 11.205

6.  CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts.

Authors:  E P Xing; R M Karp
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

7.  Singular value decomposition for genome-wide expression data processing and modeling.

Authors:  O Alter; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

8.  Analysis of gene expression microarrays for phenotype classification.

Authors:  A Califano; G Stolovitzky; Y Tu
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000

9.  Genomic and proteomic analysis of the myeloid differentiation program.

Authors:  Z Lian; L Wang; S Yamaga; W Bonds; Y Beazer-Barclay; Y Kluger; M Gerstein; P E Newburger; N Berliner; S M Weissman
Journal:  Blood       Date:  2001-08-01       Impact factor: 22.113

10.  Principal components analysis to summarize microarray experiments: application to sporulation time series.

Authors:  S Raychaudhuri; J M Stuart; R B Altman
Journal:  Pac Symp Biocomput       Date:  2000
View more
  87 in total

1.  Lineage specificity of gene expression patterns.

Authors:  Yuval Kluger; David P Tuck; Joseph T Chang; Yasuhiro Nakayama; Ranjana Poddar; Naohiko Kohya; Zheng Lian; Abdelhakim Ben Nasr; H Ruth Halaban; Diane S Krause; Xueqing Zhang; Peter E Newburger; Sherman M Weissman
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-19       Impact factor: 11.205

2.  Biclustering of linear patterns in gene expression data.

Authors:  Qinghui Gao; Christine Ho; Yingmin Jia; Jingyi Jessica Li; Haiyan Huang
Journal:  J Comput Biol       Date:  2012-06       Impact factor: 1.479

3.  A comparative analysis of biclustering algorithms for gene expression data.

Authors:  Kemal Eren; Mehmet Deveci; Onur Küçüktunç; Ümit V Çatalyürek
Journal:  Brief Bioinform       Date:  2012-07-06       Impact factor: 11.622

Review 4.  Advantages and limitations of current network inference methods.

Authors:  Riet De Smet; Kathleen Marchal
Journal:  Nat Rev Microbiol       Date:  2010-08-31       Impact factor: 60.633

5.  Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering.

Authors:  Peng Sun; Nora K Speicher; Richard Röttger; Jiong Guo; Jan Baumbach
Journal:  Nucleic Acids Res       Date:  2014-03-20       Impact factor: 16.971

6.  Model validation for gene selection and regulation maps.

Authors:  Enrico Capobianco
Journal:  Funct Integr Genomics       Date:  2007-12-07       Impact factor: 3.410

7.  An efficient voting algorithm for finding additive biclusters with random background.

Authors:  Jing Xiao; Lusheng Wang; Xiaowen Liu; Tao Jiang
Journal:  J Comput Biol       Date:  2008-12       Impact factor: 1.479

8.  NETWORKS, BIOLOGY AND SYSTEMS ENGINEERING: A CASE STUDY IN INFLAMMATION.

Authors:  P T Foteinou; E Yang; I P Androulakis
Journal:  Comput Chem Eng       Date:  2009-12-10       Impact factor: 3.845

9.  Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Authors:  Saurav Mallik; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

10.  Transcription factor network reconstruction using the living cell array.

Authors:  Eric Yang; Martin L Yarmush; Ioannis P Androulakis
Journal:  J Theor Biol       Date:  2008-10-22       Impact factor: 2.691

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.