Literature DB >> 23424126

Spectral clustering strategies for heterogeneous disease expression data.

Grace T Huang1, Kathryn I Cunningham, Panayiotis V Benos, Chakra S Chennubhotla.   

Abstract

Clustering of gene expression data simplifies subsequent data analyses and forms the basis of numerous approaches for biomarker identification, prediction of clinical outcome, and personalized therapeutic strategies. The most popular clustering methods such as K-means and hierarchical clustering are intuitive and easy to use, but they require arbitrary choices on their various parameters (number of clusters for K-means, and a threshold to cut the tree for hierarchical clustering). Human disease gene expression data are in general more difficult to cluster efficiently due to background (genotype) heterogeneity, disease stage and progression differences and disease subtyping; all of which cause gene expression datasets to be more heterogeneous. Spectral clustering has been recently introduced in many fields as a promising alternative to standard clustering methods. The idea is that pairwise comparisons can help reveal global features through the eigen techniques. In this paper, we developed a new recursive K-means spectral clustering method (ReKS) for disease gene expression data. We benchmarked ReKS on three large-scale cancer datasets and we compared it to different clustering methods with respect to execution time, background models and external biological knowledge. We found ReKS to be superior to the hierarchical methods and equally good to K-means, but much faster than them and without the requirement for a priori knowledge of K. Overall, ReKS offers an attractive alternative for efficient clustering of human disease data.

Entities:  

Mesh:

Year:  2013        PMID: 23424126      PMCID: PMC4449339     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  14 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Systematic determination of genetic network architecture.

Authors:  S Tavazoie; J D Hughes; M J Campbell; R J Cho; G M Church
Journal:  Nat Genet       Date:  1999-07       Impact factor: 38.330

3.  Parallel spectral clustering in distributed systems.

Authors:  Wen-Yen Chen; Yangqiu Song; Hongjie Bai; Chih-Jen Lin; Edward Y Chang
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2011-03       Impact factor: 6.226

4.  Tight clustering: a resampling-based approach for identifying stable and tight patterns in data.

Authors:  George C Tseng; Wing H Wong
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

5.  Improved scoring of functional groups from gene expression data by decorrelating GO graph structure.

Authors:  Adrian Alexa; Jörg Rahnenführer; Thomas Lengauer
Journal:  Bioinformatics       Date:  2006-04-10       Impact factor: 6.937

6.  Clustering by passing messages between data points.

Authors:  Brendan J Frey; Delbert Dueck
Journal:  Science       Date:  2007-01-11       Impact factor: 47.728

7.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

8.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

9.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps.

Authors:  R R Coifman; S Lafon; A B Lee; M Maggioni; B Nadler; F Warner; S W Zucker
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-17       Impact factor: 12.779

10.  Partition decoupling for multi-gene analysis of gene expression profiling data.

Authors:  Rosemary Braun; Gregory Leibon; Scott Pauls; Daniel Rockmore
Journal:  BMC Bioinformatics       Date:  2011-12-30       Impact factor: 3.169

View more
  3 in total

1.  T-ReCS: stable selection of dynamically formed groups of features with application to prediction of clinical outcomes.

Authors:  Grace T Huang; Ioannis Tsamardinos; Vineet Raghu; Naftali Kaminski; Panayiotis V Benos
Journal:  Pac Symp Biocomput       Date:  2015

2.  MicroRNA expression profiling predicts clinical outcome of carboplatin/paclitaxel-based therapy in metastatic melanoma treated on the ECOG-ACRIN trial E2603.

Authors:  Liza C Villaruz; Grace Huang; Marjorie Romkes; John M Kirkwood; Shama C Buch; Tomoko Nukui; Keith T Flaherty; Sandra J Lee; Melissa A Wilson; Katherine L Nathanson; Panayiotis V Benos; Hussein A Tawbi
Journal:  Clin Epigenetics       Date:  2015-06-04       Impact factor: 6.551

3.  Identifying Windows of Susceptibility by Temporal Gene Analysis.

Authors:  Kristin P Bennett; Elisabeth M Brown; Hannah De Los Santos; Matthew Poegel; Thomas R Kiehl; Evan W Patton; Spencer Norris; Sally Temple; John Erickson; Deborah L McGuinness; Nathan C Boles
Journal:  Sci Rep       Date:  2019-02-26       Impact factor: 4.996

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.