Complementary hierarchical clustering.

Gen Nowak, Robert Tibshirani.

Abstract

When applying hierarchical clustering algorithms to cluster patient samples from microarray data, the clustering patterns generated by most algorithms tend to be dominated by groups of highly differentially expressed genes that have closely related expression patterns. Sometimes, these genes may not be relevant to the biological process under study or their functions may already be known. The problem is that these genes can potentially drown out the effects of other genes that are relevant or have novel functions. We propose a procedure called complementary hierarchical clustering that is designed to uncover the structures arising from these novel genes that are not as highly expressed. Simulation studies show that the procedure is effective when applied to a variety of examples. We also define a concept called relative gene importance that can be used to identify the influential genes in a given clustering. Finally, we analyze a microarray data set from 295 breast cancer patients, using clustering with the correlation-based distance measure. The complementary clustering reveals a grouping of the patients which is uncorrelated with a number of known prognostic signatures and significantly differing distant metastasis-free probabilities.
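
The abstract does not spell out the steps of the procedure, so the following is only a minimal sketch of the general idea, under stated assumptions: cluster the samples once with a correlation-based distance, remove the structure explained by that first clustering, and cluster the residuals again. The residual step used here (subtracting each cluster's mean expression profile), the choice of average linkage, the fixed cut into `k` groups, and all function names are assumptions made for illustration, not details taken from the paper. The toy data are synthetic: a strong signal splits the samples one way and a weaker signal on different genes splits them another way.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import pdist


def cluster_samples(X, k):
    """Cluster the columns (samples) of a genes-by-samples matrix X into k groups."""
    # Correlation-based distance: 1 - Pearson correlation between sample profiles.
    d = pdist(X.T, metric="correlation")
    Z = linkage(d, method="average")  # average linkage is an assumption
    return fcluster(Z, t=k, criterion="maxclust")


def remove_cluster_structure(X, labels):
    """Assumed residual step: subtract each cluster's mean profile from its samples."""
    R = X.astype(float).copy()
    for c in np.unique(labels):
        cols = labels == c
        R[:, cols] -= X[:, cols].mean(axis=1, keepdims=True)
    return R


def complementary_clustering(X, k=2):
    """Return (initial labels, labels from re-clustering the residual matrix)."""
    first = cluster_samples(X, k)
    residual = remove_cluster_structure(X, first)
    second = cluster_samples(residual, k)
    return first, second


def make_toy_data(seed=0):
    """Synthetic 100 genes x 20 samples with a dominant and a weaker grouping."""
    rng = np.random.default_rng(seed)
    n_genes, n_samples = 100, 20
    s = np.repeat([1, -1], n_samples // 2)  # dominant grouping: first half vs second half
    w = np.tile([1, -1], n_samples // 2)    # weaker grouping: alternating samples
    u = np.zeros(n_genes)
    u[:50] = np.tile([1.0, -1.0], 25)       # genes carrying the dominant signal
    v = np.zeros(n_genes)
    v[50:] = np.tile([1.0, -1.0], 25)       # genes carrying the weaker signal
    X = 3.0 * np.outer(u, s) + 1.0 * np.outer(v, w)
    X += rng.normal(scale=0.5, size=X.shape)
    return X, s, w


if __name__ == "__main__":
    X, s, w = make_toy_data()
    first, second = complementary_clustering(X, k=2)
    print("initial clustering:      ", first)
    print("complementary clustering:", second)
```

In this toy setting the initial clustering follows the dominant grouping, while re-clustering the residuals brings out the grouping driven by the weaker genes, which is the kind of masked structure the abstract describes.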

Year:  2007        PMID: 18093965      PMCID: PMC3294318          DOI: 10.1093/biostatistics/kxm046

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  14 in total

1.  Model-based clustering and data transformations for gene expression data.

Authors:  K Y Yeung; C Fraley; A Murua; A E Raftery; W L Ruzzo
Journal:  Bioinformatics       Date:  2001-10       Impact factor: 6.937

2.  CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts.

Authors:  E P Xing; R M Karp
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

3.  Biclustering microarray data by Gibbs sampling.

Authors:  Qizheng Sheng; Yves Moreau; Bart De Moor
Journal:  Bioinformatics       Date:  2003-10       Impact factor: 6.937

4.  Biclustering algorithms for biological data analysis: a survey.

Authors:  Sara C Madeira; Arlindo L Oliveira
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2004 Jan-Mar       Impact factor: 3.710

5.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

6.  Spectral biclustering of microarray data: coclustering genes and conditions.

Authors:  Yuval Kluger; Ronen Basri; Joseph T Chang; Mark Gerstein
Journal:  Genome Res       Date:  2003-04       Impact factor: 9.043

7.  Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival.

Authors:  Howard Y Chang; Dimitry S A Nuyten; Julie B Sneddon; Trevor Hastie; Robert Tibshirani; Therese Sørlie; Hongyue Dai; Yudong D He; Laura J van't Veer; Harry Bartelink; Matt van de Rijn; Patrick O Brown; Marc J van de Vijver
Journal:  Proc Natl Acad Sci U S A       Date:  2005-02-08       Impact factor: 11.205

8.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

9.  A gene-expression signature as a predictor of survival in breast cancer.

Authors:  Marc J van de Vijver; Yudong D He; Laura J van't Veer; Hongyue Dai; Augustinus A M Hart; Dorien W Voskuil; George J Schreiber; Johannes L Peterse; Chris Roberts; Matthew J Marton; Mark Parrish; Douwe Atsma; Anke Witteveen; Annuska Glas; Leonie Delahaye; Tony van der Velde; Harry Bartelink; Sjoerd Rodenhuis; Emiel T Rutgers; Stephen H Friend; René Bernards
Journal:  N Engl J Med       Date:  2002-12-19       Impact factor: 91.245

10.  Model-based cluster analysis of microarray gene-expression data.

Authors:  Wei Pan; Jizhen Lin; Chap T Le
Journal:  Genome Biol       Date:  2002-01-29       Impact factor: 13.583

  6 in total

1.  A framework for feature selection in clustering.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

2.  Identification of relevant subtypes via preweighted sparse clustering.

Authors:  Sheila Gaynor; Eric Bair
Journal:  Comput Stat Data Anal       Date:  2017-06-23       Impact factor: 1.681

3.  Semi-supervised clustering methods.

Authors:  Eric Bair
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2013

4.  Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering.

Authors:  Dwain Mefford; Joel A Mefford
Journal:  BMC Genomics       Date:  2010-08-23       Impact factor: 3.969

5.  Robustification of Naïve Bayes Classifier and Its Application for Microarray Gene Expression Data Analysis.

Authors:  Md Shakil Ahmed; Md Shahjaman; Md Masud Rana; Md Nurul Haque Mollah
Journal:  Biomed Res Int       Date:  2017-08-07       Impact factor: 3.411

6.  Identifying and Assessing Interesting Subgroups in a Heterogeneous Population.

Authors:  Woojoo Lee; Andrey Alexeyenko; Maria Pernemalm; Justine Guegan; Philippe Dessen; Vladimir Lazar; Janne Lehtiö; Yudi Pawitan
Journal:  Biomed Res Int       Date:  2015-08-03       Impact factor: 3.411
