Literature DB >> 28099990

Statistical significance for hierarchical clustering.

Patrick K Kimes1, Yufeng Liu1,2,3,4,5, David Neil Hayes5, James Stephen Marron1,2,5.   

Abstract

Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high-dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other fields for their ability to simultaneously uncover multiple layers of clustering structure. A critical and challenging question in cluster analysis is whether the identified clusters represent important underlying structure or are artifacts of natural sampling variation. Few approaches have been proposed for addressing this problem in the context of hierarchical clustering, for which the problem is further complicated by the natural tree structure of the partition, and the multiplicity of tests required to parse the layers of nested clusters. In this article, we propose a Monte Carlo based approach for testing statistical significance in hierarchical clustering which addresses these issues. The approach is implemented as a sequential testing procedure guaranteeing control of the family-wise error rate. Theoretical justification is provided for our approach, and its power to detect true clustering structure is illustrated through several simulation studies and applications to two cancer gene expression datasets.
© 2017, The International Biometric Society.

Entities:  

Keywords:  High-dimension; Hypothesis testing; Multiple correction; Unsupervised learning

Mesh:

Year:  2017        PMID: 28099990      PMCID: PMC5708128          DOI: 10.1111/biom.12647

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  14 in total

1.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

2.  Statistical Significance of Clustering using Soft Thresholding.

Authors:  Hanwen Huang; Yufeng Liu; Ming Yuan; J S Marron
Journal:  J Comput Graph Stat       Date:  2015-12-10       Impact factor: 2.302

3.  Molecular portraits of human breast tumours.

Authors:  C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

4.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Authors:  A Bhattacharjee; W G Richards; J Staunton; C Li; S Monti; P Vasa; C Ladd; J Beheshti; R Bueno; M Gillette; M Loda; G Weber; E J Mark; E S Lander; W Wong; B E Johnson; T R Golub; D J Sugarbaker; M Meyerson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

5.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

6.  Supervised risk predictor of breast cancer based on intrinsic subtypes.

Authors:  Joel S Parker; Michael Mullins; Maggie C U Cheang; Samuel Leung; David Voduc; Tammi Vickery; Sherri Davies; Christiane Fauron; Xiaping He; Zhiyuan Hu; John F Quackenbush; Inge J Stijleman; Juan Palazzo; J S Marron; Andrew B Nobel; Elaine Mardis; Torsten O Nielsen; Matthew J Ellis; Charles M Perou; Philip S Bernard
Journal:  J Clin Oncol       Date:  2009-02-09       Impact factor: 44.544

Review 7.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

8.  Phenotypic and molecular characterization of the claudin-low intrinsic subtype of breast cancer.

Authors:  Aleix Prat; Joel S Parker; Olga Karginova; Cheng Fan; Chad Livasy; Jason I Herschkowitz; Xiaping He; Charles M Perou
Journal:  Breast Cancer Res       Date:  2010-09-02       Impact factor: 6.466

9.  Microarray-based class discovery for molecular classification of breast cancer: analysis of interobserver agreement.

Authors:  Alan Mackay; Britta Weigelt; Anita Grigoriadis; Bas Kreike; Rachael Natrajan; Roger A'Hern; David S P Tan; Mitch Dowsett; Alan Ashworth; Jorge S Reis-Filho
Journal:  J Natl Cancer Inst       Date:  2011-03-18       Impact factor: 13.506

10.  PAM50 breast cancer subtyping by RT-qPCR and concordance with standard clinical molecular markers.

Authors:  Roy R L Bastien; Álvaro Rodríguez-Lescure; Mark T W Ebbert; Aleix Prat; Blanca Munárriz; Leslie Rowe; Patricia Miller; Manuel Ruiz-Borrego; Daniel Anderson; Bradley Lyons; Isabel Álvarez; Tracy Dowell; David Wall; Miguel Ángel Seguí; Lee Barley; Kenneth M Boucher; Emilio Alba; Lisa Pappas; Carole A Davis; Ignacio Aranda; Christiane Fauron; Inge J Stijleman; José Palacios; Antonio Antón; Eva Carrasco; Rosalía Caballero; Matthew J Ellis; Torsten O Nielsen; Charles M Perou; Mark Astill; Philip S Bernard; Miguel Martín
Journal:  BMC Med Genomics       Date:  2012-10-04       Impact factor: 3.063

View more
  30 in total

1.  Interpreting k-mer-based signatures for antibiotic resistance prediction.

Authors:  Magali Jaillard; Mattia Palmieri; Alex van Belkum; Pierre Mahé
Journal:  Gigascience       Date:  2020-10-17       Impact factor: 6.524

2.  A multi-omic single-cell landscape of human gynecologic malignancies.

Authors:  Matthew J Regner; Kamila Wisniewska; Susana Garcia-Recio; Aatish Thennavan; Raul Mendez-Giraldez; Venkat S Malladi; Gabrielle Hawkins; Joel S Parker; Charles M Perou; Victoria L Bae-Jump; Hector L Franco
Journal:  Mol Cell       Date:  2021-11-04       Impact factor: 17.970

3.  Tracking Regulatory T Cell Development in the Thymus Using Single-Cell RNA Sequencing/TCR Sequencing.

Authors:  David L Owen; Rebecca S La Rue; Sarah A Munro; Michael A Farrar
Journal:  J Immunol       Date:  2022-08-29       Impact factor: 5.426

4.  A systematic review of illness representation clusters in chronic conditions.

Authors:  Eleanor Rivera; Colleen Corte; Holli A DeVon; Eileen G Collins; Alana Steffen
Journal:  Res Nurs Health       Date:  2020-02-17       Impact factor: 2.228

5.  An unsupervised machine learning method for discovering patient clusters based on genetic signatures.

Authors:  Christian Lopez; Scott Tucker; Tarik Salameh; Conrad Tucker
Journal:  J Biomed Inform       Date:  2018-07-29       Impact factor: 6.317

6.  Ultrastructural analysis of dendritic spine necks reveals a continuum of spine morphologies.

Authors:  Netanel Ofer; Daniel R Berger; Narayanan Kasthuri; Jeff W Lichtman; Rafael Yuste
Journal:  Dev Neurobiol       Date:  2021-05-30       Impact factor: 3.964

Review 7.  Metabolomics and Multi-Omics Integration: A Survey of Computational Methods and Resources.

Authors:  Tara Eicher; Garrett Kinnebrew; Andrew Patt; Kyle Spencer; Kevin Ying; Qin Ma; Raghu Machiraju; And Ewy A Mathé
Journal:  Metabolites       Date:  2020-05-15

8.  Renal Histologic Analysis Provides Complementary Information to Kidney Function Measurement for Patients with Early Diabetic or Hypertensive Disease.

Authors:  Ghazal Z Quinn; Amin Abedini; Hongbo Liu; Ziyuan Ma; Andrew Cucchiara; Andrea Havasi; Jon Hill; Matthew B Palmer; Katalin Susztak
Journal:  J Am Soc Nephrol       Date:  2021-08-04       Impact factor: 10.121

9.  Identify specific gene pairs for subarachnoid hemorrhage based on wavelet analysis and genetic algorithm.

Authors:  Pengcheng Zhao; Shaonian Xu; Zhenshan Huang; Pengcheng Deng; Yongming Zhang
Journal:  PLoS One       Date:  2021-06-17       Impact factor: 3.240

10.  Application of the Interaction between Tissue Immunohistochemistry Staining and Clinicopathological Factors for Evaluating the Risk of Oral Cancer Progression by Hierarchical Clustering Analysis: A Case-Control Study in a Taiwanese Population.

Authors:  Hui-Ching Wang; Meng-Chun Chou; Chun-Chieh Wu; Leong-Perng Chan; Sin-Hua Moi; Mei-Ren Pan; Ta-Chih Liu; Cheng-Hong Yang
Journal:  Diagnostics (Basel)       Date:  2021-05-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.