Literature DB >> 27330233

Meta-analytic framework for sparse K-means to identify disease subtypes in multiple transcriptomic studies.

Zhiguang Huo1, Ying Ding2, Silvia Liu3, Steffi Oesterreich4, George Tseng5.   

Abstract

Disease phenotyping by omics data has become a popular approach that potentially can lead to better personalized treatment. Identifying disease subtypes via unsupervised machine learning is the first step towards this goal. In this paper, we extend a sparse K-means method towards a meta-analytic framework to identify novel disease subtypes when expression profiles of multiple cohorts are available. The lasso regularization and meta-analysis identify a unique set of gene features for subtype characterization. An additional pattern matching reward function guarantees consistent subtype signatures across studies. The method was evaluated by simulations and leukemia and breast cancer data sets. The identified disease subtypes from meta-analysis were characterized with improved accuracy and stability compared to single study analysis. The breast cancer model was applied to an independent METABRIC dataset and generated improved survival difference between subtypes. These results provide a basis for diagnosis and development of targeted treatments for disease subgroups.

Entities:  

Keywords:  Disease subtype discovery; K-means; Lasso; Meta-analysis; Unsupervised machine learning

Year:  2016        PMID: 27330233      PMCID: PMC4908837          DOI: 10.1080/01621459.2015.1086354

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  37 in total

1.  A mixture model-based approach to the clustering of microarray expression data.

Authors:  G J McLachlan; R W Bean; D Peel
Journal:  Bioinformatics       Date:  2002-03       Impact factor: 6.937

2.  Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies.

Authors:  Brian D Lehmann; Joshua A Bauer; Xi Chen; Melinda E Sanders; A Bapsi Chakravarthy; Yu Shyr; Jennifer A Pietenpol
Journal:  J Clin Invest       Date:  2011-07       Impact factor: 14.808

3.  Tight clustering: a resampling-based approach for identifying stable and tight patterns in data.

Authors:  George C Tseng; Wing H Wong
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

4.  Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series.

Authors:  Christine Desmedt; Fanny Piette; Sherene Loi; Yixin Wang; Françoise Lallemand; Benjamin Haibe-Kains; Giuseppe Viale; Mauro Delorenzi; Yi Zhang; Mahasti Saghatchian d'Assignies; Jonas Bergh; Rosette Lidereau; Paul Ellis; Adrian L Harris; Jan G M Klijn; John A Foekens; Fatima Cardoso; Martine J Piccart; Marc Buyse; Christos Sotiriou
Journal:  Clin Cancer Res       Date:  2007-06-01       Impact factor: 12.531

5.  A colorectal cancer classification system that associates cellular phenotype and responses to therapy.

Authors:  Anguraj Sadanandam; Costas A Lyssiotis; Krisztian Homicsko; Eric A Collisson; William J Gibb; Stephan Wullschleger; Liliane C Gonzalez Ostos; William A Lannon; Carsten Grotzinger; Maguy Del Rio; Benoit Lhermitte; Adam B Olshen; Bertram Wiedenmann; Lewis C Cantley; Joe W Gray; Douglas Hanahan
Journal:  Nat Med       Date:  2013-04-14       Impact factor: 53.440

6.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

7.  Supervised risk predictor of breast cancer based on intrinsic subtypes.

Authors:  Joel S Parker; Michael Mullins; Maggie C U Cheang; Samuel Leung; David Voduc; Tammi Vickery; Sherri Davies; Christiane Fauron; Xiaping He; Zhiyuan Hu; John F Quackenbush; Inge J Stijleman; Juan Palazzo; J S Marron; Andrew B Nobel; Elaine Mardis; Torsten O Nielsen; Matthew J Ellis; Charles M Perou; Philip S Bernard
Journal:  J Clin Oncol       Date:  2009-02-09       Impact factor: 44.544

Review 8.  Comprehensive literature review and statistical considerations for GWAS meta-analysis.

Authors:  Ferdouse Begum; Debashis Ghosh; George C Tseng; Eleanor Feingold
Journal:  Nucleic Acids Res       Date:  2012-01-12       Impact factor: 16.971

9.  Microarray-based class discovery for molecular classification of breast cancer: analysis of interobserver agreement.

Authors:  Alan Mackay; Britta Weigelt; Anita Grigoriadis; Bas Kreike; Rachael Natrajan; Roger A'Hern; David S P Tan; Mitch Dowsett; Alan Ashworth; Jorge S Reis-Filho
Journal:  J Natl Cancer Inst       Date:  2011-03-18       Impact factor: 13.506

10.  A prediction-based resampling method for estimating the number of clusters in a dataset.

Authors:  Sandrine Dudoit; Jane Fridlyand
Journal:  Genome Biol       Date:  2002-06-25       Impact factor: 13.583

View more
  7 in total

1.  Event Surrogate from Clinical Pathway Completion to Daily Meal for Availability Extension Using Standard Electronic Medical Records: a Retrospective Cohort Study.

Authors:  Hiroki Furuhata; Kenji Araki; Taisuke Ogawa
Journal:  J Med Syst       Date:  2021-02-05       Impact factor: 4.460

2.  Integrative Sparse K-Means With Overlapping Group Lasso in Genomic Applications for Disease Subtype Discovery.

Authors:  Zhiguang Huo; George Tseng
Journal:  Ann Appl Stat       Date:  2017-07-20       Impact factor: 2.083

3.  Meta-analytic principal component analysis in integrative omics application.

Authors:  SungHwan Kim; Dongwan Kang; Zhiguang Huo; Yongseok Park; George C Tseng
Journal:  Bioinformatics       Date:  2018-04-15       Impact factor: 6.937

Review 4.  Advances and Opportunities in Single-Cell Transcriptomics for Plant Research.

Authors:  Carolin Seyfferth; Jim Renema; Jos R Wendrich; Thomas Eekhout; Ruth Seurinck; Niels Vandamme; Bernhard Blob; Yvan Saeys; Yrjo Helariutta; Kenneth D Birnbaum; Bert De Rybel
Journal:  Annu Rev Plant Biol       Date:  2021-03-17       Impact factor: 26.379

5.  Flexible experimental designs for valid single-cell RNA-sequencing experiments allowing batch effects correction.

Authors:  Fangda Song; Ga Ming Angus Chan; Yingying Wei
Journal:  Nat Commun       Date:  2020-07-01       Impact factor: 14.919

6.  Detecting survival-associated biomarkers from heterogeneous populations.

Authors:  Takumi Saegusa; Zhiwei Zhao; Hongjie Ke; Zhenyao Ye; Zhongying Xu; Shuo Chen; Tianzhou Ma
Journal:  Sci Rep       Date:  2021-02-05       Impact factor: 4.379

7.  Biomarker Categorization in Transcriptomic Meta-Analysis by Concordant Patterns With Application to Pan-Cancer Studies.

Authors:  Zhenyao Ye; Hongjie Ke; Shuo Chen; Raul Cruz-Cano; Xin He; Jing Zhang; Joanne Dorgan; Donald K Milton; Tianzhou Ma
Journal:  Front Genet       Date:  2021-07-02       Impact factor: 4.599

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.