
Integrative Sparse K-Means With Overlapping Group Lasso in Genomic Applications for Disease Subtype Discovery.

Zhiguang Huo, George Tseng

Abstract

Cancer subtype discovery is the first step toward delivering personalized medicine to cancer patients. With the accumulation of massive multi-level omics datasets and established biological knowledge databases, integrating omics data while incorporating rich existing biological knowledge is essential for deciphering the biological mechanisms behind complex diseases. In this manuscript, we propose an integrative sparse K-means (is-K means) approach to discover disease subtypes with the guidance of prior biological knowledge via a sparse overlapping group lasso. An algorithm using the alternating direction method of multipliers (ADMM) is applied for fast optimization. Simulations and three real applications in breast cancer and leukemia are used to compare is-K means with existing methods and demonstrate its superior clustering accuracy, feature selection, functional annotation of detected molecular features, and computing efficiency.
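
As a rough illustration of the group-level sparsity that a group lasso penalty induces, the sketch below implements block soft-thresholding, the standard proximal step for a single (non-overlapping) group penalty. ADMM-style solvers for group lasso problems commonly apply a step of this kind to each group. This is not the authors' is-K means algorithm: the function name, the example vectors, and the threshold values are hypothetical and chosen only for illustration.

```python
import math


def group_soft_threshold(values, threshold):
    """Block soft-thresholding: proximal operator of threshold * ||v||_2.

    Hypothetical helper for illustration only. The whole group is set to
    zero when its Euclidean norm is at most the threshold; otherwise every
    entry is shrunk toward zero by the same factor.
    """
    norm = math.sqrt(sum(v * v for v in values))
    if norm <= threshold:
        # The group as a whole is too weak: drop all of its features.
        return [0.0 for _ in values]
    scale = 1.0 - threshold / norm
    return [scale * v for v in values]


# A group with norm 5 and threshold 2.5 is shrunk by half.
shrunk = group_soft_threshold([3.0, 4.0], 2.5)

# A group whose norm does not exceed the threshold is removed entirely.
dropped = group_soft_threshold([0.3, 0.4], 1.0)
```

The all-or-nothing behavior at the group level is what lets prior knowledge, encoded as feature groups, steer which features are selected together. Handling overlapping groups requires additional machinery that this sketch does not cover.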

Keywords:  Cancer subtype; ADMM; omics integrative analysis; overlapping group lasso

Year:  2017        PMID: 28959370      PMCID: PMC5613668          DOI: 10.1214/17-AOAS1033

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083

