Literature DB >> 19759197

Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis.

Ronglai Shen1, Adam B Olshen, Marc Ladanyi.   

Abstract

MOTIVATION: The molecular complexity of a tumor manifests itself at the genomic, epigenomic, transcriptomic and proteomic levels. Genomic profiling at these multiple levels should allow an integrated characterization of tumor etiology. However, there is a shortage of effective statistical and bioinformatic tools for truly integrative data analysis. The standard approach to integrative clustering is separate clustering followed by manual integration. A more statistically powerful approach would incorporate all data types simultaneously and generate a single integrated cluster assignment.
METHODS: We developed a joint latent variable model for integrative clustering. We call the resulting methodology iCluster. iCluster incorporates flexible modeling of the associations between different data types and the variance-covariance structure within data types in a single framework, while simultaneously reducing the dimensionality of the datasets. Likelihood-based inference is obtained through the Expectation-Maximization algorithm.
RESULTS: We demonstrate the iCluster algorithm using two examples of joint analysis of copy number and gene expression data, one from breast cancer and one from lung cancer. In both cases, we identified subtypes characterized by concordant DNA copy number changes and gene expression as well as unique profiles specific to one or the other in a completely automated fashion. In addition, the algorithm discovers potentially novel subtypes by combining weak yet consistent alteration patterns across data types. AVAILABILITY: R code to implement iCluster can be downloaded at http://www.mskcc.org/mskcc/html/85130.cfm

Entities:  

Mesh:

Year:  2009        PMID: 19759197      PMCID: PMC2800366          DOI: 10.1093/bioinformatics/btp543

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

1.  Singular value decomposition for genome-wide expression data processing and modeling.

Authors:  O Alter; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

2.  Metagenes and molecular pattern discovery using matrix factorization.

Authors:  Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-11       Impact factor: 11.205

3.  Circular binary segmentation for the analysis of array-based DNA copy number data.

Authors:  Adam B Olshen; E S Venkatraman; Robert Lucito; Michael Wigler
Journal:  Biostatistics       Date:  2004-10       Impact factor: 5.899

4.  Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets.

Authors:  Benjamin P Lewis; Christopher B Burge; David P Bartel
Journal:  Cell       Date:  2005-01-14       Impact factor: 41.582

5.  A faster circular binary segmentation algorithm for the analysis of array CGH data.

Authors:  E S Venkatraman; Adam B Olshen
Journal:  Bioinformatics       Date:  2007-01-18       Impact factor: 6.937

6.  Impact of DNA amplification on gene expression patterns in breast cancer.

Authors:  Elizabeth Hyman; Päivikki Kauraniemi; Sampsa Hautaniemi; Maija Wolf; Spyro Mousses; Ester Rozenblum; Markus Ringnér; Guido Sauter; Outi Monni; Abdel Elkahloun; Olli-P Kallioniemi; Anne Kallioniemi
Journal:  Cancer Res       Date:  2002-11-01       Impact factor: 12.701

7.  Integrative analysis reveals the direct and indirect interactions between DNA copy number aberrations and gene expression changes.

Authors:  Hyunju Lee; Sek Won Kong; Peter J Park
Journal:  Bioinformatics       Date:  2008-02-08       Impact factor: 6.937

8.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways.

Authors: 
Journal:  Nature       Date:  2008-09-04       Impact factor: 49.962

9.  An integrative analysis of microRNA and mRNA expression--a case study.

Authors:  Li-Xuan Qin
Journal:  Cancer Inform       Date:  2008-06-17

10.  Integrated genomics identifies five medulloblastoma subtypes with distinct genetic profiles, pathway signatures and clinicopathological features.

Authors:  Marcel Kool; Jan Koster; Jens Bunt; Nancy E Hasselt; Arjan Lakeman; Peter van Sluis; Dirk Troost; Netteke Schouten-van Meeteren; Huib N Caron; Jacqueline Cloos; Alan Mrsić; Bauke Ylstra; Wieslawa Grajkowska; Wolfgang Hartmann; Torsten Pietsch; David Ellison; Steven C Clifford; Rogier Versteeg
Journal:  PLoS One       Date:  2008-08-28       Impact factor: 3.240

View more
  260 in total

Review 1.  Triple-negative breast cancer: present challenges and new perspectives.

Authors:  Franca Podo; Lutgarde M C Buydens; Hadassa Degani; Riet Hilhorst; Edda Klipp; Ingrid S Gribbestad; Sabine Van Huffel; Hanneke W M van Laarhoven; Jan Luts; Daniel Monleon; Geert J Postma; Nicole Schneiderhan-Marra; Filippo Santoro; Hans Wouters; Hege G Russnes; Therese Sørlie; Elda Tagliabue; Anne-Lise Børresen-Dale
Journal:  Mol Oncol       Date:  2010-04-24       Impact factor: 6.603

Review 2.  Statistical approaches for the analysis of DNA methylation microarray data.

Authors:  Kimberly D Siegmund
Journal:  Hum Genet       Date:  2011-04-26       Impact factor: 4.132

3.  Lessons from a decade of integrating cancer copy number alterations with gene expression profiles.

Authors:  Norman Huang; Parantu K Shah; Cheng Li
Journal:  Brief Bioinform       Date:  2011-09-23       Impact factor: 11.622

4.  A statistical framework for Illumina DNA methylation arrays.

Authors:  Pei Fen Kuan; Sijian Wang; Xin Zhou; Haitao Chu
Journal:  Bioinformatics       Date:  2010-09-29       Impact factor: 6.937

5.  Evaluation of hierarchical models for integrative genomic analyses.

Authors:  Marie Denis; Mahlet G Tadesse
Journal:  Bioinformatics       Date:  2015-11-05       Impact factor: 6.937

Review 6.  Annual Research Review: Discovery science strategies in studies of the pathophysiology of child and adolescent psychiatric disorders--promises and limitations.

Authors:  Yihong Zhao; F Xavier Castellanos
Journal:  J Child Psychol Psychiatry       Date:  2016-01-06       Impact factor: 8.982

Review 7.  Advances in sarcoma genomics and new therapeutic targets.

Authors:  Barry S Taylor; Jordi Barretina; Robert G Maki; Cristina R Antonescu; Samuel Singer; Marc Ladanyi
Journal:  Nat Rev Cancer       Date:  2011-07-14       Impact factor: 60.716

8.  Bayesian consensus clustering.

Authors:  Eric F Lock; David B Dunson
Journal:  Bioinformatics       Date:  2013-08-28       Impact factor: 6.937

9.  SPARSE INTEGRATIVE CLUSTERING OF MULTIPLE OMICS DATA SETS.

Authors:  Ronglai Shen; Sijian Wang; Qianxing Mo
Journal:  Ann Appl Stat       Date:  2013-04-09       Impact factor: 2.083

10.  Similarity network fusion for aggregating data types on a genomic scale.

Authors:  Bo Wang; Aziz M Mezlini; Feyyaz Demir; Marc Fiume; Zhuowen Tu; Michael Brudno; Benjamin Haibe-Kains; Anna Goldenberg
Journal:  Nat Methods       Date:  2014-01-26       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.