Literature DB >> 26756854

Integrative and regularized principal component analysis of multiple sources of data.

Binghui Liu1,2,3, Xiaotong Shen2, Wei Pan3.   

Abstract

Integration of data of disparate types has become increasingly important to enhancing the power for new discoveries by combining complementary strengths of multiple types of data. One application is to uncover tumor subtypes in human cancer research in which multiple types of genomic data are integrated, including gene expression, DNA copy number, and DNA methylation data. In spite of their successes, existing approaches based on joint latent variable models require stringent distributional assumptions and may suffer from unbalanced scales (or units) of different types of data and non-scalability of the corresponding algorithms. In this paper, we propose an alternative based on integrative and regularized principal component analysis, which is distribution-free, computationally efficient, and robust against unbalanced scales. The new method performs dimension reduction simultaneously on multiple types of data, seeking data-adaptive sparsity and scaling. As a result, in addition to feature selection for each type of data, integrative clustering is achieved. Numerically, the proposed method compares favorably against its competitors in terms of accuracy (in identifying hidden clusters), computational efficiency, and robustness against unbalanced scales. In particular, compared with a popular method, the new method was competitive in identifying tumor subtypes associated with distinct patient survival patterns when applied to a combined analysis of DNA copy number, mRNA expression, and DNA methylation data in a glioblastoma multiforme study.
Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

Entities:  

Keywords:  PCA; integrative clustering; tumor subtypes

Mesh:

Year:  2016        PMID: 26756854      PMCID: PMC4853304          DOI: 10.1002/sim.6866

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  23 in total

1.  Merging multiple omics datasets in silico: statistical analyses and data interpretation.

Authors:  Kazuharu Arakawa; Masaru Tomita
Journal:  Methods Mol Biol       Date:  2013

2.  Matrix factorization methods for integrative cancer genomics.

Authors:  Shihua Zhang; Xianghong Jasmine Zhou
Journal:  Methods Mol Biol       Date:  2014

3.  DNA methylation profiles of long- and short-term glioblastoma survivors.

Authors:  Thoraia Shinawi; Victoria K Hill; Dietmar Krex; Gabriele Schackert; Dean Gentle; Mark R Morris; Wenbin Wei; Garth Cruickshank; Eamonn R Maher; Farida Latif
Journal:  Epigenetics       Date:  2013-01-04       Impact factor: 4.528

4.  JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.

Authors:  Eric F Lock; Katherine A Hoadley; J S Marron; Andrew B Nobel
Journal:  Ann Appl Stat       Date:  2013-03-01       Impact factor: 2.083

5.  Analysis of phosphotyrosine signaling in glioblastoma identifies STAT5 as a novel downstream target of ΔEGFR.

Authors:  Vaibhav Chumbalkar; Khatri Latha; YeoHyeon Hwang; Rebecca Maywald; Lauren Hawley; Raymond Sawaya; Lixia Diao; Keith Baggerly; Webster K Cavenee; Frank B Furnari; Oliver Bogler
Journal:  J Proteome Res       Date:  2011-02-14       Impact factor: 4.466

6.  Radiogenomic mapping of edema/cellular invasion MRI-phenotypes in glioblastoma multiforme.

Authors:  Pascal O Zinn; Bhanu Mahajan; Bhanu Majadan; Pratheesh Sathyan; Sanjay K Singh; Sadhan Majumder; Ferenc A Jolesz; Rivka R Colen
Journal:  PLoS One       Date:  2011-10-05       Impact factor: 3.240

7.  Integrative subtype discovery in glioblastoma using iCluster.

Authors:  Ronglai Shen; Qianxing Mo; Nikolaus Schultz; Venkatraman E Seshan; Adam B Olshen; Jason Huse; Marc Ladanyi; Chris Sander
Journal:  PLoS One       Date:  2012-04-23       Impact factor: 3.240

8.  Expression of nicotinamide N-methyltransferase in hepatocellular carcinoma is associated with poor prognosis.

Authors:  Jongmin Kim; Seok Joo Hong; Eun Kyung Lim; Yun-Suk Yu; Seung Whan Kim; Ji Hyeon Roh; In-Gu Do; Jae-Won Joh; Dae Shick Kim
Journal:  J Exp Clin Cancer Res       Date:  2009-02-16

9.  Promoter hypermethylation-mediated inactivation of LRRC4 in gliomas.

Authors:  Zuping Zhang; Dan Li; Minghua Wu; Bo Xiang; Li Wang; Ming Zhou; Pan Chen; Xiaoling Li; Shourong Shen; Guiyuan Li
Journal:  BMC Mol Biol       Date:  2008-11-03       Impact factor: 2.946

10.  Sparse canonical methods for biological data integration: application to a cross-platform study.

Authors:  Kim-Anh Lê Cao; Pascal G P Martin; Christèle Robert-Granié; Philippe Besse
Journal:  BMC Bioinformatics       Date:  2009-01-26       Impact factor: 3.169

View more
  3 in total

1.  Integrative factorization of bidimensionally linked matrices.

Authors:  Jun Young Park; Eric F Lock
Journal:  Biometrics       Date:  2019-11-10       Impact factor: 2.571

2.  A New Algorithm and Theory for Penalized Regression-based Clustering.

Authors:  Chong Wu; Sunghoon Kwon; Xiaotong Shen; Wei Pan
Journal:  J Mach Learn Res       Date:  2016       Impact factor: 3.654

3.  Uncovering Large-Scale Conformational Change in Molecular Dynamics without Prior Knowledge.

Authors:  Ryan L Melvin; Ryan C Godwin; Jiajie Xiao; William G Thompson; Kenneth S Berenhaut; Freddie R Salsbury
Journal:  J Chem Theory Comput       Date:  2016-11-10       Impact factor: 6.006

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.