Literature DB >> 31444786

Integrative factorization of bidimensionally linked matrices.

Jun Young Park1, Eric F Lock1.   

Abstract

Advances in molecular "omics" technologies have motivated new methodologies for the integration of multiple sources of high-content biomedical data. However, most statistical methods for integrating multiple data matrices only consider data shared vertically (one cohort on multiple platforms) or horizontally (different cohorts on a single platform). This is limiting for data that take the form of bidimensionally linked matrices (eg, multiple cohorts measured on multiple platforms), which are increasingly common in large-scale biomedical studies. In this paper, we propose bidimensional integrative factorization (BIDIFAC) for integrative dimension reduction and signal approximation of bidimensionally linked data matrices. Our method factorizes data into (a) globally shared, (b) row-shared, (c) column-shared, and (d) single-matrix structural components, facilitating the investigation of shared and unique patterns of variability. For estimation, we use a penalized objective function that extends the nuclear norm penalization for a single matrix. As an alternative to the complicated rank selection problem, we use results from the random matrix theory to choose tuning parameters. We apply our method to integrate two genomics platforms (messenger RNA and microRNA expression) across two sample cohorts (tumor samples and normal tissue samples) using the breast cancer data from the Cancer Genome Atlas. We provide R code for fitting BIDIFAC, imputing missing values, and generating simulated data.
© 2019 The International Biometric Society.

Entities:  

Keywords:  BIDIFAC; bidimensional data; cancer genomics; data integration; dimension reduction; principal component analysis

Year:  2019        PMID: 31444786      PMCID: PMC7036334          DOI: 10.1111/biom.13141

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  25 in total

1.  Group Component Analysis for Multiblock Data: Common and Individual Feature Extraction.

Authors:  Guoxu Zhou; Andrzej Cichocki; Yu Zhang; Danilo P Mandic
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2015-10-28       Impact factor: 10.451

2.  Integrative and regularized principal component analysis of multiple sources of data.

Authors:  Binghui Liu; Xiaotong Shen; Wei Pan
Journal:  Stat Med       Date:  2016-01-12       Impact factor: 2.373

3.  R.JIVE for exploration of multi-source molecular data.

Authors:  Michael J O'Connell; Eric F Lock
Journal:  Bioinformatics       Date:  2016-06-06       Impact factor: 6.937

4.  Integrative approaches for large-scale transcriptome-wide association studies.

Authors:  Alexander Gusev; Arthur Ko; Huwenbo Shi; Gaurav Bhatia; Wonil Chung; Brenda W J H Penninx; Rick Jansen; Eco J C de Geus; Dorret I Boomsma; Fred A Wright; Patrick F Sullivan; Elina Nikkola; Marcus Alvarez; Mete Civelek; Aldons J Lusis; Terho Lehtimäki; Emma Raitoharju; Mika Kähönen; Ilkka Seppälä; Olli T Raitakari; Johanna Kuusisto; Markku Laakso; Alkes L Price; Päivi Pajukanta; Bogdan Pasaniuc
Journal:  Nat Genet       Date:  2016-02-08       Impact factor: 38.330

5.  JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.

Authors:  Eric F Lock; Katherine A Hoadley; J S Marron; Andrew B Nobel
Journal:  Ann Appl Stat       Date:  2013-03-01       Impact factor: 2.083

6.  SWISS MADE: Standardized WithIn Class Sum of Squares to evaluate methodologies and dataset elements.

Authors:  Christopher R Cabanski; Yuan Qi; Xiaoying Yin; Eric Bair; Michele C Hayward; Cheng Fan; Jianying Li; Matthew D Wilkerson; J S Marron; Charles M Perou; D Neil Hayes
Journal:  PLoS One       Date:  2010-03-26       Impact factor: 3.240

7.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis.

Authors:  Ronglai Shen; Adam B Olshen; Marc Ladanyi
Journal:  Bioinformatics       Date:  2009-09-16       Impact factor: 6.937

8.  TCGA2STAT: simple TCGA data access for integrated statistical analysis in R.

Authors:  Ying-Wooi Wan; Genevera I Allen; Zhandong Liu
Journal:  Bioinformatics       Date:  2015-11-14       Impact factor: 6.937

Review 9.  Comprehensive literature review and statistical considerations for microarray meta-analysis.

Authors:  George C Tseng; Debashis Ghosh; Eleanor Feingold
Journal:  Nucleic Acids Res       Date:  2012-01-19       Impact factor: 16.971

10.  Transcriptional Profiles from Paired Normal Samples Offer Complementary Information on Cancer Patient Survival--Evidence from TCGA Pan-Cancer Data.

Authors:  Xiu Huang; David F Stern; Hongyu Zhao
Journal:  Sci Rep       Date:  2016-02-03       Impact factor: 4.379

View more
  2 in total

1.  Two-stage linked component analysis for joint decomposition of multiple biologically related data sets.

Authors:  Huan Chen; Brian Caffo; Genevieve Stein-O'Brien; Jinrui Liu; Ben Langmead; Carlo Colantuoni; Luo Xiao
Journal:  Biostatistics       Date:  2022-10-14       Impact factor: 5.279

2.  BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.

Authors:  Eric F Lock; Jun Young Park; Katherine A Hoadley
Journal:  Ann Appl Stat       Date:  2022-03-28       Impact factor: 1.959

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.