Literature DB >> 35505906

BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.

Eric F Lock1, Jun Young Park2, Katherine A Hoadley3.   

Abstract

Several modern applications require the integration of multiple large data matrices that have shared rows and/or columns. For example, cancer studies that integrate multiple omics platforms across multiple types of cancer, pan-omics pan-cancer analysis, have extended our knowledge of molecular heterogeneity beyond what was observed in single tumor and single platform studies. However, these studies have been limited by available statistical methodology. We propose a flexible approach to the simultaneous factorization and decomposition of variation across such bidimensionally linked matrices, BIDIFAC+. BIDIFAC+ decomposes variation into a series of low-rank components that may be shared across any number of row sets (e.g., omics platforms) or column sets (e.g., cancer types). This builds on a growing literature for the factorization and decomposition of linked matrices which has primarily focused on multiple matrices that are linked in one dimension (rows or columns) only. Our objective function extends nuclear norm penalization, is motivated by random matrix theory, gives a unique decomposition under relatively mild conditions, and can be shown to give the mode of a Bayesian posterior distribution. We apply BIDIFAC+ to pan-omics pan-cancer data from TCGA, identifying shared and specific modes of variability across four different omics platforms and 29 different cancer types.

Entities:  

Keywords:  Cancer genomics; data integration; low-rank matrix factorization; missing data imputation; nuclear norm penalization

Year:  2022        PMID: 35505906      PMCID: PMC9060567          DOI: 10.1214/21-AOAS1495

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   1.959


  30 in total

1.  The Cancer Genome Atlas: Creating Lasting Value beyond Its Data.

Authors:  Carolyn Hutter; Jean Claude Zenklusen
Journal:  Cell       Date:  2018-04-05       Impact factor: 41.582

2.  Integrative Sparse K-Means With Overlapping Group Lasso in Genomic Applications for Disease Subtype Discovery.

Authors:  Zhiguang Huo; George Tseng
Journal:  Ann Appl Stat       Date:  2017-07-20       Impact factor: 2.083

3.  R.JIVE for exploration of multi-source molecular data.

Authors:  Michael J O'Connell; Eric F Lock
Journal:  Bioinformatics       Date:  2016-06-06       Impact factor: 6.937

4.  Integrative clustering of high-dimensional data with joint and individual clusters.

Authors:  Kristoffer H Hellton; Magne Thoresen
Journal:  Biostatistics       Date:  2016-02-24       Impact factor: 5.899

5.  A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data.

Authors:  Zi Yang; George Michailidis
Journal:  Bioinformatics       Date:  2015-09-15       Impact factor: 6.937

6.  A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data.

Authors:  Qianxing Mo; Ronglai Shen; Cui Guo; Marina Vannucci; Keith S Chan; Susan G Hilsenbeck
Journal:  Biostatistics       Date:  2018-01-01       Impact factor: 5.899

7.  BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.

Authors:  Eric F Lock; Jun Young Park; Katherine A Hoadley
Journal:  Ann Appl Stat       Date:  2022-03-28       Impact factor: 1.959

8.  Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin.

Authors:  Katherine A Hoadley; Christina Yau; Denise M Wolf; Andrew D Cherniack; David Tamborero; Sam Ng; Max D M Leiserson; Beifang Niu; Michael D McLellan; Vladislav Uzunangelov; Jiashan Zhang; Cyriac Kandoth; Rehan Akbani; Hui Shen; Larsson Omberg; Andy Chu; Adam A Margolin; Laura J Van't Veer; Nuria Lopez-Bigas; Peter W Laird; Benjamin J Raphael; Li Ding; A Gordon Robertson; Lauren A Byers; Gordon B Mills; John N Weinstein; Carter Van Waes; Zhong Chen; Eric A Collisson; Christopher C Benz; Charles M Perou; Joshua M Stuart
Journal:  Cell       Date:  2014-08-07       Impact factor: 41.582

9.  Comprehensive molecular profiling of lung adenocarcinoma.

Authors: 
Journal:  Nature       Date:  2014-07-09       Impact factor: 49.962

10.  A pan-cancer proteomic perspective on The Cancer Genome Atlas.

Authors:  Rehan Akbani; Patrick Kwok Shing Ng; Henrica M J Werner; Maria Shahmoradgoli; Fan Zhang; Zhenlin Ju; Wenbin Liu; Ji-Yeon Yang; Kosuke Yoshihara; Jun Li; Shiyun Ling; Elena G Seviour; Prahlad T Ram; John D Minna; Lixia Diao; Pan Tong; John V Heymach; Steven M Hill; Frank Dondelinger; Nicolas Städler; Lauren A Byers; Funda Meric-Bernstam; John N Weinstein; Bradley M Broom; Roeland G W Verhaak; Han Liang; Sach Mukherjee; Yiling Lu; Gordon B Mills
Journal:  Nat Commun       Date:  2014-05-29       Impact factor: 14.919

View more
  3 in total

1.  Two-stage linked component analysis for joint decomposition of multiple biologically related data sets.

Authors:  Huan Chen; Brian Caffo; Genevieve Stein-O'Brien; Jinrui Liu; Ben Langmead; Carlo Colantuoni; Luo Xiao
Journal:  Biostatistics       Date:  2022-10-14       Impact factor: 5.279

2.  BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.

Authors:  Eric F Lock; Jun Young Park; Katherine A Hoadley
Journal:  Ann Appl Stat       Date:  2022-03-28       Impact factor: 1.959

3.  A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data.

Authors:  Sarah Samorodnitsky; Katherine A Hoadley; Eric F Lock
Journal:  BMC Bioinformatics       Date:  2022-06-17       Impact factor: 3.307

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.