Literature DB >> 28042188

Structured Matrix Completion with Applications to Genomic Data Integration.

Tianxi Cai1, T Tony Cai2, Anru Zhang3.   

Abstract

Matrix completion has attracted significant recent attention in many fields including statistics, applied mathematics and electrical engineering. Current literature on matrix completion focuses primarily on independent sampling models under which the individual observed entries are sampled independently. Motivated by applications in genomic data integration, we propose a new framework of structured matrix completion (SMC) to treat structured missingness by design. Specifically, our proposed method aims at efficient matrix recovery when a subset of the rows and columns of an approximately low-rank matrix are observed. We provide theoretical justification for the proposed SMC method and derive lower bound for the estimation errors, which together establish the optimal rate of recovery over certain classes of approximately low-rank matrices. Simulation studies show that the method performs well in finite sample under a variety of configurations. The method is applied to integrate several ovarian cancer genomic studies with different extent of genomic measurements, which enables us to construct more accurate prediction rules for ovarian cancer survival.

Entities:  

Keywords:  Constrained minimization; genomic data integration; low-rank matrix; matrix completion; singular value decomposition; structured matrix completion

Year:  2016        PMID: 28042188      PMCID: PMC5198844          DOI: 10.1080/01621459.2015.1021005

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  17 in total

1.  Recovering the missing components in a large noisy low-rank matrix: application to SFM.

Authors:  Pei Chen; David Suter
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2004-08       Impact factor: 6.226

2.  A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase.

Authors:  Paul Scheet; Matthew Stephens
Journal:  Am J Hum Genet       Date:  2006-02-17       Impact factor: 11.025

3.  Methods to impute missing genotypes for population data.

Authors:  Zhaoxia Yu; Daniel J Schaid
Journal:  Hum Genet       Date:  2007-09-13       Impact factor: 4.132

4.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2009-02-05       Impact factor: 11.025

Review 5.  Ovarian cancer: epidemiology, biology, and prognostic factors.

Authors:  C H Holschneider; J S Berek
Journal:  Semin Surg Oncol       Date:  2000 Jul-Aug

6.  Patterns of gene expression that characterize long-term survival in advanced stage serous ovarian cancers.

Authors:  Andrew Berchuck; Edwin S Iversen; Johnathan M Lancaster; Jennifer Pittman; Jingqin Luo; Paula Lee; Susan Murphy; Holly K Dressman; Phillip G Febbo; Mike West; Joseph R Nevins; Jeffrey R Marks
Journal:  Clin Cancer Res       Date:  2005-05-15       Impact factor: 12.531

7.  An integrated genomic-based approach to individualized treatment of patients with advanced-stage ovarian cancer.

Authors:  Holly K Dressman; Andrew Berchuck; Gina Chan; Jun Zhai; Andrea Bild; Robyn Sayer; Janiel Cragun; Jennifer Clarke; Regina S Whitaker; Lihua Li; Jonathan Gray; Jeffrey Marks; Geoffrey S Ginsburg; Anil Potti; Mike West; Joseph R Nevins; Johnathan M Lancaster
Journal:  J Clin Oncol       Date:  2007-02-10       Impact factor: 44.544

8.  A prognostic gene expression index in ovarian cancer - validation across different independent data sets.

Authors:  Carsten Denkert; Jan Budczies; Silvia Darb-Esfahani; Balazs Györffy; Jalid Sehouli; Dominique Könsgen; Robert Zeillinger; Wilko Weichert; Aurelia Noske; Ann-Christin Buckendahl; Berit M Müller; Manfred Dietel; Hermann Lage
Journal:  J Pathol       Date:  2009-06       Impact factor: 7.996

9.  Integrated genomic analyses of ovarian carcinoma.

Authors: 
Journal:  Nature       Date:  2011-06-29       Impact factor: 49.962

10.  Missing value estimation for DNA microarray gene expression data by Support Vector Regression imputation and orthogonal coding scheme.

Authors:  Xian Wang; Ao Li; Zhaohui Jiang; Huanqing Feng
Journal:  BMC Bioinformatics       Date:  2006-01-22       Impact factor: 3.169

View more
  5 in total

1.  Integrative linear discriminant analysis with guaranteed error rate improvement.

Authors:  Quefeng Li; Lexin Li
Journal:  Biometrika       Date:  2018-10-22       Impact factor: 2.445

2.  Simultaneous Covariance Inference for Multimodal Integrative Analysis.

Authors:  Yin Xia; Lexin Li; Samuel N Lockhart; William J Jagust
Journal:  J Am Stat Assoc       Date:  2019-06-28       Impact factor: 5.033

3.  A general framework for integrative analysis of incomplete multiomics data.

Authors:  Dan-Yu Lin; Donglin Zeng; David Couper
Journal:  Genet Epidemiol       Date:  2020-07-21       Impact factor: 2.135

4.  High-dimensional integrative copula discriminant analysis for multiomics data.

Authors:  Yong He; Hao Chen; Hao Sun; Jiadong Ji; Yufeng Shi; Xinsheng Zhang; Lei Liu
Journal:  Stat Med       Date:  2020-10-15       Impact factor: 2.373

5.  Strengthening Causal Inference in Exposomics Research: Application of Genetic Data and Methods.

Authors:  Christy L Avery; Annie Green Howard; Anna F Ballou; Victoria L Buchanan; Jason M Collins; Carolina G Downie; Stephanie M Engel; Mariaelisa Graff; Heather M Highland; Moa P Lee; Adam G Lilly; Kun Lu; Julia E Rager; Brooke S Staley; Kari E North; Penny Gordon-Larsen
Journal:  Environ Health Perspect       Date:  2022-05-09       Impact factor: 11.035

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.