Literature DB >> 26877823

TRANSPOSABLE REGULARIZED COVARIANCE MODELS WITH AN APPLICATION TO MISSING DATA IMPUTATION.

Genevera I Allen1, Robert Tibshirani2.   

Abstract

Missing data estimation is an important challenge with high-dimensional data arranged in the form of a matrix. Typically this data matrix is transposable, meaning that either the rows, columns or both can be treated as features. To model transposable data, we present a modification of the matrix-variate normal, the mean-restricted matrix-variate normal, in which the rows and columns each have a separate mean vector and covariance matrix. By placing additive penalties on the inverse covariance matrices of the rows and columns, these so called transposable regularized covariance models allow for maximum likelihood estimation of the mean and non-singular covariance matrices. Using these models, we formulate EM-type algorithms for missing data imputation in both the multivariate and transposable frameworks. We present theoretical results exploiting the structure of our transposable models that allow these models and imputation methods to be applied to high-dimensional data. Simulations and results on microarray data and the Netflix data show that these imputation techniques often outperform existing methods and offer a greater degree of flexibility.

Entities:  

Keywords:  EM algorithm; covariance estimation; imputation; matrix-variate normal; transposable data

Year:  2010        PMID: 26877823      PMCID: PMC4751046          DOI: 10.1214/09-AOAS314

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  6 in total

1.  Missing value estimation methods for DNA microarrays.

Authors:  O Troyanskaya; M Cantor; G Sherlock; P Brown; T Hastie; R Tibshirani; D Botstein; R B Altman
Journal:  Bioinformatics       Date:  2001-06       Impact factor: 6.937

2.  Missing value estimation for DNA microarray gene expression data: local least squares imputation.

Authors:  Hyunsoo Kim; Gene H Golub; Haesun Park
Journal:  Bioinformatics       Date:  2004-08-27       Impact factor: 6.937

3.  Sparse inverse covariance estimation with the graphical lasso.

Authors:  Jerome Friedman; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2007-12-12       Impact factor: 5.899

4.  Are a set of microarrays independent of each other?

Authors:  Bradley Efron
Journal:  Ann Appl Stat       Date:  2009-01-01       Impact factor: 2.083

5.  Covariance-regularized regression and classification for high-dimensional problems.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2009-02-20       Impact factor: 4.488

6.  Gene expression profiling predicts survival in conventional renal cell carcinoma.

Authors:  Hongjuan Zhao; Börje Ljungberg; Kjell Grankvist; Torgny Rasmuson; Robert Tibshirani; James D Brooks
Journal:  PLoS Med       Date:  2005-12-06       Impact factor: 11.069

  6 in total
  6 in total

1.  Model Selection and Estimation in the Matrix Normal Graphical Model.

Authors:  Jianxin Yin; Hongzhe Li
Journal:  J Multivar Anal       Date:  2012-05-01       Impact factor: 1.473

2.  Foundational Principles for Large-Scale Inference: Illustrations Through Correlation Mining.

Authors:  Alfred O Hero; Bala Rajaratnam
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2015-12-21       Impact factor: 10.961

3.  Testing for nodal dependence in relational data matrices.

Authors:  Alexander Volfovsky; Peter D Hoff
Journal:  J Am Stat Assoc       Date:  2015-11-07       Impact factor: 5.033

4.  Inference with Transposable Data: Modeling the Effects of Row and Column Correlations.

Authors:  Genevera I Allen; Robert Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-03-16       Impact factor: 4.488

5.  Inferring Phenotypic Trait Evolution on Large Trees With Many Incomplete Measurements.

Authors:  Gabriel Hassler; Max R Tolkoff; William L Allen; Lam Si Tung Ho; Philippe Lemey; Marc A Suchard
Journal:  J Am Stat Assoc       Date:  2020-09-16       Impact factor: 4.369

6.  A multiple-phenotype imputation method for genetic studies.

Authors:  Andrew Dahl; Valentina Iotchkova; Amelie Baud; Åsa Johansson; Ulf Gyllensten; Nicole Soranzo; Richard Mott; Andreas Kranis; Jonathan Marchini
Journal:  Nat Genet       Date:  2016-02-22       Impact factor: 38.330

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.