
Sparse estimation of a covariance matrix.

Jacob Bien, Robert J Tibshirani.

Abstract

We suggest a method for estimating a covariance matrix on the basis of a sample of vectors drawn from a multivariate normal distribution. In particular, we penalize the likelihood with a lasso penalty on the entries of the covariance matrix. This penalty plays two important roles: it reduces the effective number of parameters, which is important even when the dimension of the vectors is smaller than the sample size since the number of parameters grows quadratically in the number of variables, and it produces an estimate which is sparse. In contrast to sparse inverse covariance estimation, our method's close relative, the sparsity attained here is in the covariance matrix itself rather than in the inverse matrix. Zeros in the covariance matrix correspond to marginal independencies; thus, our method performs model selection while providing a positive definite estimate of the covariance. The proposed penalized maximum likelihood problem is not convex, so we use a majorize-minimize approach in which we iteratively solve convex approximations to the original nonconvex problem. We discuss tuning parameter selection and demonstrate on a flow-cytometry dataset how our method produces an interpretable graphical display of the relationship between variables. We perform simulations that suggest that simple elementwise thresholding of the empirical covariance matrix is competitive with our method for identifying the sparsity structure. Additionally, we show how our method can be used to solve a previously studied special case in which a desired sparsity pattern is prespecified.
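The abstract reports that simple elementwise thresholding of the empirical covariance matrix is competitive with the penalized-likelihood method for recovering the sparsity pattern. A minimal sketch of that baseline follows; it is not the authors' majorize-minimize procedure. The function name, the choice of soft (rather than hard) thresholding, the decision to leave the diagonal untouched, and the example data are all assumptions made for illustration.

```python
import numpy as np

def soft_threshold_offdiag(S, lam):
    """Soft-threshold the off-diagonal entries of a covariance matrix S.

    Hypothetical helper: entries with |S_ij| <= lam become zero, the rest
    shrink toward zero by lam. Variances (the diagonal) are left unchanged.
    """
    S = np.asarray(S, dtype=float)
    T = np.sign(S) * np.maximum(np.abs(S) - lam, 0.0)
    np.fill_diagonal(T, np.diag(S))  # restore the unpenalized variances
    return T

# Illustrative data: three variables, the third independent of the others.
rng = np.random.default_rng(0)
true_cov = np.array([[1.0, 0.6, 0.0],
                     [0.6, 1.0, 0.0],
                     [0.0, 0.0, 1.0]])
X = rng.multivariate_normal(np.zeros(3), true_cov, size=500)

S = np.cov(X, rowvar=False)            # empirical covariance (rows = samples)
sigma_hat = soft_threshold_offdiag(S, lam=0.2)

# Thresholding alone does not guarantee positive definiteness, so check it.
min_eig = np.linalg.eigvalsh(sigma_hat).min()
print(sigma_hat)
print("smallest eigenvalue:", min_eig)
```

Unlike the penalized-likelihood estimate described in the abstract, a thresholded matrix can fail to be positive definite, which is why the sketch inspects the smallest eigenvalue before the result is used as a covariance.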

Year:  2011        PMID: 23049130      PMCID: PMC3413177          DOI: 10.1093/biomet/asr054

Source DB:  PubMed          Journal:  Biometrika        ISSN: 0006-3444            Impact factor:   2.445


References:  6 in total

1.  Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks.

Authors:  A J Butte; P Tamayo; D Slonim; T R Golub; I S Kohane
Journal:  Proc Natl Acad Sci U S A       Date:  2000-10-24       Impact factor: 11.205

2.  The concave-convex procedure.

Authors:  A L Yuille; Anand Rangarajan
Journal:  Neural Comput       Date:  2003-04       Impact factor: 2.026

3.  Causal protein-signaling networks derived from multiparameter single-cell data.

Authors:  Karen Sachs; Omar Perez; Dana Pe'er; Douglas A Lauffenburger; Garry P Nolan
Journal:  Science       Date:  2005-04-22       Impact factor: 47.728

4.  Sparse inverse covariance estimation with the graphical lasso.

Authors:  Jerome Friedman; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2007-12-12       Impact factor: 5.899

5.  Variable Selection using MM Algorithms.

Authors:  David R Hunter; Runze Li
Journal:  Ann Stat       Date:  2005       Impact factor: 4.028

6.  Sparsistency and Rates of Convergence in Large Covariance Matrix Estimation.

Authors:  Clifford Lam; Jianqing Fan
Journal:  Ann Stat       Date:  2009       Impact factor: 4.028

Cited by:  27 in total

1.  The immune system as a biomonitor: explorations in innate and adaptive immunity.

Authors:  Niclas Thomas; James Heather; Gabriel Pollara; Nandi Simpson; Theres Matjeka; John Shawe-Taylor; Mahdad Noursadeghi; Benjamin Chain
Journal:  Interface Focus       Date:  2013-04-06       Impact factor: 3.906

2.  Sparse Covariance Matrix Estimation With Eigenvalue Constraints.

Authors:  Han Liu; Lie Wang; Tuo Zhao
Journal:  J Comput Graph Stat       Date:  2014-04       Impact factor: 2.302

3.  Learning Graphical Models With Hubs.

Authors:  Kean Ming Tan; Palma London; Karthik Mohan; Su-In Lee; Maryam Fazel; Daniela Witten
Journal:  J Mach Learn Res       Date:  2014-10       Impact factor: 3.654

4.  Multitask Quantile Regression under the Transnormal Model.

Authors:  Jianqing Fan; Lingzhou Xue; Hui Zou
Journal:  J Am Stat Assoc       Date:  2017-01-05       Impact factor: 5.033

5.  MM Algorithms For Variance Components Models.

Authors:  Hua Zhou; Liuyi Hu; Jin Zhou; Kenneth Lange
Journal:  J Comput Graph Stat       Date:  2019-03-09       Impact factor: 2.302

6.  A Dynamic Bayesian Model for Characterizing Cross-Neuronal Interactions During Decision-Making.

Authors:  Bo Zhou; David E Moorman; Sam Behseta; Hernando Ombao; Babak Shahbaba
Journal:  J Am Stat Assoc       Date:  2016-08-18       Impact factor: 5.033

7.  Estimating and Identifying Unspecified Correlation Structure for Longitudinal Data.

Authors:  Jianhua Hu; Peng Wang; Annie Qu
Journal:  J Comput Graph Stat       Date:  2015-04-01       Impact factor: 2.302

8.  Knockoff boosted tree for model-free variable selection.

Authors:  Tao Jiang; Yuanyuan Li; Alison A Motsinger-Reif
Journal:  Bioinformatics       Date:  2021-05-17       Impact factor: 6.937

9.  Estimating Large Correlation Matrices for International Migration.

Authors:  Jonathan J Azose; Adrian E Raftery
Journal:  Ann Appl Stat       Date:  2018-07-28       Impact factor: 2.083

10.  A Brief Survey of Modern Optimization for Statisticians.

Authors:  Kenneth Lange; Eric C Chi; Hua Zhou
Journal:  Int Stat Rev       Date:  2014-04-01       Impact factor: 2.217
