Literature DB >> 29930437

Embracing the Blessing of Dimensionality in Factor Models.

Quefeng Li1, Guang Cheng2, Jianqing Fan3,4, Yuyan Wang3.   

Abstract

Factor modeling is an essential tool for exploring intrinsic dependence structures among high-dimensional random variables. Much progress has been made for estimating the covariance matrix from a high-dimensional factor model. However, the blessing of dimensionality has not yet been fully embraced in the literature: much of the available data are often ignored in constructing covariance matrix estimates. If our goal is to accurately estimate a covariance matrix of a set of targeted variables, shall we employ additional data, which are beyond the variables of interest, in the estimation? In this article, we provide sufficient conditions for an affirmative answer, and further quantify its gain in terms of Fisher information and convergence rate. In fact, even an oracle-like result (as if all the factors were known) can be achieved when a sufficiently large number of variables is used. The idea of using data as much as possible brings computational challenges. A divide-and-conquer algorithm is thus proposed to alleviate the computational burden, and also shown not to sacrifice any statistical accuracy in comparison with a pooled analysis. Simulation studies further confirm our advocacy for the use of full data, and demonstrate the effectiveness of the above algorithm. Our proposal is applied to a microarray data example that shows empirical benefits of using more data. Supplementary materials for this article are available online.

Entities:  

Keywords:  Asymptotic normality; Auxiliary data; Divide-and-conquer; Factor model; Fisher information; High-dimensionality

Year:  2017        PMID: 29930437      PMCID: PMC6005696          DOI: 10.1080/01621459.2016.1256815

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  7 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  HIGH DIMENSIONAL COVARIANCE MATRIX ESTIMATION IN APPROXIMATE FACTOR MODELS.

Authors:  Jianqing Fan; Yuan Liao; Martina Mincheva
Journal:  Ann Stat       Date:  2011-01-01       Impact factor: 4.028

3.  TTC7B emerges as a novel risk factor for ischemic stroke through the convergence of several genome-wide approaches.

Authors:  Tiago Krug; João Paulo Gabriel; Ricardo Taipa; Benedita V Fonseca; Sophie Domingues-Montanari; Israel Fernandez-Cadenas; Helena Manso; Liliana O Gouveia; João Sobral; Isabel Albergaria; Gisela Gaspar; Jordi Jiménez-Conde; Raquel Rabionet; José M Ferro; Joan Montaner; Astrid M Vicente; Mário Rui Silva; Ilda Matos; Gabriela Lopes; Sofia A Oliveira
Journal:  J Cereb Blood Flow Metab       Date:  2012-03-28       Impact factor: 6.200

4.  Sparsistency and Rates of Convergence in Large Covariance Matrix Estimation.

Authors:  Clifford Lam; Jianqing Fan
Journal:  Ann Stat       Date:  2009       Impact factor: 4.028

5.  High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics.

Authors:  Carlos M Carvalho; Jeffrey Chang; Joseph E Lucas; Joseph R Nevins; Quanli Wang; Mike West
Journal:  J Am Stat Assoc       Date:  2008-12-01       Impact factor: 5.033

6.  Large Covariance Estimation by Thresholding Principal Orthogonal Complements.

Authors:  Jianqing Fan; Yuan Liao; Martina Mincheva
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2013-09-01       Impact factor: 4.488

7.  A factor analysis model for functional genomics.

Authors:  Rafal Kustra; Romy Shioda; Mu Zhu
Journal:  BMC Bioinformatics       Date:  2006-04-21       Impact factor: 3.169

  7 in total
  2 in total

Review 1.  High-Dimensional Brain in a High-Dimensional World: Blessing of Dimensionality.

Authors:  Alexander N Gorban; Valery A Makarov; Ivan Y Tyukin
Journal:  Entropy (Basel)       Date:  2020-01-09       Impact factor: 2.524

2.  Association of Screening and Brief Intervention With Substance Use in Massachusetts Middle and High Schools.

Authors:  Sharon Levy; Lauren E Wisk; Machiko Minegishi; Benjamin Ertman; Julie Lunstead; Melissa Brogna; Elissa R Weitzman
Journal:  JAMA Netw Open       Date:  2022-08-01
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.