Literature DB >> 21116443

Model selection procedure for high-dimensional data.

Yongli Zhang1, Xiaotong Shen.   

Abstract

For high-dimensional regression, the number of predictors may greatly exceed the sample size but only a small fraction of them are related to the response. Therefore, variable selection is inevitable, where consistent model selection is the primary concern. However, conventional consistent model selection criteria like BIC may be inadequate due to their nonadaptivity to the model space and infeasibility of exhaustive search. To address these two issues, we establish a probability lower bound of selecting the smallest true model by an information criterion, based on which we propose a model selection criterion, what we call RIC(c), which adapts to the model space. Furthermore, we develop a computationally feasible method combining the computational power of least angle regression (LAR) with of RIC(c). Both theoretical and simulation studies show that this method identifies the smallest true model with probability converging to one if the smallest true model is selected by LAR. The proposed method is applied to real data from the power market and outperforms the backward variable selection in terms of price forecasting accuracy.

Entities:  

Year:  2010        PMID: 21116443      PMCID: PMC2992390          DOI: 10.1002/sam.10088

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.051


  4 in total

1.  A novel joint sparse partial correlation method for estimating group functional networks.

Authors:  Xiaoyun Liang; Alan Connelly; Fernando Calamante
Journal:  Hum Brain Mapp       Date:  2015-12-21       Impact factor: 5.038

2.  Data integration with high dimensionality.

Authors:  Xin Gao; Raymond J Carroll
Journal:  Biometrika       Date:  2017-05-09       Impact factor: 2.445

3.  Adaptive Modeling Procedure Selection by Data Perturbation.

Authors:  Yongli Zhang; Xiaotong Shen
Journal:  J Bus Econ Stat       Date:  2014-10-02       Impact factor: 6.565

4.  Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects.

Authors:  Haoran Xue; Xiaotong Shen; Wei Pan
Journal:  Am J Hum Genet       Date:  2021-07-01       Impact factor: 11.043

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.