Literature DB >> 26640319

Adaptive Modeling Procedure Selection by Data Perturbation.

Yongli Zhang1, Xiaotong Shen2.   

Abstract

Many procedures have been developed to deal with the high-dimensional problem that is emerging in various business and economics areas. To evaluate and compare these procedures, modeling uncertainty caused by model selection and parameter estimation has to be assessed and integrated into a modeling process. To do this, a data perturbation method estimates the modeling uncertainty inherited in a selection process by perturbing the data. Critical to data perturbation is the size of perturbation, as the perturbed data should resemble the original dataset. To account for the modeling uncertainty, we derive the optimal size of perturbation, which adapts to the data, the model space, and other relevant factors in the context of linear regression. On this basis, we develop an adaptive data-perturbation method that, unlike its nonadaptive counterpart, performs well in different situations. This leads to a data-adaptive model selection method. Both theoretical and numerical analysis suggest that the data-adaptive model selection method adapts to distinct situations in that it yields consistent model selection and optimal prediction, without knowing which situation exists a priori. The proposed method is applied to real data from the commodity market and outperforms its competitors in terms of price forecasting accuracy.

Entities:  

Keywords:  Adaptive model selection; high-dimensional data analysis; modeling uncertainty

Year:  2014        PMID: 26640319      PMCID: PMC4668003          DOI: 10.1080/07350015.2014.965307

Source DB:  PubMed          Journal:  J Bus Econ Stat        ISSN: 0735-0015            Impact factor:   6.565


  4 in total

1.  Likelihood-based selection and sharp parameter estimation.

Authors:  Xiaotong Shen; Wei Pan; Yunzhang Zhu
Journal:  J Am Stat Assoc       Date:  2012-06-11       Impact factor: 5.033

2.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.

Authors:  Patrick Breheny; Jian Huang
Journal:  Ann Appl Stat       Date:  2011-01-01       Impact factor: 2.083

3.  Model selection procedure for high-dimensional data.

Authors:  Yongli Zhang; Xiaotong Shen
Journal:  Stat Anal Data Min       Date:  2010-10-01       Impact factor: 1.051

4.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.