Literature DB >> 33328272

A polynomial algorithm for best-subset selection problem.

Junxian Zhu1, Canhong Wen2, Jin Zhu1, Heping Zhang3, Xueqin Wang4.   

Abstract

Best-subset selection aims to find a small subset of predictors, so that the resulting linear model is expected to have the most desirable prediction accuracy. It is not only important and imperative in regression analysis but also has far-reaching applications in every facet of research, including computer science and medicine. We introduce a polynomial algorithm, which, under mild conditions, solves the problem. This algorithm exploits the idea of sequencing and splicing to reach a stable solution in finite steps when the sparsity level of the model is fixed but unknown. We define an information criterion that helps the algorithm select the true sparsity level with a high probability. We show that when the algorithm produces a stable optimal solution, that solution is the oracle estimator of the true parameters with probability one. We also demonstrate the power of the algorithm in several numerical studies.

Keywords:  best-subset selection; high dimensional; splicing

Mesh:

Year:  2020        PMID: 33328272      PMCID: PMC7777147          DOI: 10.1073/pnas.2014241117

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  4 in total

1.  Regularization Parameter Selections via Generalized Information Criterion.

Authors:  Yiyun Zhang; Runze Li; Chih-Ling Tsai
Journal:  J Am Stat Assoc       Date:  2010-03-01       Impact factor: 5.033

2.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.

Authors:  Patrick Breheny; Jian Huang
Journal:  Ann Appl Stat       Date:  2011-01-01       Impact factor: 2.083

3.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

4.  CALIBRATING NON-CONVEX PENALIZED REGRESSION IN ULTRA-HIGH DIMENSION.

Authors:  Lan Wang; Yongdai Kim; Runze Li
Journal:  Ann Stat       Date:  2013-10-01       Impact factor: 4.028

  4 in total
  3 in total

1.  Fast Sparse Classification for Generalized Linear and Additive Models.

Authors:  Jiachang Liu; Chudi Zhong; Margo Seltzer; Cynthia Rudin
Journal:  Proc Mach Learn Res       Date:  2022-03

2.  Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects.

Authors:  Haoran Xue; Xiaotong Shen; Wei Pan
Journal:  Am J Hum Genet       Date:  2021-07-01       Impact factor: 11.043

3.  A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study.

Authors:  Maoxuan Miao; Jinran Wu; Fengjing Cai; You-Gan Wang
Journal:  Animals (Basel)       Date:  2022-01-15       Impact factor: 2.752

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.