Variable selection and estimation in generalized linear models with the seamless L0 penalty.

Zilin Li, Sijian Wang, Xihong Lin.

Abstract

In this paper, we propose variable selection and estimation in generalized linear models using the seamless L0 (SELO) penalized likelihood approach. The SELO penalty is a smooth function that very closely resembles the discontinuous L0 penalty. We develop an efficient algorithm to fit the model, and show that the SELO-GLM procedure has the oracle property in the presence of a diverging number of variables. We propose a Bayesian Information Criterion (BIC) to select the tuning parameter. We show that under some regularity conditions, the proposed SELO-GLM/BIC procedure consistently selects the true model. We perform simulation studies to evaluate the finite sample performance of the proposed methods. Our simulation studies show that the proposed SELO-GLM procedure has a better finite sample performance than several existing methods, especially when the number of variables is large and the signals are weak. We apply the SELO-GLM to analyze a breast cancer genetic dataset to identify the SNPs that are associated with breast cancer risk.
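The abstract does not state the penalty's formula. As a rough illustration of how a smooth function can mimic the L0 penalty, the sketch below assumes the form commonly given for SELO in the literature, p(β) = (λ / log 2) · log(|β| / (|β| + τ) + 1), and compares it with the L0 penalty λ·I(β ≠ 0). The function names and the default values of `lam` and `tau` are illustrative choices, not taken from the paper.

```python
import math


def selo_penalty(beta, lam=1.0, tau=0.01):
    """Assumed SELO form: (lam / log 2) * log(|beta| / (|beta| + tau) + 1).

    Continuous in beta, equal to 0 at beta = 0, and approaching lam
    as |beta| grows relative to tau.
    """
    a = abs(beta)
    return (lam / math.log(2.0)) * math.log(a / (a + tau) + 1.0)


def l0_penalty(beta, lam=1.0):
    """Discontinuous L0 penalty: lam if beta is nonzero, else 0."""
    return lam if beta != 0 else 0.0


if __name__ == "__main__":
    # Print both penalties side by side for a few coefficient values.
    for b in (0.0, 0.001, 0.01, 0.1, 1.0):
        print(f"beta={b:<6} SELO={selo_penalty(b):.4f}  L0={l0_penalty(b):.1f}")
```

Under this assumed form, a smaller `tau` makes the curve hug the L0 step more tightly while remaining continuous at zero.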

Keywords:  BIC; Consistency; Coordinate descent algorithm; Model selection; Oracle property; Penalized likelihood methods; SELO penalty; Tuning parameter selection

Year:  2012        PMID: 23519603      PMCID: PMC3600656          DOI: 10.1002/cjs.11165

Source DB:  PubMed          Journal:  Can J Stat        ISSN: 0319-5724            Impact factor:   0.875


  4 in total

1.  A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer.

Authors:  David J Hunter; Peter Kraft; Kevin B Jacobs; David G Cox; Meredith Yeager; Susan E Hankinson; Sholom Wacholder; Zhaoming Wang; Robert Welch; Amy Hutchinson; Junwen Wang; Kai Yu; Nilanjan Chatterjee; Nick Orr; Walter C Willett; Graham A Colditz; Regina G Ziegler; Christine D Berg; Saundra S Buys; Catherine A McCarty; Heather Spencer Feigelson; Eugenia E Calle; Michael J Thun; Richard B Hayes; Margaret Tucker; Daniela S Gerhard; Joseph F Fraumeni; Robert N Hoover; Gilles Thomas; Stephen J Chanock
Journal:  Nat Genet       Date:  2007-05-27       Impact factor: 38.330

2.  Tuning parameter selectors for the smoothly clipped absolute deviation method.

Authors:  Hansheng Wang; Runze Li; Chih-Ling Tsai
Journal:  Biometrika       Date:  2007-08-01       Impact factor: 2.445

3.  Regularization Parameter Selections via Generalized Information Criterion.

Authors:  Yiyun Zhang; Runze Li; Chih-Ling Tsai
Journal:  J Am Stat Assoc       Date:  2010-03-01       Impact factor: 5.033

4.  ON THE ADAPTIVE ELASTIC-NET WITH A DIVERGING NUMBER OF PARAMETERS.

Authors:  Hui Zou; Hao Helen Zhang
Journal:  Ann Stat       Date:  2009       Impact factor: 4.028

  5 in total

1.  Simultaneous supervised clustering and feature selection over a graph.

Authors:  Xiaotong Shen; Hsin-Cheng Huang; Wei Pan
Journal:  Biometrika       Date:  2012-10-18       Impact factor: 2.445

2.  Efficient ℓ0 -norm feature selection based on augmented and penalized minimization.

Authors:  Xiang Li; Shanghong Xie; Donglin Zeng; Yuanjia Wang
Journal:  Stat Med       Date:  2017-10-30       Impact factor: 2.373

3.  Modeling Pregnancy Outcomes through Sequentially Nested Regression Models.

Authors:  Xuan Bi; Long Feng; Cai Li; Heping Zhang
Journal:  J Am Stat Assoc       Date:  2022-01-05       Impact factor: 4.369

4.  Constrained Ordination Analysis with Enrichment of Bell-Shaped Response Functions.

Authors:  Yingjie Zhang; Olivier Thas
Journal:  PLoS One       Date:  2016-04-21       Impact factor: 3.240

5.  Model-free feature screening for categorical outcomes: Nonlinear effect detection and false discovery rate control.

Authors:  Qingyang Zhang; Yuchun Du
Journal:  PLoS One       Date:  2019-05-31       Impact factor: 3.240

