Prediction and Variable Selection in High-Dimensional Misspecified Binary Classification.

Konrad Furmańczyk, Wojciech Rejchel.

Abstract

In this paper, we consider prediction and variable selection in misspecified binary classification models in the high-dimensional setting. We focus on two approaches to classification that are computationally efficient but lead to model misspecification. The first is to apply penalized logistic regression to classification data that possibly do not follow the logistic model. The second method is even more radical: we simply treat the class labels of objects as if they were numbers and apply penalized linear regression. We thoroughly investigate these two approaches and provide conditions that guarantee that they are successful in prediction and variable selection. Our results hold even if the number of predictors is much larger than the sample size. The paper concludes with experimental results.
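The second approach in the abstract (penalized linear regression applied to class labels treated as numbers) can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and is not taken from the paper: the penalty is the Lasso, the solver is plain cyclic coordinate descent, the labels are coded as -1/+1 and generated from a probit-type model with two relevant features, and the value of `lam` is picked by hand. The function names `soft_threshold`, `lasso_cd`, and `simulate` are hypothetical.

```python
import random


def soft_threshold(z, lam):
    """Soft-thresholding: closed-form solution of the one-dimensional Lasso problem."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def lasso_cd(cols, y, lam, max_sweeps=100, tol=1e-6):
    """Minimize (1/2n)||y - Xb||^2 + lam * ||b||_1 by cyclic coordinate descent.

    cols stores X column-wise: cols[j][i] is feature j of observation i.
    """
    n, p = len(y), len(cols)
    col_sq = [sum(v * v for v in col) / n for col in cols]
    b = [0.0] * p
    r = list(y)  # residual y - Xb; equals y because b starts at zero
    for _ in range(max_sweeps):
        max_change = 0.0
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # constant-zero column carries no information
            col = cols[j]
            # Correlation of feature j with the partial residual (feature j excluded).
            rho = sum(x * ri for x, ri in zip(col, r)) / n + col_sq[j] * b[j]
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_bj - b[j]
            if delta != 0.0:
                for i in range(n):
                    r[i] -= delta * col[i]
                b[j] = new_bj
                max_change = max(max_change, abs(delta))
        if max_change < tol:
            break
    return b


def simulate(n, p, seed=0):
    """Toy data (assumption): Gaussian features, probit-type labels in {-1, +1}.

    Only features 0 and 1 influence the label, so the linear model fitted
    below is misspecified by construction.
    """
    rng = random.Random(seed)
    cols = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(p)]
    y = []
    for i in range(n):
        score = 2.0 * cols[0][i] - 2.0 * cols[1][i] + rng.gauss(0.0, 1.0)
        y.append(1.0 if score > 0 else -1.0)
    return cols, y


n, p, lam = 300, 400, 0.3  # more predictors than observations; lam chosen by hand
cols, y = simulate(n, p)
y_mean = sum(y) / n
b_hat = lasso_cd(cols, [yi - y_mean for yi in y], lam)  # labels treated as numbers

selected = [j for j, bj in enumerate(b_hat) if bj != 0.0]
fitted = [y_mean + sum(b_hat[j] * cols[j][i] for j in selected) for i in range(n)]
accuracy = sum((f > 0) == (yi > 0) for f, yi in zip(fitted, y)) / n
print("selected features:", selected)
print("training accuracy:", round(accuracy, 3))
```

Variable selection here means reading off the nonzero coordinates of `b_hat`, and prediction means classifying by the sign of the fitted value. In practice the penalty level would be tuned (for example by cross-validation) rather than fixed, and the accuracy printed above is an in-sample figure only.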

Keywords:  misclassification risk; model misspecification; penalized estimation; supervised classification; variable selection consistency

Year:  2020        PMID: 33286314      PMCID: PMC7517038          DOI: 10.3390/e22050543

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.524


  5 in total

1.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space".

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

2.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

3.  Regularized Quantile Regression and Robust Feature Screening for Single Index Models.

Authors:  Wei Zhong; Liping Zhu; Runze Li; Hengjian Cui
Journal:  Stat Sin       Date:  2016-01       Impact factor: 1.261

4.  Estimation and Selection via Absolute Penalized Convex Minimization And Its Multistage Adaptive Applications.

Authors:  Jian Huang; Cun-Hui Zhang
Journal:  J Mach Learn Res       Date:  2012-06-01       Impact factor: 3.654

5.  ORACLE INEQUALITIES FOR THE LASSO IN THE COX MODEL.

Authors:  Jian Huang; Tingni Sun; Zhiliang Ying; Yi Yu; Cun-Hui Zhang
Journal:  Ann Stat       Date:  2013-06-01       Impact factor: 4.028

  1 in total

1.  Nonparametric Statistical Inference with an Emphasis on Information-Theoretic Methods.

Authors:  Jan Mielniczuk
Journal:  Entropy (Basel)       Date:  2022-04-15       Impact factor: 2.524

