
On robust regression with high-dimensional predictors.

Noureddine El Karoui, Derek Bean, Peter J Bickel, Chinghway Lim, Bin Yu.

Abstract

We study regression M-estimates in the setting where p, the number of covariates, and n, the number of observations, are both large, but p ≤ n. We find an exact stochastic representation for the distribution of $\hat{\beta} = \operatorname{argmin}_{\beta \in \mathbb{R}^p} \sum_{i=1}^{n} \rho(Y_i - X_i'\beta)$ at fixed p and n under various assumptions on the objective function ρ and our statistical model. A scalar random variable whose deterministic limit $r_\rho(\kappa)$ can be studied when p/n → κ > 0 plays a central role in this representation. We discover a nonlinear system of two deterministic equations that characterizes $r_\rho(\kappa)$. Interestingly, the system shows that $r_\rho(\kappa)$ depends on ρ through proximal mappings of ρ as well as various aspects of the statistical model underlying our study. Several surprising results emerge. In particular, we show that, when p/n is large enough, least squares becomes preferable to least absolute deviations for double-exponential errors.
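
The abstract refers to proximal mappings of ρ without defining them. As a rough illustration only, the sketch below assumes the standard definition, prox_c(ρ)(x) = argmin over y of [c·ρ(y) + (x − y)²/2], which may differ in notation from the convention used in the paper. It covers the two losses the abstract compares: least squares (ρ(y) = y²/2) and least absolute deviations (ρ(y) = |y|). The function names are hypothetical and not taken from the paper.

```python
from typing import Callable


def prox_square(x: float, c: float) -> float:
    """Prox of c * y^2 / 2 (least squares): linear shrinkage x / (1 + c)."""
    return x / (1.0 + c)


def prox_abs(x: float, c: float) -> float:
    """Prox of c * |y| (least absolute deviations): soft-thresholding at c."""
    if x > c:
        return x - c
    if x < -c:
        return x + c
    return 0.0


def prox_numeric(rho: Callable[[float], float], x: float, c: float,
                 lo: float = -10.0, hi: float = 10.0, iters: int = 200) -> float:
    """Approximate the prox of a convex rho by ternary search.

    Assumes the minimizer of c * rho(y) + (x - y)^2 / 2 lies in [lo, hi].
    The objective is strictly convex, so ternary search converges to it.
    """
    def objective(y: float) -> float:
        return c * rho(y) + 0.5 * (x - y) ** 2

    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if objective(m1) < objective(m2):
            hi = m2  # minimizer cannot lie to the right of m2
        else:
            lo = m1  # minimizer cannot lie to the left of m1
    return 0.5 * (lo + hi)
```

The closed forms can be checked against the numerical version, for example by comparing `prox_abs(3.0, 1.0)` with `prox_numeric(abs, 3.0, 1.0)`. The two losses shrink their input differently (proportionally versus by a fixed amount), which is one way to see why the choice of ρ can matter in the system characterizing $r_\rho(\kappa)$.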

Keywords:  concentration of measure; high-dimensional statistics; prox function

Year:  2013        PMID: 23954908      PMCID: PMC3767561          DOI: 10.1073/pnas.1307842110

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205



1.  Optimal M-estimation in high-dimensional regression.

Authors:  Derek Bean; Peter J Bickel; Noureddine El Karoui; Bin Yu
Journal:  Proc Natl Acad Sci U S A       Date:  2013-08-16       Impact factor: 11.205

2.  A modern maximum-likelihood theory for high-dimensional logistic regression.

Authors:  Pragya Sur; Emmanuel J Candès
Journal:  Proc Natl Acad Sci U S A       Date:  2019-07-01       Impact factor: 11.205

3.  Nonuniformity of P-values Can Occur Early in Diverging Dimensions.

Authors:  Yingying Fan; Emre Demirkaya; Jinchi Lv
Journal:  J Mach Learn Res       Date:  2019       Impact factor: 5.177
