Literature DB >> 28479862

Estimation of high dimensional mean regression in the absence of symmetry and light tail assumptions.

Jianqing Fan1, Quefeng Li1, Yuyan Wang1.   

Abstract

Data subject to heavy-tailed errors are commonly encountered in various scientific fields. To address this problem, procedures based on quantile regression and Least Absolute Deviation (LAD) regression have been developed in recent years. These methods essentially estimate the conditional median (or quantile) function. They can be very different from the conditional mean functions, especially when distributions are asymmetric and heteroscedastic. How can we efficiently estimate the mean regression functions in ultra-high dimensional setting with existence of only the second moment? To solve this problem, we propose a penalized Huber loss with diverging parameter to reduce biases created by the traditional Huber loss. Such a penalized robust approximate quadratic (RA-quadratic) loss will be called RA-Lasso. In the ultra-high dimensional setting, where the dimensionality can grow exponentially with the sample size, our results reveal that the RA-lasso estimator produces a consistent estimator at the same rate as the optimal rate under the light-tail situation. We further study the computational convergence of RA-Lasso and show that the composite gradient descent algorithm indeed produces a solution that admits the same optimal rate after sufficient iterations. As a byproduct, we also establish the concentration inequality for estimating population mean when there exists only the second moment. We compare RA-Lasso with other regularized robust estimators based on quantile regression and LAD regression. Extensive simulation studies demonstrate the satisfactory finite-sample performance of RA-Lasso.

Entities:  

Keywords:  High dimension; Huber loss; M-estimator; Optimal rate; Robust regularization

Year:  2016        PMID: 28479862      PMCID: PMC5412601          DOI: 10.1111/rssb.12166

Source DB:  PubMed          Journal:  J R Stat Soc Series B Stat Methodol        ISSN: 1369-7412            Impact factor:   4.488


  7 in total

1.  Non-Concave Penalized Likelihood with NP-Dimensionality.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  IEEE Trans Inf Theory       Date:  2011-08       Impact factor: 2.501

2.  Variance estimation using refitted cross-validation in ultrahigh dimensional regression.

Authors:  Jianqing Fan; Shaojun Guo; Ning Hao
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-01-01       Impact factor: 4.488

3.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

4.  ADAPTIVE ROBUST VARIABLE SELECTION.

Authors:  Jianqing Fan; Yingying Fan; Emre Barut
Journal:  Ann Stat       Date:  2014-02-01       Impact factor: 4.028

5.  Correlated z-values and the accuracy of large-scale statistical estimates.

Authors:  Bradley Efron
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

6.  Large Covariance Estimation by Thresholding Principal Orthogonal Complements.

Authors:  Jianqing Fan; Yuan Liao; Martina Mincheva
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2013-09-01       Impact factor: 4.488

7.  Activated TLR signaling in atherosclerosis among women with lower Framingham risk score: the multi-ethnic study of atherosclerosis.

Authors:  Chiang-Ching Huang; Kiang Liu; Richard M Pope; Pan Du; Simon Lin; Nalini M Rajamannan; Qi-Quan Huang; Nadereh Jafari; Gregory L Burke; Wendy Post; Karol E Watson; Craig Johnson; Martha L Daviglus; Donald M Lloyd-Jones
Journal:  PLoS One       Date:  2011-06-16       Impact factor: 3.240

  7 in total
  8 in total

1.  Integrative linear discriminant analysis with guaranteed error rate improvement.

Authors:  Quefeng Li; Lexin Li
Journal:  Biometrika       Date:  2018-10-22       Impact factor: 2.445

2.  LARGE COVARIANCE ESTIMATION THROUGH ELLIPTICAL FACTOR MODELS.

Authors:  Jianqing Fan; Han Liu; Weichen Wang
Journal:  Ann Stat       Date:  2018-06-27       Impact factor: 4.028

3.  Robust Covariance Estimation for Approximate Factor Models.

Authors:  Jianqing Fan; Weichen Wang; Yiqiao Zhong
Journal:  J Econom       Date:  2018-10-06       Impact factor: 2.388

4.  Robust high dimensional factor models with applications to statistical machine learning.

Authors:  Jianqing Fan; Kaizheng Wang; Yiqiao Zhong; Ziwei Zhu
Journal:  Stat Sci       Date:  2021-04-19       Impact factor: 2.901

5.  Adaptive Huber Regression on Markov-dependent Data.

Authors:  Jianqing Fan; Yongyi Guo; Bai Jiang
Journal:  Stoch Process Their Appl       Date:  2019-09-25       Impact factor: 1.430

6.  Robust estimation of high-dimensional covariance and precision matrices.

Authors:  Marco Avella-Medina; Heather S Battey; Jianqing Fan; Quefeng Li
Journal:  Biometrika       Date:  2018-03-27       Impact factor: 2.445

7.  A NEW PERSPECTIVE ON ROBUST M-ESTIMATION: FINITE SAMPLE THEORY AND APPLICATIONS TO DEPENDENCE-ADJUSTED MULTIPLE TESTING.

Authors:  Wen-Xin Zhou; Koushiki Bose; Jianqing Fan; Han Liu
Journal:  Ann Stat       Date:  2018-08-17       Impact factor: 4.028

8.  An l Eigenvector Perturbation Bound and Its Application to Robust Covariance Estimation.

Authors:  Jianqing Fan; Weichen Wang; Yiqiao Zhong
Journal:  J Mach Learn Res       Date:  2018-04       Impact factor: 3.654

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.