Literature DB >> 30856279

MEBoost: Variable selection in the presence of measurement error.

Ben Brown1, Timothy Weaver2, Julian Wolfson3.   

Abstract

We present a novel method for variable selection in regression models when covariates are measured with error. The iterative algorithm we propose, Measurement Error Boosting (MEBoost), follows a path defined by estimating equations that correct for covariate measurement error. We illustrate the use of MEBoost in practice by analyzing data from the Box Lunch Study, a clinical trial in nutrition where several variables are based on self-report and, hence, measured with error, where we are interested in performing model selection from a large data set to select variables that are related to the number of times a subject binge ate in the last 28 days. Furthermore, we evaluated our method and compared its performance to the recently proposed Convex Conditioned Lasso and to the "naive" Lasso, which does not correct for measurement error through a simulation study. Increasing the degree of measurement error increased prediction error and decreased the probability of accurate covariate selection, but this loss of accuracy occurred to a lesser degree when using MEBoost. Through simulations, we also make a case for the consistency of the model selected.
© 2019 John Wiley & Sons, Ltd.

Keywords:  boosting; high-dimensional data; machine learning; measurement error; variable selection

Mesh:

Year:  2019        PMID: 30856279     DOI: 10.1002/sim.8130

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  2 in total

1.  8DEstablishment and validation of a hypoxia-related signature predicting prognosis in hepatocellular carcinoma.

Authors:  Congbo Cai; Lei Yang; Kena Zhou
Journal:  BMC Gastroenterol       Date:  2021-12-12       Impact factor: 3.067

2.  Block coordinate descent algorithm improves variable selection and estimation in error-in-variables regression.

Authors:  Célia Escribe; Tianyuan Lu; Julyan Keller-Baruch; Vincenzo Forgetta; Bowei Xiao; J Brent Richards; Sahir Bhatnagar; Karim Oualkacha; Celia M T Greenwood
Journal:  Genet Epidemiol       Date:  2021-09-01       Impact factor: 2.344

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.