| Literature DB >> 35706573 |
Camillo Cammarota1, Alessandro Pinto2.
Abstract
In prediction problems both response and covariates may have high correlation with a second group of influential regressors, that can be considered as background variables. An important challenge is to perform variable selection and importance assessment among the covariates in the presence of these variables. A clinical example is the prediction of the lean body mass (response) from bioimpedance (covariates), where anthropometric measures play the role of background variables. We introduce a reduced dataset in which the variables are defined as the residuals with respect to the background, and perform variable selection and importance assessment both in linear and random forest models. Using a clinical dataset of multi-frequency bioimpedance, we show the effectiveness of this method to select the most relevant predictors of the lean body mass beyond anthropometry.Entities:
Keywords: Variable selection; anthropometric variables; bioimpedance; importance; lean body mass; linear model; multi-frequency; random forests
Year: 2020 PMID: 35706573 PMCID: PMC9042145 DOI: 10.1080/02664763.2020.1763930
Source DB: PubMed Journal: J Appl Stat ISSN: 0266-4763 Impact factor: 1.416