Literature DB >> 19255599

Cross-Validated Bagged Learning.

Maya L Petersen1, Annette M Molinaro, Sandra E Sinisi, Mark J van der Laan.   

Abstract

Many applications aim to learn a high dimensional parameter of a data generating distribution based on a sample of independent and identically distributed observations. For example, the goal might be to estimate the conditional mean of an outcome given a list of input variables. In this prediction context, bootstrap aggregating (bagging) has been introduced as a method to reduce the variance of a given estimator at little cost to bias. Bagging involves applying an estimator to multiple bootstrap samples, and averaging the result across bootstrap samples. In order to address the curse of dimensionality, a common practice has been to apply bagging to estimators which themselves use cross-validation, thereby using cross-validation within a bootstrap sample to select fine-tuning parameters trading off bias and variance of the bootstrap sample-specific candidate estimators. In this article we point out that in order to achieve the correct bias variance trade-off for the parameter of interest, one should apply the cross-validation selector externally to candidate bagged estimators indexed by these fine-tuning parameters. We use three simulations to compare the new cross-validated bagging method with bagging of cross-validated estimators and bagging of non-cross-validated estimators.

Year:  2008        PMID: 19255599      PMCID: PMC2367370          DOI: 10.1016/j.jmva.2007.07.004

Source DB:  PubMed          Journal:  J Multivar Anal        ISSN: 0047-259X            Impact factor:   1.473


  4 in total

1.  Asymptotic optimality of likelihood-based cross-validation.

Authors:  Mark J van der Laan; Sandrine Dudoit; Sunduz Keles
Journal:  Stat Appl Genet Mol Biol       Date:  2004-03-22

2.  Multiple testing and data adaptive regression: an application to HIV-1 sequence data.

Authors:  Merrill D Birkner; Sandra E Sinisi; Mark J van der Laan
Journal:  Stat Appl Genet Mol Biol       Date:  2005-04-18

3.  Deletion/substitution/addition algorithm in learning with applications in genomics.

Authors:  Sandra E Sinisi; Mark J van der Laan
Journal:  Stat Appl Genet Mol Biol       Date:  2004-08-12

4.  Super learning: an application to the prediction of HIV-1 drug resistance.

Authors:  Sandra E Sinisi; Eric C Polley; Maya L Petersen; Soo-Yon Rhee; Mark J van der Laan
Journal:  Stat Appl Genet Mol Biol       Date:  2007-02-23
  4 in total
  3 in total

1.  Machine learning-based prediction of motor status in glioma patients using diffusion MRI metrics along the corticospinal tract.

Authors:  Boshra Shams; Ziqian Wang; Timo Roine; Dogu Baran Aydogan; Peter Vajkoczy; Christoph Lippert; Thomas Picht; Lucius S Fekonja
Journal:  Brain Commun       Date:  2022-05-27

2.  Interplay between components of pupil-linked phasic arousal and its role in driving behavioral choice in Go/No-Go perceptual decision-making.

Authors:  Brian J Schriver; Sean M Perkins; Paul Sajda; Qi Wang
Journal:  Psychophysiology       Date:  2020-03-30       Impact factor: 4.016

3.  SRIQ clustering: A fusion of Random Forest, QT clustering, and KNN concepts.

Authors:  Jacob Karlström; Mattias Aine; Johan Staaf; Srinivas Veerla
Journal:  Comput Struct Biotechnol J       Date:  2022-04-04       Impact factor: 6.155

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.