Literature DB >> 16646820

Asymptotic optimality of likelihood-based cross-validation.

Mark J van der Laan1, Sandrine Dudoit, Sunduz Keles.   

Abstract

Likelihood-based cross-validation is a statistical tool for selecting a density estimate based on n i.i.d. observations from the true density among a collection of candidate density estimators. General examples are the selection of a model indexing a maximum likelihood estimator, and the selection of a bandwidth indexing a nonparametric (e.g. kernel) density estimator. In this article, we establish a finite sample result for a general class of likelihood-based cross-validation procedures (as indexed by the type of sample splitting used, e.g. V-fold cross-validation). This result implies that the cross-validation selector performs asymptotically as well (w.r.t. to the Kullback-Leibler distance to the true density) as a benchmark model selector which is optimal for each given dataset and depends on the true density. Crucial conditions of our theorem are that the size of the validation sample converges to infinity, which excludes leave-one-out cross-validation, and that the candidate density estimates are bounded away from zero and infinity. We illustrate these asymptotic results and the practical performance of likelihood-based cross-validation for the purpose of bandwidth selection with a simulation study. Moreover, we use likelihood-based cross-validation in the context of regulatory motif detection in DNA sequences.

Year:  2004        PMID: 16646820     DOI: 10.2202/1544-6115.1036

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  22 in total

1.  Simple, efficient estimators of treatment effects in randomized trials using generalized linear models to leverage baseline variables.

Authors:  Michael Rosenblum; Mark J van der Laan
Journal:  Int J Biostat       Date:  2010-04-01       Impact factor: 0.968

2.  Collaborative double robust targeted maximum likelihood estimation.

Authors:  Mark J van der Laan; Susan Gruber
Journal:  Int J Biostat       Date:  2010-05-17       Impact factor: 0.968

3.  Robust extraction of covariate information to improve estimation efficiency in randomized trials.

Authors:  Kelly L Moore; Romain Neugebauer; Thamban Valappil; Mark J Laan
Journal:  Stat Med       Date:  2011-07-12       Impact factor: 2.373

4.  Cross-Validated Bagged Learning.

Authors:  Maya L Petersen; Annette M Molinaro; Sandra E Sinisi; Mark J van der Laan
Journal:  J Multivar Anal       Date:  2008-03       Impact factor: 1.473

5.  Semiparametric Estimation of the Impacts of Longitudinal Interventions on Adolescent Obesity using Targeted Maximum-Likelihood: Accessible Estimation with the ltmle Package.

Authors:  Anna L Decker; Alan Hubbard; Catherine M Crespi; Edmund Y W Seto; May C Wang
Journal:  J Causal Inference       Date:  2014-03

6.  Targeted maximum likelihood estimation of effect modification parameters in survival analysis.

Authors:  Ori M Stitelman; C William Wester; Victor De Gruttola; Mark J van der Laan
Journal:  Int J Biostat       Date:  2011-03-30       Impact factor: 0.968

7.  Generalized Score Functions for Causal Discovery.

Authors:  Biwei Huang; Kun Zhang; Yizhu Lin; Bernhard Schölkopf; Clark Glymour
Journal:  KDD       Date:  2018-08

8.  Estimating and Testing Vaccine Sieve Effects Using Machine Learning.

Authors:  David Benkeser; Peter B Gilbert; Marco Carone
Journal:  J Am Stat Assoc       Date:  2019-04-03       Impact factor: 5.033

9.  Super Learner Analysis of Electronic Adherence Data Improves Viral Prediction and May Provide Strategies for Selective HIV RNA Monitoring.

Authors:  Maya L Petersen; Erin LeDell; Joshua Schwab; Varada Sarovar; Robert Gross; Nancy Reynolds; Jessica E Haberer; Kathy Goggin; Carol Golin; Julia Arnsten; Marc I Rosen; Robert H Remien; David Etoori; Ira B Wilson; Jane M Simoni; Judith A Erlen; Mark J van der Laan; Honghu Liu; David R Bangsberg
Journal:  J Acquir Immune Defic Syndr       Date:  2015-05-01       Impact factor: 3.731

10.  Targeted maximum likelihood estimation for prediction calibration.

Authors:  Jordan Brooks; Mark J van der Laan; Alan S Go
Journal:  Int J Biostat       Date:  2012-10-31       Impact factor: 0.968

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.