Literature DB >> 29023235

A Generally Efficient Targeted Minimum Loss Based Estimator based on the Highly Adaptive Lasso.

Mark van der Laan1.   

Abstract

Suppose we observe n $n$ independent and identically distributed observations of a finite dimensional bounded random variable. This article is concerned with the construction of an efficient targeted minimum loss-based estimator (TMLE) of a pathwise differentiable target parameter of the data distribution based on a realistic statistical model. The only smoothness condition we will enforce on the statistical model is that the nuisance parameters of the data distribution that are needed to evaluate the canonical gradient of the pathwise derivative of the target parameter are multivariate real valued cadlag functions (right-continuous and left-hand limits, (G. Neuhaus. On weak convergence of stochastic processes with multidimensional time parameter. Ann Stat 1971;42:1285-1295.) and have a finite supremum and (sectional) variation norm. Each nuisance parameter is defined as a minimizer of the expectation of a loss function over over all functions it its parameter space. For each nuisance parameter, we propose a new minimum loss based estimator that minimizes the loss-specific empirical risk over the functions in its parameter space under the additional constraint that the variation norm of the function is bounded by a set constant. The constant is selected with cross-validation. We show such an MLE can be represented as the minimizer of the empirical risk over linear combinations of indicator basis functions under the constraint that the sum of the absolute value of the coefficients is bounded by the constant: i.e., the variation norm corresponds with this L1 $L_1$-norm of the vector of coefficients. We will refer to this estimator as the highly adaptive Lasso (HAL)-estimator. We prove that for all models the HAL-estimator converges to the true nuisance parameter value at a rate that is faster than n-1/4 $n^{-1/4}$ w.r.t. square-root of the loss-based dissimilarity. We also show that if this HAL-estimator is included in the library of an ensemble super-learner, then the super-learner will at minimal achieve the rate of convergence of the HAL, but, by previous results, it will actually be asymptotically equivalent with the oracle (i.e., in some sense best) estimator in the library. Subsequently, we establish that a one-step TMLE using such a super-learner as initial estimator for each of the nuisance parameters is asymptotically efficient at any data generating distribution in the model, under weak structural conditions on the target parameter mapping and model and a strong positivity assumption (e.g., the canonical gradient is uniformly bounded). We demonstrate our general theorem by constructing such a one-step TMLE of the average causal effect in a nonparametric model, and establishing that it is asymptotically efficient.

Entities:  

Keywords:  Donsker class; asymptotic linear estimator; canonical gradient; cross-validated targeted minimum loss estimation (CV-TMLE); efficient estimator; efficient influence curve; empirical process; entropy; highly adaptive Lasso; influence curve; one-step TMLE; super-learning; targeted minimum loss estimation (TMLE)

Mesh:

Year:  2017        PMID: 29023235      PMCID: PMC6054860          DOI: 10.1515/ijb-2015-0097

Source DB:  PubMed          Journal:  Int J Biostat        ISSN: 1557-4679            Impact factor:   0.968


  13 in total

1.  A local maximal inequality under uniform entropy.

Authors:  Aad van der Vaart; Jon A Wellner
Journal:  Electron J Stat       Date:  2011       Impact factor: 1.125

2.  Asymptotic optimality of likelihood-based cross-validation.

Authors:  Mark J van der Laan; Sandrine Dudoit; Sunduz Keles
Journal:  Stat Appl Genet Mol Biol       Date:  2004-03-22

3.  Doubly robust estimation in missing data and causal inference models.

Authors:  Heejung Bang; James M Robins
Journal:  Biometrics       Date:  2005-12       Impact factor: 2.571

4.  Super learner.

Authors:  Mark J van der Laan; Eric C Polley; Alan E Hubbard
Journal:  Stat Appl Genet Mol Biol       Date:  2007-09-16

5.  Causal effect models for realistic individualized treatment and intention to treat rules.

Authors:  Mark J van der Laan; Maya L Petersen
Journal:  Int J Biostat       Date:  2007       Impact factor: 0.968

6.  Sensitivity analysis for causal inference under unmeasured confounding and measurement error problems.

Authors:  Iván Díaz; Mark J van der Laan
Journal:  Int J Biostat       Date:  2013-11-19       Impact factor: 0.968

7.  An application of collaborative targeted maximum likelihood estimation in causal inference and genomics.

Authors:  Susan Gruber; Mark J van der Laan
Journal:  Int J Biostat       Date:  2010-05-17       Impact factor: 0.968

8.  The Highly Adaptive Lasso Estimator.

Authors:  David Benkeser; Mark van der Laan
Journal:  Proc Int Conf Data Sci Adv Anal       Date:  2016-12-26

9.  ADAPTIVE MATCHING IN RANDOMIZED TRIALS AND OBSERVATIONAL STUDIES.

Authors:  Mark J van der Laan; Laura B Balzer; Maya L Petersen
Journal:  J Stat Res       Date:  2012-12-01

10.  One-Step Targeted Minimum Loss-based Estimation Based on Universal Least Favorable One-Dimensional Submodels.

Authors:  Mark van der Laan; Susan Gruber
Journal:  Int J Biostat       Date:  2016-05-01       Impact factor: 0.968

View more
  6 in total

1.  Improved small-sample estimation of nonlinear cross-validated prediction metrics.

Authors:  David Benkeser; Maya Petersen; Mark J van der Laan
Journal:  J Am Stat Assoc       Date:  2019-10-21       Impact factor: 5.033

2.  Methodological considerations when analysing and interpreting real-world data.

Authors:  Til Stürmer; Tiansheng Wang; Yvonne M Golightly; Alex Keil; Jennifer L Lund; Michele Jonsson Funk
Journal:  Rheumatology (Oxford)       Date:  2020-01-01       Impact factor: 7.580

3.  An alternative robust estimator of average treatment effect in causal inference.

Authors:  Jianxuan Liu; Yanyuan Ma; Lan Wang
Journal:  Biometrics       Date:  2018-02-13       Impact factor: 2.571

4.  The Highly Adaptive Lasso Estimator.

Authors:  David Benkeser; Mark van der Laan
Journal:  Proc Int Conf Data Sci Adv Anal       Date:  2016-12-26

5.  Efficient nonparametric inference on the effects of stochastic interventions under two-phase sampling, with applications to vaccine efficacy trials.

Authors:  Nima S Hejazi; Mark J van der Laan; Holly E Janes; Peter B Gilbert; David C Benkeser
Journal:  Biometrics       Date:  2020-09-28       Impact factor: 2.571

6.  Efficiently transporting causal direct and indirect effects to new populations under intermediate confounding and with multiple mediators.

Authors:  Kara E Rudolph; Iván Díaz
Journal:  Biostatistics       Date:  2022-07-18       Impact factor: 5.279

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.