Literature DB >> 27227727

Second-Order Inference for the Mean of a Variable Missing at Random.

Iván Díaz, Marco Carone, Mark J van der Laan.   

Abstract

We present a second-order estimator of the mean of a variable subject to missingness, under the missing at random assumption. The estimator improves upon existing methods by using an approximate second-order expansion of the parameter functional, in addition to the first-order expansion employed by standard doubly robust methods. This results in weaker assumptions about the convergence rates necessary to establish consistency, local efficiency, and asymptotic linearity. The general estimation strategy is developed under the targeted minimum loss-based estimation (TMLE) framework. We present a simulation comparing the sensitivity of the first and second-order estimators to the convergence rate of the initial estimators of the outcome regression and missingness score. In our simulation, the second-order TMLE always had a coverage probability equal or closer to the nominal value 0.95, compared to its first-order counterpart. In the best-case scenario, the proposed second-order TMLE had a coverage probability of 0.86 when the first-order TMLE had a coverage probability of zero. We also present a novel first-order estimator inspired by a second-order expansion of the parameter functional. This estimator only requires one-dimensional smoothing, whereas implementation of the second-order TMLE generally requires kernel smoothing on the covariate space. The first-order estimator proposed is expected to have improved finite sample performance compared to existing first-order estimators. In the best-case scenario of our simulation study, the novel first-order TMLE improved the coverage probability from 0 to 0.90. We provide an illustration of our methods using a publicly available dataset to determine the effect of an anticoagulant on health outcomes of patients undergoing percutaneous coronary intervention. We provide R code implementing the proposed estimator.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27227727     DOI: 10.1515/ijb-2015-0031

Source DB:  PubMed          Journal:  Int J Biostat        ISSN: 1557-4679            Impact factor:   0.968


  3 in total

1.  One-Step Targeted Minimum Loss-based Estimation Based on Universal Least Favorable One-Dimensional Submodels.

Authors:  Mark van der Laan; Susan Gruber
Journal:  Int J Biostat       Date:  2016-05-01       Impact factor: 0.968

Review 2.  Stacked generalization: an introduction to super learning.

Authors:  Ashley I Naimi; Laura B Balzer
Journal:  Eur J Epidemiol       Date:  2018-04-10       Impact factor: 8.082

3.  Machine learning as a strategy to account for dietary synergy: an illustration based on dietary intake and adverse pregnancy outcomes.

Authors:  Lisa M Bodnar; Abigail R Cartus; Sharon I Kirkpatrick; Katherine P Himes; Edward H Kennedy; Hyagriv N Simhan; William A Grobman; Jennifer Y Duffy; Robert M Silver; Samuel Parry; Ashley I Naimi
Journal:  Am J Clin Nutr       Date:  2020-06-01       Impact factor: 8.472

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.