Literature DB >> 26980459

Sequential BART for imputation of missing covariates.

Dandan Xu1, Michael J Daniels2, Almut G Winterstein3.   

Abstract

To conduct comparative effectiveness research using electronic health records (EHR), many covariates are typically needed to adjust for selection and confounding biases. Unfortunately, it is typical to have missingness in these covariates. Just using cases with complete covariates will result in considerable efficiency losses and likely bias. Here, we consider the covariates missing at random with missing data mechanism either depending on the response or not. Standard methods for multiple imputation can either fail to capture nonlinear relationships or suffer from the incompatibility and uncongeniality issues. We explore a flexible Bayesian nonparametric approach to impute the missing covariates, which involves factoring the joint distribution of the covariates with missingness into a set of sequential conditionals and applying Bayesian additive regression trees to model each of these univariate conditionals. Using data augmentation, the posterior for each conditional can be sampled simultaneously. We provide details on the computational algorithm and make comparisons to other methods, including parametric sequential imputation and two versions of multiple imputation by chained equations. We illustrate the proposed approach on EHR data from an affiliated tertiary care institution to examine factors related to hyperglycemia.
© The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Bayesian additive regression trees; Congenial models; Multiple imputation

Mesh:

Year:  2016        PMID: 26980459      PMCID: PMC4915613          DOI: 10.1093/biostatistics/kxw009

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  4 in total

1.  Multiple imputation for missing data via sequential regression trees.

Authors:  Lane F Burgette; Jerome P Reiter
Journal:  Am J Epidemiol       Date:  2010-09-14       Impact factor: 4.897

2.  Using the outcome for imputation of missing predictor values was preferred.

Authors:  Karel G M Moons; Rogier A R T Donders; Theo Stijnen; Frank E Harrell
Journal:  J Clin Epidemiol       Date:  2006-06-19       Impact factor: 6.437

3.  Semiparametric models for missing covariate and response data in regression models.

Authors:  Qingxia Chen; Joseph G Ibrahim
Journal:  Biometrics       Date:  2006-03       Impact factor: 2.571

4.  Fully Bayesian inference under ignorable missingness in the presence of auxiliary covariates.

Authors:  M J Daniels; C Wang; B H Marcus
Journal:  Biometrics       Date:  2013-12-10       Impact factor: 2.571

  4 in total
  11 in total

1.  Bayesian additive regression trees and the General BART model.

Authors:  Yaoyuan Vincent Tan; Jason Roy
Journal:  Stat Med       Date:  2019-08-28       Impact factor: 2.373

2.  Discussion of PENCOMP.

Authors:  Joseph Antonelli; Michael J Daniels
Journal:  J Am Stat Assoc       Date:  2019-04-19       Impact factor: 5.033

3.  Nonparametric competing risks analysis using Bayesian Additive Regression Trees.

Authors:  Rodney Sparapani; Brent R Logan; Robert E McCulloch; Purushottam W Laud
Journal:  Stat Methods Med Res       Date:  2019-01-07       Impact factor: 3.021

4.  Elucidating age and sex-dependent association between frontal EEG asymmetry and depression: An application of multiple imputation in functional regression.

Authors:  Adam Ciarleglio; Eva Petkova; Ofer Harel
Journal:  J Am Stat Assoc       Date:  2021-07-26       Impact factor: 5.033

5.  ROBUST INFERENCE WHEN COMBINING INVERSE-PROBABILITY WEIGHTING AND MULTIPLE IMPUTATION TO ADDRESS MISSING DATA WITH APPLICATION TO AN ELECTRONIC HEALTH RECORDS-BASED STUDY OF BARIATRIC SURGERY.

Authors:  Tanayott Thaweethai; David E Arterburn; Karen J Coleman; Sebastien Haneuse
Journal:  Ann Appl Stat       Date:  2021-03       Impact factor: 1.959

6.  Bayesian Approaches for Missing Not at Random Outcome Data: The Role of Identifying Restrictions.

Authors:  Antonio R Linero; Michael J Daniels
Journal:  Stat Sci       Date:  2018-05-03       Impact factor: 2.901

7.  Synthetic data method to incorporate external information into a current study.

Authors:  Tian Gu; Jeremy M G Taylor; Wenting Cheng; Bhramar Mukherjee
Journal:  Can J Stat       Date:  2019-06-26       Impact factor: 0.875

8.  Bayesian semi-parametric G-computation for causal inference in a cohort study with MNAR dropout and death.

Authors:  Maria Josefsson; Michael J Daniels
Journal:  J R Stat Soc Ser C Appl Stat       Date:  2021-01-06       Impact factor: 1.864

9.  Non-parametric recurrent events analysis with BART and an application to the hospital admissions of patients with diabetes.

Authors:  Rodney A Sparapani; Lisa E Rein; Sergey S Tarima; Tourette A Jackson; John R Meurer
Journal:  Biostatistics       Date:  2020-01-01       Impact factor: 5.279

10.  On the Relation between Prediction and Imputation Accuracy under Missing Covariates.

Authors:  Burim Ramosaj; Justus Tulowietzki; Markus Pauly
Journal:  Entropy (Basel)       Date:  2022-03-09       Impact factor: 2.524

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.