| Literature DB >> 31797368 |
David Cheng1, Abhishek Chakrabortty2, Ashwin N Ananthakrishnan3, Tianxi Cai4.
Abstract
We consider estimating average treatment effects (ATE) of a binary treatment in observational data when data-driven variable selection is needed to select relevant covariates from a moderately large number of available covariates X . To leverage covariates among X predictive of the outcome for efficiency gain while using regularization to fit a parametric propensity score (PS) model, we consider a dimension reduction of X based on fitting both working PS and outcome models using adaptive LASSO. A novel PS estimator, the Double-index Propensity Score (DiPS), is proposed, in which the treatment status is smoothed over the linear predictors for X from both the initial working models. The ATE is estimated by using the DiPS in a normalized inverse probability weighting estimator, which is found to maintain double robustness and also local semiparametric efficiency with a fixed number of covariates p. Under misspecification of working models, the smoothing step leads to gains in efficiency and robustness over traditional doubly robust estimators. These results are extended to the case where p diverges with sample size and working models are sparse. Simulations show the benefits of the approach in finite samples. We illustrate the method by estimating the ATE of statins on colorectal cancer risk in an electronic medical record study and the effect of smoking on C-reactive protein in the Framingham Offspring Study.Entities:
Keywords: causal inference; double-robustness; electronic medical records; kernel smoothing; regularization; semiparametric efficiency
Mesh:
Year: 2019 PMID: 31797368 PMCID: PMC7370895 DOI: 10.1111/biom.13195
Source DB: PubMed Journal: Biometrics ISSN: 0006-341X Impact factor: 2.571