| Literature DB >> 32826524 |
Linda Nab1, Rolf H H Groenwold1,2, Maarten van Smeden1, Ruth H Keogh3.
Abstract
Observational data are increasingly used with the aim of estimating causal effects of treatments, through careful control for confounding. Marginal structural models estimated using inverse probability weighting (MSMs-IPW), like other methods to control for confounding, assume that confounding variables are measured without error. The average treatment effect in an MSM-IPW may however be biased when a confounding variable is error prone. Using the potential outcome framework, we derive expressions for the bias due to confounder misclassification in analyses that aim to estimate the average treatment effect using an marginal structural model estimated using inverse probability weighting (MSM-IPW). We compare this bias with the bias due to confounder misclassification in analyses based on a conditional regression model. Focus is on a point-treatment study with a continuous outcome. Compared with bias in the average treatment effect in a conditional model, the bias in an MSM-IPW can be different in magnitude but is equal in sign. Also, we use a simulation study to investigate the finite sample performance of MSM-IPW and conditional models when a confounding variable is misclassified. Simulation results indicate that confidence intervals of the treatment effect obtained from MSM-IPW are generally wider, and coverage of the true treatment effect is higher compared with a conditional model, ranging from overcoverage if there is no confounder misclassification to undercoverage when there is confounder misclassification. Further, we illustrate in a study of blood pressure-lowering therapy, how the bias expressions can be used to inform a quantitative bias analysis to study the impact of confounder misclassification, supported by an online tool.Entities:
Mesh:
Year: 2020 PMID: 32826524 PMCID: PMC7523582 DOI: 10.1097/EDE.0000000000001239
Source DB: PubMed Journal: Epidemiology ISSN: 1044-3983 Impact factor: 4.860
FIGURE 1.Measurement error in variable that confounds the association between treatment and outcome in two settings illustrated in directed acyclic graphs. A, Setting 1: Treatment is initiated based on the error-prone confounding variable . B, Setting 2: Treatment is initiated based on confounding variable .
FIGURE 2.Visualization of the direction and magnitude of the bias in the average treatment effect estimator in relation to the prevalence of treatment among individuals with the confounding variable present. In this visualization, the confounding variable is misclassified with a sensitivity of 0.9 and specificity of 0.95. Consequently, the average treatment effect estimated in an MSM-IPW or conditional regression model is biased, independent of true average treatment effect. The prevalence of is 50% (i.e., ). The direction and magnitude of the bias depend on (1) the strength and direction of the association between and treatment (denoted by and , here set at in the left-hand-side plot and in the right-hand-side plot); and (2) the strength and direction of the association between and the outcome (denoted by in the text and here set at ). Larger values of will result in steeper curves; will mirror the graph in .
FIGURE 4.Visualization of the magnitude of the bias in the average treatment effect estimator in relation to specificity and sensitivity of a misclassified confounding variable. In this visualization, the prevalence of the confounding variable is 50% (i.e., ), the association between and treatment (denoted by and ) and outcome is positive (denoted by in the text and here set at ). Given these values, if is misclassified, the average treatment effect estimated in an MSM-IPW or conditional regression model is biased, independent of true average treatment effect. The magnitude of the bias depends on the specificity and sensitivity of and is maximal if sensitivity equals 1 − specificity. The strength of the association between and treatment is greater in the right-hand-side plot () compared with the left-hand-side plot (), and consequently, bias is greater. Larger values of will result in steeper curves.
FIGURE 3.Visualization of the magnitude of the bias in the average treatment effect estimator in relation to the prevalence of a confounding variable. In this visualization, the confounding variable is misclassified with a sensitivity of 0.9 and specificity of 0.95. Consequently, the average treatment effect estimated in an MSM-IPW or conditional regression model is biased, independent of true average treatment effect. The confounding variable is positively associated with treatment (i.e., here , where and ), and outcome (denoted by in the text and here set at ). The magnitude of the bias depends on the prevalence of the confounding variable (i.e., ). Larger values of will result in steeper curves.
Values of the Parameters in the Five Different Simulation Scenarios
| Scenario | |||||||
|---|---|---|---|---|---|---|---|
| 0 | 0 | 1 | 0.50 | 0.50 | 0.75 | 1 | 2 |
| 1 | 0.05 | 0.90 | 0.50 | 0.90 | 0.45 | 1 | 2 |
| 2 | 0.05 | 0.90 | 0.80 | 0.25 | 0.75 | 1 | 2 |
| 3 | 0.05 | 0.90 | 0.80 | 0.50 | 0.75 | 1 | 2 |
| 4 | 0.05 | 0.90 | 0.45 | 0.50 | 0.75 | 1 | 2 |
Results of Simulation Study Studying the Finite Sample Properties of a marginal structural model estimated using inverse probability weighting (MSM-IPW) and a CM If There Is Classification Error in the Confounding Variable
| Method | Sample Size | Scenarioa | Bias (Formula)b | Bias | MSEc | Coverage | empSEd | modelSEe |
|---|---|---|---|---|---|---|---|---|
| MSM-IPW | 1,000 | 0 | 0.00 | 0.00 (0.001) | 0.00 (0.000) | 0.99 (0.001) | 0.07 (0.001) | 0.10 (0.000) |
| 1 | −0.42 | −0.42 (0.001) | 0.18 (0.001) | 0.03 (0.002) | 0.10 (0.001) | 0.11 (0.000) | ||
| 2 | 0.14 | 0.14 (0.001) | 0.03 (0.000) | 0.67 (0.007) | 0.08 (0.001) | 0.09 (0.000) | ||
| 3 | 0.29 | 0.29 (0.001) | 0.09 (0.001) | 0.08 (0.004) | 0.08 (0.001) | 0.09 (0.000) | ||
| 4 | 0.15 | 0.15 (0.001) | 0.03 (0.000) | 0.68 (0.007) | 0.08 (0.001) | 0.10 (0.000) | ||
| 100 | 0 | 0.00 | 0.00 (0.003) | 0.05 (0.001) | 0.99 (0.001) | 0.22 (0.002) | 0.31 (0.000) | |
| 1 | −0.42 | −0.42 (0.005) | 0.29 (0.005) | 0.78 (0.006) | 0.34 (0.003) | 0.37 (0.001) | ||
| 2 | 0.14 | 0.14 (0.004) | 0.08 (0.002) | 0.94 (0.003) | 0.25 (0.003) | 0.29 (0.000) | ||
| 3 | 0.29 | 0.29 (0.004) | 0.15 (0.002) | 0.84 (0.005) | 0.26 (0.003) | 0.28 (0.000) | ||
| 4 | 0.15 | 0.15 (0.004) | 0.08 (0.002) | 0.95 (0.003) | 0.25 (0.002) | 0.31 (0.000) | ||
| CM | 1,000 | 0 | 0.00 | 0.00 (0.001) | 0.00 (0.000) | 0.95 (0.003) | 0.07 (0.001) | 0.07 (0.000) |
| 1 | −0.34 | −0.34 (0.001) | 0.12 (0.001) | 0.02 (0.002) | 0.09 (0.001) | 0.08 (0.000) | ||
| 2 | 0.16 | 0.16 (0.001) | 0.03 (0.000) | 0.46 (0.007) | 0.08 (0.001) | 0.08 (0.000) | ||
| 3 | 0.32 | 0.32 (0.001) | 0.11 (0.001) | 0.02 (0.002) | 0.08 (0.001) | 0.08 (0.000) | ||
| 4 | 0.15 | 0.15 (0.001) | 0.03 (0.000) | 0.49 (0.007) | 0.08 (0.001) | 0.07 (0.000) | ||
| 100 | 0 | 0.00 | 0.00 (0.003) | 0.05 (0.001) | 0.95 (0.003) | 0.22 (0.002) | 0.22 (0.000) | |
| 1 | −0.34 | −0.33 (0.004) | 0.19 (0.003) | 0.73 (0.006) | 0.29 (0.003) | 0.27 (0.000) | ||
| 2 | 0.16 | 0.16 (0.004) | 0.09 (0.002) | 0.90 (0.004) | 0.25 (0.003) | 0.25 (0.000) | ||
| 3 | 0.32 | 0.32 (0.004) | 0.17 (0.003) | 0.74 (0.006) | 0.26 (0.003) | 0.25 (0.000) | ||
| 4 | 0.15 | 0.15 (0.003) | 0.08 (0.002) | 0.90 (0.004) | 0.24 (0.002) | 0.24 (0.000) |
In all scenarios, the average treatment effect (estimand) is 1 () and the effect of the confounding variable on the outcome is 2 (). Five thousand data sets were generated. Monte Carlo standard errors are shown between brackets. In scenario 0, there is no classification error (specificity and sensitivity of the misclassified confounding variable are 1, i.e., and ). In scenarios 1–4, the specificity of the misclassified confounding variable is 0.95 (i.e., ) and the sensitivity is 0.9 (i.e., ). The prevalence of the confounding variable () and the probability of receiving treatment if the confounding is not present or present ( and , respectively) are set as follows in the scenarios: scenario 0: , , ; scenario 1: , , ; scenario 2: , , ; scenario 3: , , ; and scenario 4: , , .
Bias based on bias expressions (equations 3 and 4) in the text.
Mean squared error.
Empirical standard error.
Model-based standard error.
Average Treatment Effect of Diuretics Use Compared with Beta Blocker Use on Mean Systolic Blood Pressure in NHANES[36,37]
| Model | Effect Size (CI) |
|---|---|
| Unadjusted | −4.03 (−6.30, −1.76) |
| Marginal structural modela | −3.52 (−1.21, −5.74) |
| Conditional modelb | −3.48 (−1.27, −5.76) |
Estimated in a marginal structural model, by inverse weighting with the propensity for diuretic or beta blocker use given self-reported categorized body mass index (BMI).
Estimated in a conditional regression model with adjustment for self-reported categorical BMI.
FIGURE 5.Density of predicted bias due to classification error in self-reported BMI category in NHANES.[36,37] Bias in the average treatment effect of diuretics use compared with beta blocker use on mean systolic blood pressure by inverse weighting with the propensity for diuretic or beta blocker use given self-reported categorical BMI (MSM-IPW), and using a conditional linear regression with adjustment for self-reported categorical BMI. The specificity and sensitivity of self-reported BMI category range from 0.90 to 0.98 and are sampled from a uniform distribution, trapezoidal (with modes on one third and two third), and symmetrical triangular distribution.