| Literature DB >> 32684529 |
Tomohiro Shinozaki1, Etsuji Suzuki2.
Abstract
Epidemiologists are increasingly encountering complex longitudinal data, in which exposures and their confounders vary during follow-up. When a prior exposure affects the confounders of the subsequent exposures, estimating the effects of the time-varying exposures requires special statistical techniques, possibly with structural (ie, counterfactual) models for targeted effects, even if all confounders are accurately measured. Among the methods used to estimate such effects, which can be cast as a marginal structural model in a straightforward way, one popular approach is inverse probability weighting. Despite the seemingly intuitive theory and easy-to-implement software, misunderstandings (or "pitfalls") remain. For example, one may mistakenly equate marginal structural models with inverse probability weighting, failing to distinguish a marginal structural model encoding the causal parameters of interest from a nuisance model for exposure probability, and thereby failing to separate the problems of variable selection and model specification for these distinct models. Assuming the causal parameters of interest are identified given the study design and measurements, we provide a step-by-step illustration of generalized computation of standardization (called the g-formula) and inverse probability weighting, as well as the specification of marginal structural models, particularly for time-varying exposures. We use a novel hypothetical example, which allows us access to typically hidden potential outcomes. This illustration provides steppingstones (or "tips") to understand more concretely the estimation of the effects of complex time-varying exposures.Entities:
Keywords: causal inference; g-formula; inverse probability weighting; marginal structural model; time-varying exposure
Mesh:
Year: 2020 PMID: 32684529 PMCID: PMC7429147 DOI: 10.2188/jea.JE20200226
Source DB: PubMed Journal: J Epidemiol ISSN: 0917-5040 Impact factor: 3.211
Hypothetical cohort data with potential outcomes under point-exposure
| Stratum | Potential outcomea | Observed outcome | ||||||
| Risk | Risk | E[ | ||||||
| 1 | 1 | 280 | 168 | 0.6 | 210 | 0.75 | ||
| 1 | 0 | 720 | 540 | 0.75 | 432 | 0.6 | ||
| 0 | 1 | 180 | 45 | 0.25 | 60 | 0.333 | ||
| 0 | 0 | 60 | 20 | 0.333 | 15 | 0.25 | ||
| Total | 1,240 | 660 | 0.532 | 830 | 0.669 | |||
aUnobservable counterfactual distributions. Bold numbers are observed as Y = 1 (by consistency) in each stratum.
Hypothetical cohort data with potential outcomes under time-varying exposure
| Stratum | Potential outcomea | Observed outcome | |||||||||||
| Risk | Risk | Risk | Risk | E[ | |||||||||
| 1 | 1 | 1 | 720 | 648 | 0.9 | 648 | 0.9 | 432 | 0.6 | 576 | 0.8 | ||
| 1 | 1 | 0 | 180 | 162 | 0.9 | 162 | 0.9 | 144 | 0.8 | 108 | 0.6 | ||
| 1 | 0 | 1 | 1,800 | 1,080 | 0.6 | 990 | 0.55 | 900 | 0.5 | 720 | 0.4 | ||
| 1 | 0 | 0 | 1,800 | 1,080 | 0.6 | 990 | 0.55 | 720 | 0.4 | 900 | 0.5 | ||
| 0 | 1 | 1 | 5,670 | 5,103 | 0.9 | 2,835 | 0.5 | 3,402 | 0.6 | 4,536 | 0.8 | ||
| 0 | 1 | 0 | 630 | 504 | 0.8 | 315 | 0.5 | 378 | 0.6 | 567 | 0.9 | ||
| 0 | 0 | 1 | 840 | 252 | 0.3 | 462 | 0.55 | 252 | 0.3 | 294 | 0.35 | ||
| 0 | 0 | 0 | 3,360 | 1,176 | 0.35 | 1,848 | 0.55 | 1,008 | 0.3 | 1,008 | 0.3 | ||
| Total | 15,000 | 9,900 | 0.66 | 9,300 | 0.62 | 7,800 | 0.52 | 7,200 | 0.48 | ||||
aUnobservable counterfactual distributions. Bold numbers are observed as Y = 1 (by consistency) in each stratum.
Estimates of effects of time-varying exposure from hypothetical cohort data
| Risk | Risk | Risk | Risk | ||||||||||||
| 1 | 630 | 567 | 0.9 | 5,670 | 4,536 | 0.8 | 0.6 | 180 | 108 | 0.6 | 720 | 576 | 0.8 | 0.2 | 0.48 |
| 0 | 3,360 | 1,008 | 0.3 | 840 | 294 | 0.35 | 0.4 | 1,800 | 900 | 0.5 | 1,800 | 720 | 0.4 | 0.8 | 0.52 |
| Estimates of E[ | |||||||||||||||
| | 3,990 | 1,575 | 0.39 | 6,510 | 4,830 | 0.74 | 1,980 | 1,008 | 0.51 | 2,520 | 1,296 | 0.51 | |||
| Naïve standardizationc | 0.59 | 0.57 | 0.55 | 0.59 | |||||||||||
| G-formulad | 0.66 | 0.62 | 0.52 | 0.48 | |||||||||||
a(a1, a2) corresponds to the value of (A1, A2).
bCalculate E[Y|A1, A2] using N and Y = 1 data in the subgroup defined by (A1, A2).
cCalculate [Y|A1, A2, L2 = l2]p(l2), where data in “Risk” and “p(L2)” columns in each L2 = l2 (0 or 1) row are used for E[Y|A1, A2, L2 = l2] and p(l2), respectively.
dCalculate [Y|A1, A2, L2 = l2]p(l2|A1) as above, except for using probabilities in “p(L2|A2)” instead of “p(L2)” for the corresponding L2 and A2 values.
Figure 1. Causal DAGs and SWIGs compatible with example data, where U and W are unobserved variables: (a) causal DAG without W, in which A1–L2, A1–Y, and A2–Y are (conditionally) unconfounded given observed data; (b) causal DAG with W, in which A1–Y and A2–Y are (conditionally) unconfounded but A1–L2 is confounded given observed data; (c) a “template” under intervention (a1, a2) of SWIG that corresponds to causal DAG (a); (d) a “template” under intervention (a1, a2) of SWIG that corresponds to causal DAG (b).
Hypothetical cohort data weighted by inverse probability of exposures
| Unweighted number | IPW | Number multiplied by IPW | |||||||
| 1 | 1 | 1 | 720 | 576 | 0.3 | 0.8 | 4.17 | 3,000 | 2,400 |
| 1 | 1 | 0 | 180 | 108 | 0.3 | 0.2 | 16.67 | 3,000 | 1,800 |
| 1 | 0 | 1 | 1,800 | 720 | 0.3 | 0.5 | 6.67 | 12,000 | 4,800 |
| 1 | 0 | 0 | 1,800 | 900 | 0.3 | 0.5 | 6.67 | 12,000 | 6,000 |
| 0 | 1 | 1 | 5,670 | 4,536 | 0.7 | 0.9 | 1.59 | 9,000 | 7,200 |
| 0 | 1 | 0 | 630 | 567 | 0.7 | 0.1 | 14.29 | 9,000 | 8,100 |
| 0 | 0 | 1 | 840 | 294 | 0.7 | 0.2 | 7.14 | 6,000 | 2,100 |
| 0 | 0 | 0 | 3,360 | 1,008 | 0.7 | 0.8 | 1.79 | 6,000 | 1,800 |
IPW, inverse probability weight.
Inverse probability weighted estimates of marginal structural models from observed hypothetical cohort data (Table 3)
| MSM (3): Correct | MSM (4): Incorrect | MSM (5): Incorrect | MSM (6): Correct | |||||
| Estimatea | 95% CIb | Estimatea | 95% CIb | Estimatea | 95% CIb | Estimatea | 95% CIb | |
| Risk difference or ratio | ||||||||
| | −0.140 | −0.160, −0.120 | −0.090c | −0.104, −0.076 | 0.781 | 0.753, 0.810 | 0.788d | 0.746, 0.832 |
| | −0.040 | −0.060, −0.020 | −0.090c | −0.104, −0.076 | 0.932 | 0.900, 0.965 | 0.939e | 0.903, 0.978 |
| | — | — | — | 0.983f | 0.914, 1.057 | |||
| Potential outcome mean | ||||||||
| E[ | 0.660 | 0.643, 0.677 | 0.660 | 0.643, 0.677 | 0.663 | 0.645, 0.681 | 0.660 | 0.641, 0.680 |
| E[ | 0.620 | 0.605, 0.635 | 0.570 | 0.560, 0.580 | 0.618 | 0.602, 0.634 | 0.620 | 0.604, 0.637 |
| E[ | 0.520 | 0.501, 0.539 | 0.570 | 0.560, 0.580 | 0.518 | 0.499, 0.537 | 0.520 | 0.497, 0.544 |
| E[ | 0.480 | 0.463, 0.497 | 0.480 | 0.463, 0.497 | 0.483 | 0.466, 0.499 | 0.480 | 0.461, 0.500 |
CI, confidence interval; MSM, marginal structural model.
aRisk differences β (MSMs (3) and (4)) or risk ratios exp(β) (MSMs (5) and (6)) in the upper part.
bUsing sandwich estimator.
cCommon risk difference for A1 and A2.
dRisk ratio for A1 when controlling A2 at 0: E[Y1,0]/E[Y0,0].
eRisk ratio for A2 when controlling A1 at 0: E[Y0,1]/E[Y0,0].
fInteraction between A1 and A2 in risk ratio scale: (E[Y1,1]E[Y0,0])/(E[Y1,0]E[Y0,1]).
Inverse probability weighted estimates of marginal structural models using a misspecified exposure probability model
| MSM (3): Correct | MSM (4): Incorrect | MSM (5): Incorrect | MSM (6): Correct | |||||
| Estimatea | 95% CIb | Estimatea | 95% CIb | Estimatea | 95% CIb | Estimatea | 95% CIb | |
| Risk difference or ratio | ||||||||
| | −0.119 | −0.145, −0.092 | −0.081c | −0.095, −0.068 | 0.813 | 0.774, 0.855 | 0.886d | 0.819, 0.958 |
| | −0.045 | −0.073, −0.017 | −0.081c | −0.095, −0.068 | 0.924 | 0.880, 0.969 | 1.022e | 0.983, 1.063 |
| | — | — | — | 0.822f | 0.749, 0.902 | |||
| Potential outcome mean | ||||||||
| E[ | 0.655 | 0.635, 0.674 | 0.649 | 0.628, 0.671 | 0.658 | 0.638, 0.679 | 0.625 | 0.606, 0.644 |
| E[ | 0.610 | 0.593, 0.628 | 0.568 | 0.552, 0.584 | 0.608 | 0.590, 0.626 | 0.639 | 0.624, 0.654 |
| E[ | 0.536 | 0.503, 0.570 | 0.568 | 0.552, 0.584 | 0.535 | 0.502, 0.569 | 0.554 | 0.515, 0.595 |
| E[ | 0.491 | 0.472, 0.511 | 0.487 | 0.466, 0.507 | 0.494 | 0.475, 0.513 | 0.465 | 0.445, 0.486 |
CI, confidence interval; MSM, marginal structural model.
aRisk differences β (MSMs (3) and (4)) or risk ratios exp(β) (MSMs (5) and (6)) in the upper part.
bUsing sandwich estimator.
cCommon risk difference for A1 and A2.
dRisk ratio for A1 when controlling A2 at 0: E[Y1,0]/E[Y0,0].
eRisk ratio for A2 when controlling A1 at 0: E[Y0,1]/E[Y0,0].
fInteraction between A1 and A2 in risk ratio scale: (E[Y1,1]E[Y0,0])/(E[Y1,0]E[Y0,1]).
Key messages for clear understanding of marginal structural modeling
| • | Marginal structural models (MSMs) should be distinguished from inverse probability weighting |
| • | MSM shows prespecified assumptions on causal estimands, while an exposure probability model is an imposed restriction on observed distribution |
| • | As MSM and exposure probability model are used for different purposes, misspecification of these models would lead to biases in different ways |
| • | Model specifications of MSMs and exposure probability models raise different challenges in real data analysis |
| • | G-formula, which shares identifiability assumptions with inverse probability weighting, can be used to fit MSMs only when the models are saturated |