| Literature DB >> 32242973 |
Laine E Thomas1, Siyun Yang1, Daniel Wojdyla2, Douglas E Schaubel3.
Abstract
Observational studies of treatment effects attempt to mimic a randomized experiment by balancing the covariate distribution in treated and control groups, thus removing biases related to measured confounders. Methods such as weighting, matching, and stratification, with or without a propensity score, are common in cross-sectional data. When treatments are initiated over longitudinal follow-up, a target pragmatic trial can be emulated using appropriate matching methods. The ideal experiment of interest is simple; patients would be enrolled sequentially, randomized to one or more treatments and followed subsequently. This tutorial defines a class of longitudinal matching methods that emulate this experiment and provides a review of existing variations, with guidance regarding study design, execution, and analysis. These principles are illustrated in application to the study of statins on cardiovascular outcomes in the Framingham Offspring cohort. We identify avenues for future research and highlight the relevance of this methodology to high-quality comparative effectiveness studies in the era of big data.Entities:
Keywords: longitudinal matching; new-user design; real-world evidence; time-dependent confounding; time-varying treatment
Mesh:
Year: 2020 PMID: 32242973 PMCID: PMC7384144 DOI: 10.1002/sim.8533
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Variations on longitudinal matching
| Name | Reference | Contribution |
|---|---|---|
|
| ||
| Balanced risk set matching3 | Li et al, 2001 | Original concept and theory |
| Haviland et al, 2007 | Group‐based trajectory models for longitudinal history | |
| Zubizarreta et al, 2014 | “Isolation” to reduce unmeasured confounding | |
| Propensity score matching with time‐dep. covariates3 | Lu, 2005 | Time‐dependent propensity score |
| Sequential stratification2 | Schaubel et al, 2006 | Strong theory and feasibility in big data |
| Schaubel et al, 2009 | Interaction with time‐dep. covariates, and IPCW for treatment switching | |
| Kennedy et al, 2010 | Methods comparison | |
| Taylor et al, 2014 | Simulation study and methods comparison | |
| Smith et al, 2015 | Use of a prognostic score matching and recurrent events | |
| Sequential Cox models1 | Gran et al, 2010 | Regression‐based approach with inverse probability of censoring weights for switching |
| Matching methods for … | Li et al, 2014 | Survival estimation and counterfactual theory |
| time‐dependent treatment | He et al (in press) | Prognostic score matching |
|
| ||
| Matched cohort design | Seeger et al, 2005 | Intuitive framework and worked example |
| Emulating a target trial1 | Hernan et al, 2008 | Target trial concept with compelling example |
| Danaei et al, 2013 | Worked example with clear rationale and detail | |
| Hernan et al, 2016 | Connection to big data | |
| Hernan et al, 2016 | Common failures in target trial emulation | |
| Incident user cohort design3 | Schneeweiss et al, 2010 | Design considerations in healthcare data |
| Balanced sequential cohort3 | Schneeweiss et al, 2011 | Review of challenges in early marketing |
| Sequential matched cohort3 | Gagne et al, 2012 | Semi‐automated safety monitoring |
| Inc. user cohort de.3 | Rassen et al, 2012 | Adds high‐dimensional propensity score |
| Rolling entry matching3 | Witman et al, 2019 | Similar to Lu (2005) with software |
| Jones et al, 2017 | Software for rolling entry matching | |
|
| ||
| None3 | Sianesi, 2004 | Counterfactual theory with applied focus |
| Dynamic treatment | Fredriksson et al, 2008 | Counterfactual theory and survival estimation |
| matching2,3 | Crepon et al, 2009 | Additional theory and example |
| Vikstrom, 2017 | Variable selection and new estimands | |
| Sequential causal models | Lechner, 2009 | Alternative approaches |
|
| ||
| None1,2 | Ray et al, 2002 | Intuitive approach, strong application with active control |
| Staggered cohort study | Blackburn et al, 2017 | Application in psychology with clear rationale |
Note: Superscripts 1,2, or 3 correspond to approaches in Section 5.2.1, 5.2.2, and 5.2.3, respectively
Figure 1Schema of enrollment across j=1,…,n longitudinal pseudo‐experiments. Time scale, s, represents time since first eligibility for treatment initiation. {j} represents the set of eligible patients at time s satisfying R (s )=1, , and S ≥s for i=1,…,n patients. Any patients with S =s are treated and the remainder with S >s are controls. Covariate history is available to the jth experiment. A, Generic scheme; variable follow‐up. B, Specific example; homogenous follow‐up
Key steps and decisions in longitudinal matching
| Section | |
|---|---|
| Design | 4 |
| Target trial specification helps to guide analytic decisions | 4.1 |
| Multiple time scales are important and the primary scale should be clinically meaningful | 4.2 |
| Use LM methods to facilitate a new‐user design. Use a wash‐out period as needed. | 4.3 |
| Select active vs inactive controls to align with the target trial | 4.4 |
| Eligibility can be required for entry into an experiment but is not relevant afterwards | 4.5 |
| Treatment switching; specify which are acceptable and which depart from the question of interest | 4.6 |
| Matching | 5 |
| Select confounders that best aggregate prognostic information | 5.1 |
| Longitudinal matching methods | 5.2 |
| ‐ Option 1: Match on time and eligibility, parametric regression models address confounding | 5.2.1 |
| ‐ Option 2: Exact matching on important strata, plus parametric regression models | 5.2.2 |
| ‐ Option 3: Matching on all relevant confounders, consider propensity and prognostic scores | 5.2.3 |
| Recent recommendations for matching with cross sectional data are relevant | 5.3 |
| Assessment of balance is important when relying on covariate matching (Option 3) | 5.4 |
| Analysis of outcome | 6 |
| LM methods are typically conditional on covariates, rather than marginal | 6.1 |
| Describe treatment initiation times, and subsequent follow‐up to inform generalizability | 6.2 |
| Censoring may introduce bias; consider weighting (IPCW) | 6.3 |
| Effect modification by time‐varying factors; facilitated by Options 1 and 2 | 6.4 |
| Variance estimation should account for correlation induced by repeated data | 6.5 |
Figure 2Follow‐up time scales and censoring dates. Time scale, s, represents time since first eligibility for enrollment. Censoring is purely administrative and occurs at a common time point, C =τ , for all patients. We are interested in studying outcomes after treatment initiation for a minimum of τ years. Treatment initiation times must be limited to s∈(0,τ). As before, {j} represents the set of eligible patients at time s satisfying R (s )=1, , and S ≥s for i=1,…,n patients. Any patients with S =s are treated and the remainder with S >s are controls in the jth experiment. A, Extended follow‐up. B, Restricted follow‐up
Figure 3Illustration of two hypothetical patients across three time scales (discrete for simplicity). Available data range across calendar time 2006‐2012. The relevant time scale for matching (s) is time since diagnosis. Follow‐up for outcomes (time scale t) begins after treatment (Tx) initiation. Censoring time with respect to s is C (s) and with respect to t is C (t)
Figure 4Balance check at examination 7. Absolute standardized mean differences are the difference in covariate means between two groups (treated vs untreated) divided by the standard deviation of the same covariate. Unadjusted: patients who initiate statins are compared to eligible controls at examination 7. Matched: after matching on a time‐dependent propensity score, patients who initiate statins and their matched controls at examination 7 are compared
Figure 5Hazar d ratios for the treatment effect of statins on cardiovascular outcomes over 15 years follow‐up
Figure 6Subgroup analysis of the ITT effect of statins on cardiovascular outcomes over 15 years follow‐up. Sequential stratification (dashed line) and Sequential Cox model (solid line)