| Literature DB >> 32847911 |
Michelle Samuel1,2, Brice Batomen2, Julie Rouette2, Joanne Kim2, Robert W Platt2, James M Brophy1,2, Jay S Kaufman3.
Abstract
BACKGROUND: Propensity score (PS) methods are frequently used in cardiovascular clinical research. Previous evaluations revealed poor reporting of PS methods, however a comprehensive and current evaluation of PS use and reporting is lacking. The objectives of the present survey were to (1) evaluate the quality of PS methods in cardiovascular publications, (2) summarise PS methods and (3) propose key reporting elements for PS publications.Entities:
Keywords: cardiac epidemiology; cardiology; epidemiology
Mesh:
Year: 2020 PMID: 32847911 PMCID: PMC7451534 DOI: 10.1136/bmjopen-2020-036961
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Key terms
| Terms | Description |
| Average treatment effect | Average treatment effect of moving the entire population from untreated to treated, regardless of the treatment received. |
| Average treatment effect in the treated | Average effect among treated subjects. Treated sample becomes the reference group to which the treated and untreated subjects are being standardised. |
| Average treatment effect in the untreated | Average effect in subjects who were untreated. Untreated samples become the reference group to which the untreated and treated subjects are being standardised. |
| Conditional effects | Treatment effect at the individual-level and consists of moving individual subjects with the same covariate pattern from untreated to treated. |
| Marginal effects | Average treatment effect at the population level. |
| Variance ratio | Analytic toll to assess balance by comparing the variances of baseline characteristics between treatment groups. |
| Positivity assumption | Probability of subjects being assigned treatment, non-treatment or varying levels of treatment is greater than 0%. |
Figure 1Number of articles by propensity score method over time.
Figure 2Estimated effects inferred based on propensity score method. ATE, average treatment effect; ATT, average treatment effect in the treated; ATU, average treatment effect in the untreated.
Figure 3Distribution of measures to assess propensity score (PS) model success. *Articles using multiple measures to assess PS model success were included in multiple categories. SMD, standardised mean difference.
Figure 4Proportion of treated subjects with no match after propensity score matching.
Characteristics of studies using PS matching (N=210)
| Yes (n (%)) | Not reported (n (%)) | |
| Matches: | 8 (3.8) | |
| 1:1 | 178 (84.7) | |
| 1:2 | 8 (3.8) | |
| 1:3 | 5 (2.4) | |
| 1:4 | 5 (2.4) | |
| Other | 4 (1.9) | |
| Multiple matching ratios | 2 (1) | |
| Calliper scale (nearest neighbour matching with callipers): | 2 (1.7) | |
| PS | 52 (43.3) | |
| SD of Logit PS | 39 (32.4) | |
| SD of PS | 21 (17.5) | |
| Logit PS | 3 (3.3) | |
| Mahalanobis distance | 2 (1.7) | |
| Matches found without replacement | 79 (37.6) | 120 (57.1) |
| Matching algorithm: | 120 (57.1) | |
| Greedy | 80 (38.1) | |
| Optimal | 10 (4.8) | |
| Assessment of balance | ||
| Balance of covariates assessed by: | 17 (8.1) | |
| Hypothesis testing | 68 (32.4) | |
| SMDs | 57 (27.1) | |
| Both | 60 (28.6) | |
| Stated balance was assessed | 8 (3.8) | |
| Threshold for successful balance explicitly stated (eg, 10% SMDs, p>0.05) | 142 (67.6) | --- |
| SMDs threshold: | 39 (33.3) | |
| <10% | 71 (60.7) | |
| <20% | 3 (2.6) | |
| Other | 4 (3.4) |
PS, propensity score; SMD, standardised mean difference.
Figure 5Propensity score matching strategies. *One article used both nearest neighbour matching without callipers and stratification.
Summary of recommendations for reporting propensity score (PS) methods
| Elements to be reported | Methodological recommendations |
| Variable selection strategy for PS model | Potential confounders Select variables a priori Optional: strong predictors of the outcome |
| Balance diagnostics | Standardised mean differences (threshold <10%) Graphical representation of PS distribution* Optional |
Ratio for matches Matching strategy Number of subjects and balance diagnostics pre-match and post- match Variance estimation | 1:1 or 1:2 matching is sufficient Nearest-neighbour with callipers strongly preferred 0.2 SD of the logit of the PS Untreated subjects chosen with or without replacement Without replacement—untreated matches chosen with greedy or optimal matching Account for matched pairs in outcome model with clustering, stratification or regression Account for matching with replacement |
Application of weights Extraneous values Variance estimation | Throughout; otherwise, if heterogeneity in treatment effect expected, apply subgroup-specific weights Use stabilisation, trimming and truncation, if appropriate Non-parametric bootstrap method preferred |
Number of strata Size of strata Combine estimates | Five strata Equal-sized or unequal-sized strata Pool stratum-specific estimates using the proportion of subjects in each stratum |
Conditional standardised difference or quantile regression | |
ATE ATT ATU | Methods consistent with target population Describe the inclusion criteria and: Treatment effect in treated and untreated groups Treatment effect in the treated subgroup only Treatment effect in the untreated subgroup only |
*Kernel density plots, histograms, cumulative distribution functions, quantile–quantile plots, side-by-side box plots, etc.
ATE, Average treatment effect; ATT, average treatment effect in the treated; ATU, average treatment effect in the untreated.